|
Neohapsis is currently accepting applications for employment. For more information, please visit our website www.neohapsis.com or email hr@neohapsis.com |
From: AIX Service Mail Server (aixserv
austin.ibm.com)Date: Tue Apr 10 2001 - 02:17:18 CDT
APAR: IY11750 COMPID: 5765D5101 REL: 111
ABSTRACT: ADAPTER FAILURES OR SWAPS ON WRONG NODE - HACMP/ES
PROBLEM DESCRIPTION:
A customer with gigabit ethernet adapters, pulled the cable
from the service adapter on one of the two machines and the
result was a swap_adapter event on both machines rather than
just the one where he had pulled the cable.
PROBLEM SUMMARY:
This problem happens when gigabit ethernet adapters
are used in an HACMP/ES cluster.
When adapter events (like pulling the adapter cable)
happen in one node, secondary and unexpected adapter
swap events may happen in another node.
The problem is more likely to occur in a 2-node HACMP/ES
cluster. If a standby adapter is brought down in one
of the nodes, the standby adapter in the other node
forms a "singleton adapter group", at the Topology
Services subsystem's level. This forces the subsystem
to use code that reads the adapter incoming packet counter
to determine whether the adapter is alive.
With gigabit ethernet adapters, the incoming packet
counter is not incremented in the same way as with
other kinds of adapters. As a result, the adapter is
sometimes considered dead (because the counter is sometimes
not incremented). Consequently, unexpected adapter swap
and "failed standby" events may be run on the node.
PROBLEM CONCLUSION:
The fix to the problem is a change to code in the
/usr/sbin/rsct/bin/netmonAdapterHealth program.
With the change, a longer interval is given between each
attempt at verifying whether the incoming packet
counter is being incremented. With the fix, the
chance that the counter will be incremented for a
working adapter will be close to 100%.
------
APAR: IY14068 COMPID: 576554802 REL: 110
ABSTRACT: ADD SUPPORT TO OPEN, CLOSE, READ, AND WRITE LINE SEQUENTIAL
PROBLEM DESCRIPTION:
AIX 4.2 introduced support for native files larger than 2GB. IBM
COBOL Set for AIX was developed on AIX 4.1.3 and doesn't support
the larger files. Customers have requested support. This APAR
adds support to open, close, read, and write line sequential
This support is only for ORGANIZATION IS LINE SEQUENTIAL.
LOCAL FIX:
Split large files into files smaller than 2GB and process the
smaller files.
TEMPORARY FIX:
Install, do not commit, Schwab.bff from
machine platypus.stl.ibm.com That machine is internal to IBM
developers. Customers need to create a PMR (or ETR), explain why
they wish an early shipment, understand that this early fix is
not fully tested, then queue that PMR to queue LNGWS,114 within
the U.S.A. or to WLNGWS,136 outside of the USA.
------
APAR: IY14763 COMPID: 5765D5100 REL: 311
ABSTRACT: THE CORE DUMP IS DUE TO A CODE BUG IN THE RECOVERY LOGIC.
PROBLEM DESCRIPTION:
== CORE_DUMP fault_service_Worm_RTG_SP as SIG# 11==
fault_service_Worm_RTG_SP caused segmentation vioration and
switch of Eprimary, Ebackup and some nodes were downed.
The worm caused the error while recovering the following error.
-- fs_daemon_print.file --
// print_the_time: Time = Sun Nov 5 04:25:44 2000
fs_daemon_fsm_main: got request, type = service-msg-received
fs_daemon_fsm_main: packet Service Command (SC) = fa
displayPacket: Packet type = TBS_SVC_CMD_ERROR_STATUS: //
PROBLEM SUMMARY:
A segment violation and core dump can occur during switch
recovery if the reinitNeighbors() function is invoked.
reinitNeighbors() is being called with the wrong
parameter. The function is called with the device id
(an uninitialized integer value), whereas it expects the
pointer to the device structure.
PROBLEM CONCLUSION:
Switch recovery code was changed to call
reinitNeighbors() with the pointer to the
device structure.
------
APAR: IY15401 COMPID: 5765D5101 REL: 111
ABSTRACT: HAGSD CORE DUMP HAS CORRUPTED STACK TRACE
PROBLEM DESCRIPTION:
Segmentation fault in SProviderMessgae.C causes corrupted stack
trace and hagsd to core.
PROBLEM SUMMARY:
The problem was due to a bug in a code
which should validate an array element
before using it since it may be reserved.
PROBLEM CONCLUSION:
After the fix of the problem to validate
the array element, the GSdaemon should not
core dumped.
------
APAR: IY15642 COMPID: 5765D5100 REL: 311
ABSTRACT: SP EXCLUSIVE ACCOUNTING CREATES NEGATIVE NUMBERS IN THE
PROBLEM DESCRIPTION:
when using sp exclusive accounting - the /var/adm/fee files
some times has negative numbers - which makes no sense.
This is caused whenan acctg files AIX fstat in acctexcl.c
returns an error which can not be duplicated. A suggested
solution is for us to handle the fstat error in acctexcl.c
PROBLEM SUMMARY:
When customers use excluse_node accounting,
there are instances when the /var/adm/fee
file has a job charge for a userid, that
is negative.
For example:
4077 jet 0 0 0 0 0 0 0 0 0 0 0 0 -961019740.000000 0 0 0
4077 jet 0 0 0 0 0 0 0 0 0 0 0 0 961020006.000000 0 0 0
This only occurs when:
a job is running using exclusive node accounting
AND
nrunacct (SP accounting) program is run on the node
before the user's job is finished.
We attempt to calculate how much time userid's
have had excluse node usage thus far, even though
their job has not completed yet.
Acctexcl is called from nrunacct and tries to
access the input file statistical buffer (stat_buf).
However, were calling the function to access this
structure, fstat, with incorrect parameters, resulting
in 0 values in the stat_buf fields. Acctjob (also run
from nrunacct) uses this value, which is 0, to calculate
the total job charge fee for each user. Since we started
with 0, the algorithm returns a negative number, which
is incorrect. The code was changed to point to the
correct address for the stat_buf structure.
PROBLEM CONCLUSION:
nrunacct (SP accounting) program calls
executable acctexcl when SP exclusive
accounting is set to true.
If a job is still running when nrunacct
is run, we calculate how much time so far,
userid's have had exclusive node usage.
acctexcl is called from /usr/lpp/ssp/bin
nrunacct to keep track of the start and
end times of a job, per user.
In acctexcl we were using fstat to get
the address of the file statistical buffer
(stat_buf), passing it a file descriptor
pointer and the address for stat_buf.
Within the structure stat_buf, we were
using the field stat_buf.st_mtime, which
is the last data modification for a
specific userid. This field was always
0. This 0 value was then passed to
acctjob, which is also run in nrunacct.
Acctjob calculates the fee that should be
charged a userid for exclusive node usage.
This algorithm is based on subtracting
a value from the stat_buf.st_mtime value,
resulting in a negative number.
The code was changed to point correctly to
the real address of the stat_buf structure,
in which st_mtime value is correct.
OLD:
--> fstat(fd_acct, &stat_buf);
stb.ac_btime = stat_buf.st_mtime;
NEW:
--> if (stat(argv 1 , &stat_buf) == 0) {
stb.ac_btime = stat_buf.st_mtime;
------
APAR: IY15763 COMPID: 5765B8100 REL: 220
ABSTRACT: DIRECTTALK SUPPORT FOR SCI INTERFACE OF CISCO ICM V4.1
PROBLEM DESCRIPTION:
This APAR enables DirectTalk to use the Service Control Interfac
e of Cisco ICM V4.1.
PROBLEM SUMMARY:
This APAR enables DirectTalk to use the Service
Control Interface of Cisco ICM V4.1
PROBLEM CONCLUSION:
Support added to allow use of Cisco ICM
V4.1
------
APAR: IY16041 COMPID: 5765D5100 REL: 311
ABSTRACT: NEED TO ENSURE TUNING.CUST IS ALWAYS RUN WHEN /ETC/RC.NET IS RUN
PROBLEM DESCRIPTION:
On a SP node, the network options are set by running
/tftpboot/tuning.cust. This script is called by /etc/rc.sp
when the node is booted. This overides any changes in
/etc/rc.net since rc.sp runs after rc.net. If a cfgmgr is
run, it will call /etc/rc.net. This means the "no" values
set in rc.net will now overide what was in tuning.cust.
To ensure tuning.cust is always dominant over rc.net on a
SP node, we need to make sure tuning.cust is always run
whenever rc.net is run.
LOCAL FIX:
If cfgmgr is run on, also run /tftpboot/tuning.cust if it
exists.
PROBLEM SUMMARY:
On an SP node, the network options are set by running
/tftpboot/tuning.cust. This script is called by /etc/rc.sp
when the node is booted. This overides any changes in
/etc/rc.net since rc.sp runs after rc.net. If cfgmgr is
run, it will call /etc/rc.net. This means the "no" values
set in rc.net will now overide what was in tuning.cust.
PROBLEM CONCLUSION:
/etc/rc.sp has been modified to add code to /etc/rc.net.
If /etc/rc.net is invoked at a time other then when the
node is rebooted, (determined by the fact that
/usr/sbin/srcmstr is not an active process),
it will now call /tftpboot/tuning.cust if it exists.
------
APAR: IY16125 COMPID: 5765D5100 REL: 311
ABSTRACT: ADD -P FLAG TO PTPECTRL USAGE STATEMENT
PROBLEM DESCRIPTION:
A new flag (-p Set Poll Count) was added to the ptpectrl command
While the -p flag is documneted in the ptpectrl man page, it
does not appear in the usage statement that is displayed when
you run ptepctrl -h. Customers would not know this feature is
available to them.
PROBLEM SUMMARY:
The -p flag (Set Poll Count) is documented in the ptpectrl
man page, but does not appear in the usage statement that
is displayed when you run ptpectrl -h.
PROBLEM CONCLUSION:
The -p flag (set poll count) will be displayed when you
run "ptpectrl -h", or when you enter an invalid flag,
for example "ptpectrl -a -x".
------
APAR: IY16179 COMPID: 5765D5100 REL: 311
ABSTRACT: SMITTY AND PRESPECTIVES DO NOT SUPPORT PARTITION SIZE MORE THAN
PROBLEM DESCRIPTION:
smitty and prespectives do not support partition size more than
256 MB when user tries to create VSD.
PROBLEM SUMMARY:
The createvsd command supports physical partition sizes of
512 and 1024 mb, but the smit interface was never updated
to accept these values. Perspectives, which invokes smit
config_data has the same problem.
PROBLEM CONCLUSION:
The smit panel "Create a Virtual Shared Disk" has been
expanded to allow physical partition sizes of 512 and 1024
megabytes.
------
APAR: IY16249 COMPID: 5765D5100 REL: 311
ABSTRACT: SPDELFRAM FAILS TO DELETE INFORMATION
PROBLEM DESCRIPTION:
spdelfram fails to delete frame information because spdelnode
passes a non-zero return code.
LOCAL FIX:
The work around for this problem is to issue spdelfram a second
time.
PROBLEM SUMMARY:
When issuing spdelfram to delete a frame that has just been
created, spdelnode fails because it is trying to remove
node information that has not been entered yet.
Because the nodes have not yet been configured, spdelnode
issues error messages when it attempts to delete the NIM
resources for the node as well the Adapter class
information for the node in the SDR. As a result, the
spdelfram command fails.
PROBLEM CONCLUSION:
spdelnode was modified to verify that a node has NIM
resources prior to attempting to remove them. It will also
verify that Adapter information exists in the SDR for a
node prior to attempting to delete it. As a result,
spdelnode will no longer issue error message when invoked
for a node that has not been configured. This will allow
spdelfram to delete a frame that has just been created.
------
APAR: IY16289 COMPID: 5765B9501 REL: 310
ABSTRACT: MNMSG DEADLOCK
PROBLEM DESCRIPTION:
mnMsg deadlock - known problem in 1.3. Parts of 298678 and
319508 from 1.3 may need to be retrofitted to 1.2.
LOCAL FIX:
The deadlock can be broken by recyclong the node(s) that show
hits on the following:
mmfsadm dump waiters | egrep 'Msg handler mnMsg.*change_lock_sha
rm waiting to set lx-lock-held flag'
PROBLEM SUMMARY:
Deadlock between two applications
using the fsync call against the same file
PROBLEM CONCLUSION:
Correct locking within the fsync call
------
APAR: IY16305 COMPID: 5765D5100 REL: 311
ABSTRACT: DIAG -C -D CSS0 BRINGS UP PROBLEM DETERMINATION SCREEN
PROBLEM DESCRIPTION:
executing diag -c -d css0 should not take the user to ELA screen
s, it appears css0 diagnostic method does not use the -c flag
anymore.
I think we want to check DA_CONSOLE_TRUE before calling ela_run
damode bits runing diags -c:
DA_CONSOLE_FALSE 0x00080000
da mode bits when running diags without the -c flag:
DA_CONSOLE_TRUE 0x00040000
PROBLEM SUMMARY:
When running diags -c -d css0, the diag method was calling
diagrpt without first checking for a console causing the
user to be prompted.
PROBLEM CONCLUSION:
The diag method first checks for a console before running
diagrpt.
------
APAR: IY16490 COMPID: 5765B9501 REL: 310
ABSTRACT: GPFS HANG ON META-NODE
PROBLEM DESCRIPTION:
gpfs hang on meta-node
PROBLEM CONCLUSION:
When getting the mnode token, use the LO_NOLX flag to avoid
deadlock when the current lkobj status is invalid.
Also, do not init the mnode lkobj if it has already been
initialized
by some other thread.
------
APAR: IY16543 COMPID: 5765B9501 REL: 310
ABSTRACT: FILE SYSTEM CANNOT BE REBALANCED
PROBLEM DESCRIPTION:
file system cannot be rebalanced
PROBLEM CONCLUSION:
ReuseBitmap caused bad endMask to be generated for last word
of the map
when the number of bits was exactly a multiple of 32.
Since the disk allocation code used this function
extensively, if there was
a multiple of 32 disks in the filesystem strange things
happened.
------
APAR: IY16559 COMPID: 5765B9501 REL: 310
ABSTRACT: FIX FOR IY16289 INCOMPLETE
PROBLEM DESCRIPTION:
IY16289 did not account for some other Release 3 design changes,
so the message handler thread needed to reacquire the lock with
a stronger mode to delete the invalid object. The thread
deadlocked with itself since it already held a RO lock, then
asked for an LX lock.
PROBLEM SUMMARY:
Follow-on to IY16289
------
APAR: IY16710 COMPID: 5765D5100 REL: 311
ABSTRACT: VSD/RVSD ENHANCEMENTS
PROBLEM DESCRIPTION:
VSD/RVSD support for the Subsystem
Device Driver (SDD) for the Enterprise Storage Server (ESS)
PROBLEM SUMMARY:
VSD/RVSD enhancements
PROBLEM CONCLUSION:
VSD/RVSD enhancements
------
APAR: IY16789 COMPID: 5765D6100 REL: 210
ABSTRACT: CRXLF90 GIVES: LD: 0711-317 ERROR: UNDEFINED SYMBOL: AIO_READ64
PROBLEM DESCRIPTION:
crxlf90 gives:
ld: 0711-317 ERROR: Undefined symbol: aio_read64
ld: 0711-317 ERROR: Undefined symbol: aio_write64
ld: 0711-317 ERROR: Undefined symbol: aio_suspend64
ld: 0711-317 ERROR: Undefined symbol: aio_cancel64
ld: 0711-317 ERROR: Undefined symbol: .einfo
LOCAL FIX:
at line 88 of the script crxlf90
add -l xlf90 before the \
PROBLEM SUMMARY:
The crxlf90 compilation script is missing a reference
to the xlf90 library.
PROBLEM CONCLUSION:
-lxlf90 can be added to the script so that customers do
not have to include this change as part of the
customization required for this script to run.
------
APAR: IY16980 COMPID: 5765B9501 REL: 310
ABSTRACT: DEADLOCK IN RENAME ON MULTIPLE NODES
PROBLEM DESCRIPTION:
Deadlock in rename on multiple nodes
PROBLEM SUMMARY:
GPFS deadlock when running multiple renames.
PROBLEM CONCLUSION:
GPFS locking error is rename
------
APAR: IY17356 COMPID: 5765C3403 REL: 433
ABSTRACT: AIX 4330-08 RECOMMENDED MAINTENANCE LEVEL
PROBLEM DESCRIPTION:
This APAR delivers the 4330-08 Recommended Maintenance Level for
AIX 4.3.3. This package applies to AIX 4.3.3 only, and should
be ordered specifying a service level of 4330 or higher.
This is a packaging APAR only. It will not appear in the list
of APARs on the SMIT "Update Software by Fix (APAR)" panel, nor
will the 'instfix' command show this APAR as being installed
after the updates delivered by this package are installed.
To install all updates from this package that apply to installed
filesets on your system, use the command:
smit update_all
Installing all updates is recommended. However, you can install
selected updates from this package using the command:
smit update_by_fix
Once installed, you can determine if all fileset updates for the
4330-08 level are installed using the following command. The
'oslevel' command will still indicate AIX 4.3.3.0.
instfix -ik 4330-08_AIX_ML
If the output indicates that not all filesets are found, you
can determine which filesets are not updated using the command:
instfix -ciqk 4330-08_AIX_ML | grep ":-:"
PROBLEM SUMMARY:
Packaging only.
------
APAR: IY17479 COMPID: 5765D5100 REL: 311
ABSTRACT: KERBEROS 4 SECURITY PROBLEM
PROBLEM DESCRIPTION:
kerberos 4 security problem
PROBLEM SUMMARY:
Security Issue.
------
APAR: IY17492 COMPID: 5765B9501 REL: 310
ABSTRACT: IN-MEMORY BUFFER ACCOUNTING INCORRECT FOR EXPANDED FRAGMENT.
PROBLEM DESCRIPTION:
In-memory buffer accounting incorrect for expanded fragment.
PROBLEM SUMMARY:
GPFS self check logic stopped GPFS because of incorrect
accounting for end of file in certain situations where the
buffer containing the file is stolen
PROBLEM CONCLUSION:
Correct buffer management logic
------
APAR: IY17496 COMPID: 5765D5101 REL: 111
ABSTRACT: KERBEROS 4 SECURITY PROBLEM
PROBLEM DESCRIPTION:
kerberos 4 security problem
PROBLEM SUMMARY:
Security Issue.
------
APAR: IY17741 COMPID: 5765C3403 REL: 430
ABSTRACT: SRN 66E-301 TESTING DVD IN GOOD MACHINE PATH
PROBLEM DESCRIPTION:
diagnostics returns SRN (Unit Attention) on Good Machine Path
PROBLEM CONCLUSION:
small code modification
------
APAR: IY17742 COMPID: 5765C3403 REL: 430
ABSTRACT: 803 SRN WITH DEVICE BUSY: DOESN'T RECOGNIZE INCORRECT MEDIA
PROBLEM DESCRIPTION:
Confusion on Diagnostic Selection Menu for DVD-RAM Drive.
PROBLEM CONCLUSION:
Small code changes
------
APAR: IY17748 COMPID: 5765C3407 REL: 210
ABSTRACT: V2R1 PTF-12
PROBLEM DESCRIPTION:
Added filesets for AIX V2R1 NSM PTF12
------
APAR: IY17749 COMPID: 5765C3407 REL: 210
ABSTRACT: V2R1 PTF-12 DUMMY
PROBLEM DESCRIPTION:
Added filesets for AIX V2R1 NSM PTF12
------
APAR: IY17750 COMPID: 576554801 REL: 110
ABSTRACT: ADD SUPPORT TO OPEN, CLOSE, READ, AND WRITE LINE SEQUENTIAL
PROBLEM DESCRIPTION:
AIX 4.2 introduced support for native files larger than 2GB. IBM
COBOL Set for AIX was developed on AIX 4.1.3 and doesn't support
the larger files. Customers have requested support. This APAR
adds support to open, close, read, and write line sequential
This support is only for ORGANIZATION IS LINE SEQUENTIAL.
LOCAL FIX:
Split large files into files smaller than 2GB and process the
smaller files.
TEMPORARY FIX:
Install, do not commit, Schwab.bff from
machine platypus.stl.ibm.com That machine is internal to IBM
developers. Customers need to create a PMR (or ETR), explain why
they wish an early shipment, understand that this early fix is
not fully tested, then queue that PMR to queue LNGWS,114 within
the U.S.A. or to WLNGWS,136 outside of the USA.
------
APAR: IY17785 COMPID: 5765C3403 REL: 430
ABSTRACT: FC BOOT INSTALLS WITH/WITHOUT EMC SYMMETRIX FILESET
PROBLEM DESCRIPTION:
When booting from FC Symmetrix, adding EMC software after
installing system may make system unbootable.
PROBLEM CONCLUSION:
Fix issue with migrating from generic OS FC disk to
more specific FC disk type.
------
APAR: IY17870 COMPID: 5765D5100 REL: 311
ABSTRACT: LATEST PSSP 3.1.1 FIXES AS OF MARCH 2001
PROBLEM DESCRIPTION:
This is the lastest PSSP ptf as of March 2001.
Order this apar to get all of the ptfs as of March 2001.
PROBLEM SUMMARY:
This is a packaging apar for PSSP 3.1.1 fixes
as of March 2001.
PROBLEM CONCLUSION:
This is a packaging apar for PSSP 3.1.1
fixes as of March 2001.
------
- Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]