Diagnosis Guide
The following information needs to be collected from the node that presents
the problem. For connectivity-related problems, the same information is
needed from the other nodes. If collecting data from all the nodes is
not feasible, data should be collected from at least the following
nodes:
- The node's Downstream Neighbor on all networks. This is the
node whose IP address is immediately lower than the address of the node where
the problem was seen. The node with the lowest IP address has a
Downstream Neighbor of the node with the highest IP address.
-
The Group Leader node, which is the node with the highest IP address in the
network.
- The control workstation
Collect the files listed in number 1, 3d, and 3c (bullets 2 and 3).
Then, issue phoenix.snap to collect the remaining
information. See phoenix.snap dump.
-
FFDC dump files: /var/adm/ffdc/dumps/*
- Topology Services Files
- Service and User log files for the daemon.
-
On PSSP,
/var/ha/log/hats.dd.hhmmss.partition_name*.
-
On HACMP,
/var/ha/log/topsvcs.dd.hhmmss.cluster_name*.
Here, dd is the day of the month when the daemon was started, and
hhmmss is the time when the daemon was started.
- Startup Script log.
- On PSSP,
/var/ha/log/hats.partition_name*.
- On HACMP, /var/ha/log/topsvcs.default*.
-
Entire contents of the daemon run directory.
- On PSSP,
/var/ha/run/hats.partition_name/*.
- On HACMP,
/var/ha/run/topsvcs.cluster_name/*.
This includes the machines.lst file and temporary files used
by the startup script. It also includes the log files for the
/usr/sbin/rsct/bin/netmonAdapterHealth program, which is used to
determine adapter status. The daemon's core files are also stored
here.
- Output of the lssrc -ls command.
- On PSSP nodes, issue the command: lssrc -ls hats.
- On the PSSP control workstation, issue the command: lssrc -ls
hats.partition_name.
- On HACMP, issue the command: lssrc -ls topsvcs.
- Information used by the startup scripts:
- On any PSSP node, issue these commands and record the output:
- SDRGetObjects -x SP cw_ipaddrs
- SDRGetObjects -x TS_Config Frequency Sensitivity Run_FixPri
FixPri_Value Log_Length Pinning
- SDRGetObjects -x Subnet
- SDRGetObjects -x Network
- SDRGetObjects -x Adapter
- SDRGetObjects -G -x Adapter
- SDRRetrieveFile hats.machines.lst
local_file_name
- SDRGetObjects host_responds
-
/usr/lpp/ssp/bin/lsauthpts -c
-
/usr/bin/dcecp -c keytab show rsct/partition_name/hats
-member
-
/usr/lpp/ssp/bin/splstdata -p
- On HACMP nodes, issue these commands and record the output:
-
odmget HACMPnim
- odmget HACMPtopsvcs
-
/usr/es/sbin/cluster/utilities/clhandle -ac
- /usr/es/sbin/cluster/utilities/clhandle -c
-
/usr/es/sbin/cluster/utilities/cllsif -Sc
-
/usr/es/sbin/cluster/utilities/cllsclstr -Sc
-
/usr/es/sbin/cluster/utilities/clmixver (output and return value)
- System Data
- Installation data
-
Output of the command: lslpp -L
- Contents of files:
-
/usr/sbin/rsct/optlevel.rsct.basic.rte
-
/usr/sbin/rsct/optlevel.rsct.basic.hacmp
-
/usr/sbin/rsct/optlevel.rsct.basic.sp
-
The authentication method in use. Issue this command on the control
workstation:
splstdata -p
The entry "ts_auth_methods" lists the active authentication methods in use
for the SP trusted services (such as Topology Services).
- Network and adapter data
- Output of commands:
-
netstat -in
-
ifconfig (all interfaces)
- netstat -s
- netstat -m
- netstat -rn
- netstat -D
-
netstat en* (for example, en0, en1 and so forth)
-
If the problem is still occurring, output of ping to addresses in the
machines.lst file, located in the daemon run directory.
-
If the problem is still occurring, output of the iptrace
command. Follow this sequence:
- iptrace /tmp/iptrace.out
- ps -ef | grep iptrace
- Wait one minute, and then issue: kill pid
-
Save a copy of the file /tmp/iptrace.out. This is a
binary file.
-
Memory - output of the commands:
- vmstat 5 5
- vmstat -s
- vmtune
The full path name is: /usr/samples/kernel.
[ Top of Page | Previous Page | Next Page | Table of Contents | Index ]