WE RECEIVE A LED 404 UPON REBOOT OF A 590.

ITEM: RTA000041910



QUESTION:                                                                       
My customer recently put in an additional 128 MB of memory on a 590.            
When they rebooted, they received a 404 on the LED.  It hung on this            
values which I could not find in the Problem Solving Guide and Ref..            
They proceeded to reboot the system and everything was fine.  What does         
this undocumented LED code mean?  Also at about the same time has the           
reboot there several of the following errors in the error log.                  
ERROR LABEL:    DISK_ERR4                                                       
ERROR ID:       1581762B                                                        
                                                                                
Date/Time:       Mon Mar 28 11:11:55                                            
Sequence Number: 144837                                                         
Machine Id:      000011187000                                                   
Node Id:         000011187000                                                   
Class:           H                                                             
Type:            TEMP                                                           
Resource Name:   hdisk0                                                         
Resource Class:  disk                                                           
Resource Type:   1000mb                                                         
Location:                                                                       
VPD:                                                                            
                                                                                
Error Description                                                               
DISK OPERATION ERROR                                                            
                                                                                
ERROR LABEL:    SCSI_ERR2                                                       
ERROR ID:       C14C511C                                                        
Sequence Number: 144836                                                         
Machine Id:      000011187000                                                   
Node Id:         000011187000                                                  
Class:           H                                                              
Resource Name:   scsi0                                                          
Resource Class:  adapter                                                        
Resource Type:   hscsi                                                          
Location:        00-08                                                          
                                                                                
We have several of the SCSI_ERR2, but everything seems to be fine.              
The machine is up and running in Multiuser mode.  Should we be                  
concerned?                                                                      
                                                                                
---------- ---------- ---------- --------- ---------- ----------                
A:  404 LED / SCSI / DISK errors                                                
  It would appear you have two, possibly three, unrelated problems.             
                                                                                
  If the 404 occurred immediately on powering up, it means that a power        
  supply problem was present.  The chart on page 2-1520-2 of the 7013           
  POWERstation and POWERserver Installation and Service Guide                   
  (SA23-2622-07) details these conditions.                                      
                                                                                
  A 4 in the first position means that the fan connected to the P49C            
  connector failed.  The 0 in the second position does not represent a          
  power failure. The 4 in the third position means that there was a             
  power failure inside the power supply.  Since the machine came up             
  fine after it was rebooted, no actual hardware faults appears present.        
  Rather, it seems that the power to the RISC was interrupted or not            
  conditioned properly for a very brief period of time.                         
                                                                                
  If the 404 happened later during the IPL sequence (during the 100 or          
  200 series of IPL codes), then the answer is not so readily found.            
  If this is the case, please contact Hardware Services.                       
                                                                                
  The error codes you have included in your item might or might not be          
  related.  However, they are not serious.  The DISK_ERR4 is a bad block        
  error and is fully recoverable.  This error happens when AIX attempts         
  to write to a disk and finds a bad spot at the write destination.  AIX        
  marks this block as unusable and recovers by writing the data in an           
  alternate area reserved for such errors.  This situation is generally         
  no cause for alarm - about 95% of a disk is actually useable.  Bad            
  blocks account for the remaining 5 .  However, if you start noticing          
  an increasing frequency of error 158172B, this could mean that you are        
  getting low on disk space.                                                    
                                                                                
  The DISK_ERR4 message could also arise from SCSI bus cabling and/or           
  termination problems.  In this case, the DISK_ERR4 message is directly        
  tied to the SCSI_ERR2 (C14C511C) error - (both are SCSI errors).             
  These errors can result from improperly terminated SCSI buses.  They          
  can also arise from bad cables, loose connectors, and SCSI buses that         
  exceed the maximum supported bus lengths.                                     
                                                                                
  To resolve these errors, make sure that the proper terminator is on           
  the SCSI bus (for example, a SCSI-1 bus requires a SCSI-1 terminator.         
  In addition, a single-ended terminator will not work on a differential        
  bus - and differential terminator will not work on a single-ended             
  bus).                                                                         
                                                                                
  Also, make sure all cables are firmly connected and are within                
  supported SCSI bus lengths.  The supported SCSI bus lengths are:              
                                                                                
      SCSI-1 Single-ended - 6m                                                  
      SCSI-2 Single-ended - 3m                                                 
      SCSI-2 Differential - 19m                                                 
                                                                                
  You may also wish to verify that the cables themselves are                    
  functioning properly.  One way to do this is to locate a cable that           
  you are sure is not defective.  Then, systematically replace each             
  cable in the SCSI chain and see if you can duplicate the SCSI errors.         
  (Note: be sure to power down the system when plugging and unplugging          
  adapters and devices.  "Hot plugging" SCSI devices or cables is not           
supported and can easily damage both the SCSI adapters and devices).            
                                                                                
  Note: if you feel that the DISK_ERR4 errors are not related to the            
  above SCSI errors, you may wish to certify the disk drive.  This can          
  be done through the diagnostic Service Aids.  The disk certification          
  should report on whether or not the drive is in good condition.               
                                                                               
---------- ---------- ---------- --------- ---------- ----------                
                                                                                
                                                                                
This item was created from library item Q658314      CRFLR                      
                                                                                
Additional search words:                                                        
CRFLR HARDWARE IX LED MAY94 OZNEW REBOOT RECEIVE RECEIVED RISCSYSTEM            
RISCSYSU SYS SYSTEM UNIT UPON 404 590                                           
                                                                                
                                                                                
                                                                                
                                                                                
                                                                                
                                                                                
                                                                               


WWQA: ITEM: RTA000041910 ITEM: RTA000041910
Dated: 10/1996 Category: RISCSYSU
This HTML file was generated 99/06/24~12:43:16
Comments or suggestions? Contact us