System Management Interrupt (SMI) Handler - (NF 8500R - Type 8681)


System Management Interrupt (SMI) Handler

System Management Interrupt (SMI) Handler is the means of detecting system errors and logging error messages into the system error log.
It is installed during the POST sequence at check point A9, and is functional thereafter.
SMI Handler detects errors that are generated by system hardware such as CPU, memory and PCI devices.
Any SMI-logged error message has "SMI Hdlr" in the SOURCE field as part of the system error message entry.
Error messages are logged at different levels as system errors are detected.
A single system failure could be the combination of errors, and it will cause multiple unique error messages to be logged in the error log.
For example, a single PCI device failure will cause multiple PCI errors, and it will also cause multiple errors at the PCI Bridge level.
Each of those errors will create an entry in the system error log.
It is very important to retrieve all the SMI error messages (SOURCE = SMI Hdlr), the data in the ERROR CODE and ERROR DATA fields for each message, and the sequence in which the error messages were posted to the system error log.

The following table describes SMI error messages, along with possible failing FRUs or appropriate action to be taken.

SMI Error Message    FRU/Action
   
   Memory UNC ECC    DIMM yy, port A
   Error on port A, DIMM yy  
   Memory UNC ECC    DIMM yy, port B
   Error on port B, DIMM yy  
   Memory SBC ECC    DIMM yy, port A
   Error on port A, DIMM yy  
   Memory SBC ECC    DIMM yy, port B
   Error on port B, DIMM yy  
   UNC on P6    Suspect FRUs in the following order:
   Processor Bus A 1. Processors on bus A
  2. Processor daughter board A
  3. Processor controller board
   UNC on P6    Suspect FRUs in the following order:
   Processor Bus B 1. Processors on bus B
  2. Processor daughter board B
  3. Processor controller board
   Error on processor An    Suspect FRUs in the following order:
  1. Run diagnostics on the processors
  2. Processor An
  3. Processor daughter board A
  4. Processor controller board
   Error on Processor Bn    Suspect FRUs in the following order:
  1. Processor Bn
  2. Run diagnostics on the processors
  3. Processor daughter board B
  4. Processor controller board
   P6 I/O bus UNC error    If message "UNC Error on PCI bus slot x" is also displayed,
     follow the FRU/Action for that message first, then suspect
     FRUs in the following order:
  1. I/O board
  2. Processor controller board
   UNC Error on PCI bus M    Suspect FRUs in the following order:
  1. Any PCI adapters on bus M (including I/O Function card or
      System Management Adapter for bus A)
  2. System management adapter cable
  3. I/O board
   P6 I/O Bus Hard Failure    Reinstall the device driver for the last PCI adapter installed,
   Response (HFR)    then suspect FRUs in the following order:
  1. The last installed PCI adapter
  2. Any other PCI adapter
  3. System Management Adapter
  4. I/O Function card
  5. I/O board
   UNC Error on PCI bus slot x    Suspect FRUs in the following order:
  1. PCI adapter in slot x
  2. I/O board
  3. Another adapter on the same PCI bus Address Parity
   Error (APE) on PCI bus M    Suspect FRUs in the following order:
  1. Any PCI adapters on bus M (including I/O function card
      or System Management Adapter for bus A)
  2. System management adapter cable
  3. I/O board
   SBC error on P6 Process Bus A    Suspect FRUs in the following order:
  1. Run diagnostics on the processors
  2. Any processor on bus A
  3. Processor daughter board A
  4. Processor controller board
   SBC error on P6 Process Bus B    Suspect FRUs in the following order:
  1. Run diagnostics on the processors
  2. Any processor on bus B
  3. Processor daughter board B
  4. Processor controller board
   SBC error on P6 I/O bus    Suspect FRUs in the following order:
  1. I/O board
  2. Processor controller board
   Uncorrectable MCA error    Suspect any processor on bus A and B
  1. Run diagnostics on the processors
   Correctable MCA error    Suspect any processor on bus A and B
  1. Run diagnostics on the processors
   SBC Memory error has occurred    Suspect any DIMM on memory bus A and B
  1. Run diagnostics on the memory


Please see the LEGAL  -  Trademark notice.
Feel free - send a Email-NOTE  for any BUG on this page found - Thank you.