System hang management allows users to run mission critical applications continuouly while improving application availablity. System hang detection alerts the system administrator of possible problems and then allows the administrator to log in as root or to reboot the system to resolve the problem.
The shconf command is invoked when System Hang Detection is enabled. shconf configures which events are surveyed and what actions are to be taken if such events occur.
The user can specify the five actions described below and can specify the priority level to check, the time out while no process or thread executes at a lower or equal priority, the terminal device for the warning action and the getty action:
For the Launch a command and Give a special getty
options, SHD will launch the special getty or the specified command
at the highest priority. The special getty will print a
warning message specifying that it is a recovering getty running at
priority 0. The following table lists the default values when the SHD
is enabled. Only one action is enabled per type of detection.
Option | Enablement | Priority | Timeout (seconds) |
---|---|---|---|
Log an error in errlog file | disabled | 60 | 120 |
Display a warning message | disabled | 60 | 120 |
Give a recovering getty | enabled | 60 | 120 |
Launch a command | disabled | 60 | 120 |
Reboot the system | disabled | 39 | 300 |
Note : When Launch a recovering getty on a console is enabled, the shconf script adds the -u flag to the getty line in the inittab that is associated with the console login.
shdaemon is a process launched by init. It is in charge of handling the detection of system hang. It retrieves configuration information, initiates working structures, and starts detection times set in by the user.
shdaemon runs at priority 0 (zero).
You can manage the SHD configuration from the SMIT System Environments menu. From the System Environments menu, select Change / Show Characteristics of Operating System, then System Hang Detection. The options in this menu allow system administrators to enable or disable the detection mechanism.
The Manage System Hang Detection menu contains the following items:
The System Hang Detection Status menu displays the current state (enable or disable) of the SHD feature. The only change that can be made from this option is either to enable or disable system hang detection.
The Change / Show Current Configuration for Priority Problem Detection menu displays the current time-out and the process priority for each action:
Log an Error in the Error Log [disable] Detection Time-out [120] Process Priority [60] Display a warning message on a console [disable] Detection Time-out [120] Process Priority [60] Terminal Device [console] Launch a recovering getty on a console [enable] Detection Time-out [120] Process Priority [60] Terminal Device [console] Launch a command [disable] Detection Time-out [120] Process Priority [60] Script [ ] Automatically REBOOT system after Detection [disable] Detection Time-out [300] Process Priority [39]