9037-002 Maintenance Introduction.

Bernard TANGUY
Dept. 1590
TANGUYB at IBMFR
Tel. (0)4-67-34-44-90
S/390 P.E. Montpellier
IBM FRANCE

22 September 2000.

Document Number 9037-002

September 2000.

Last version updates: see abstracts below.

Documentation issued by: P.E. MOP Dpt 1590.


Preface

This document is an introduction to the 9037 Model 2 Sysplex Timer maintenance.

Its intent is to complement the course and to give hands-on guidance on installation, maintenance, documentation and education.

It will be updated on the basis of our shared experience.

Any comments or suggestions are welcome.

Do not copy this document; request the latest version from the MOP PE web server (http://moppe.mop.ibm.com) or from VM (contact EPSG or myself).

Last version updates:


Previous version updates:


Chapter 1. 9037 Model 2 Documentation - Education


1.1 Customer Documentation


1.2 C.E. Documentation


1.3 Tools

Fiber optic wrap plugs (CLO and CPC ports).

Fiber optic protective plug (must be fitted whenever a fiber is not connected, including when a card is returned for repair).

Service key.

Torque tool for logic cards (unique torque specifications).

CBT Course (WIN/xx Sysplex Timer training course: 3 diskettes). This 2-hour Computer Based Training course runs under WIN-OS/2, WINDOWS 3.1 and WIN/NT. The following documentation is necessary:

Use the CBT course for offline education prior to the installation.

The latest version of the CBT course can be requested with the command "TOOLS SENDTO LEXVMIC1 TOOLS SERVTOOL GET 9037CBT PACKAGE". Follow the instructions in the file "9037CBT PACKAGE" to create and install the diskettes. The timestamp for the 9037CBT* DSKFBIN files is 24 Oct 1996.

Useful "helps" from the Console Application (or CBT Standalone C.A.)

To switch between the CBT course (or the Operator panel), the Standalone Console Application and the Console Help when all of them are open, use ALT+TAB (OS/2 or WIN/NT console).

To clear the check marks of completed modules and lessons, use "CTRL+C".

Report any error observed while using the CBT course.


1.4 Other Documentation/redbook/PE links:

 
  • 9037 Redbook:
    "S/390 Time Management and IBM 9037 Sysplex Timer redbook link"
     
  • 9729 Wavelength Division Multiplexer in Large Systems Environment:
    "9729 redbook Link"
     
  • 2029 Redbook:
    "IBM 2029 Fiber Saver Implementation Guide, SG24-5608"
     
  • FocusPE link for ETS protocols description, Interconnect products year 2000 information, Sysplex Timer Console Application on the S/390 HMC II, Model 1 to Model 2 migration procedure, Exception Letters, LIC, etc.
    "FocusPE link"
     
  • The Montpellier P.E. "9037-002 Maintenance Introduction" document is on the MOP PE server:
    "MOP PE server"
     
  • Interconnect Product Alert site (2029 / 9032 / 9037) :
    "ICP alerts"

Chapter 2. Installation


    2.1 Configurations:

    The 9037 model 2 exists in Basic, Expanded Basic and Expanded Availability configurations. Ports are available by increment of 4 from 4 to 24.
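As a quick planning aid, the port-count rule stated above can be expressed in a few lines of Python. This is only a sketch: the function name `is_valid_port_count` and the constants are illustrative and not part of any IBM tool.

```python
# Sketch only: validates a port count against the rule stated above
# (4 to 24 ports, in increments of 4). All names here are hypothetical.
MIN_PORTS = 4
MAX_PORTS = 24
PORT_INCREMENT = 4


def is_valid_port_count(ports):
    """Return True if 'ports' is 4, 8, 12, 16, 20 or 24."""
    return (
        isinstance(ports, int)
        and MIN_PORTS <= ports <= MAX_PORTS
        and ports % PORT_INCREMENT == 0
    )


if __name__ == "__main__":
    # List every port count the rule accepts.
    print([n for n in range(0, 30) if is_valid_port_count(n)])
```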

    A standby console is recommended but remains optional for every configuration.

    A Basic configuration cannot be upgraded concurrently, because the CLO card needs to be installed.

    Two ETR networks can be monitored from a console.


    2.2 Features:

    A Basic configuration has 4 ports.

    The M.A.U. is not automatically delivered with the 9037-002.

    Here are the feature codes for 9037-002:


    2.3 Recommendations:

    Use the 9037 Planning tasks, Configuration Planning checklist and the Configuration worksheet (Planning Guide). Do not leave any single point of failure in the installation (e.g. same power sources, same path for the fibers, same location for the boxes, etc.).

    The Planning Guide lists recommendations for continuous availability.

    The total cable distance between both boxes (CLO links) cannot exceed 40 km (RPQ 8P1955).

    The CPC Transmit and Receive fibers must have matched lengths within 10 meters (using trunks and distribution panels).
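The two fiber-length rules above (40 km maximum between the boxes on the CLO links, Transmit/Receive lengths matched within 10 meters) can be checked against a cabling plan with a small script. The sketch below is only an illustration: the function names are hypothetical and the lengths are assumed to come from the site's own cable records.

```python
# Sketch only: checks the two fiber-length rules stated above.
# All names are hypothetical; lengths come from the site cabling plan.
MAX_CLO_DISTANCE_KM = 40.0    # total distance between both boxes (RPQ 8P1955)
MAX_TX_RX_MISMATCH_M = 10.0   # CPC Transmit/Receive fiber length matching


def check_clo_distance(distance_km):
    """Return True if the CLO link length is within the 40 km limit."""
    return distance_km <= MAX_CLO_DISTANCE_KM


def check_tx_rx_match(tx_length_m, rx_length_m):
    """Return True if Transmit and Receive fibers match within 10 meters."""
    return abs(tx_length_m - rx_length_m) <= MAX_TX_RX_MISMATCH_M


if __name__ == "__main__":
    # Example values are invented for illustration.
    print(check_clo_distance(38.5))         # within the limit
    print(check_tx_rx_match(1200, 1215))    # 15 m mismatch, too large
```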

    Example of the consequences of a single point of failure: the two CLO fiber paths (A and B) must be different, because the loss of both signals puts the secondary timer into the "Dormant" status, i.e. it stops emitting to the CPCs until the resynchronisation with the primary is successful via one CLO link. This could lead to system(s) down if the alternate ETR port of one or more CECs does not receive the ETR signal from the primary timer, either as a consequence of the same repeater failure (same paths used) or for any other reason. Also read the "Sysplex disaster recovery" chapter in this document.

    Prior to the timers installation you need to determine:


    2.4 LAN:

    The console communicates with the "local" timer connected to the LAN. It communicates with the "remote" timer (not connected to the LAN) through the CLO link. Both timers can be LAN attached to facilitate network management.

    The default data rate for the timer and the console is 16 Mbps.

    A dedicated LAN is recommended; a congested LAN should be avoided.

    If the timers are connected through a router, it must employ bridging functions. Routing functions are not supported.


    2.5 Initialization / Offset changes:

    You MUST enter the Time Offsets values (even if 00:00) and the Leap seconds (even if 0). Then, once the Timer Network Time is entered, the "Initialize" push button is unmasked, allowing you to initialize with validly formatted offsets.

    During normal operations, a planned offset change (DST + Leap seconds) will be applied on the planned date, even without a Console Application connection. If a power outage occurs at that very moment, the offsets will be applied after the next IML following the planned date. On 967X, refer to TIP H015665 to manually update the S.E. clock, which is not automatically updated for DST modifications (no UTC change), if necessary.

    If a DST offset change is not seen by MVS, set the timers' maintenance mode ON then OFF to force MVS to analyze the offsets. This problem is corrected by the following MCLs: Driver 12 stream F11130 MCL003 - Driver A2 stream F10985 MCL006.

    Here are the next DST dates:


    2.6 Distant Sites Installation Sequence (E.A. Configuration):

    Refer to the Installation chapter of the Maintenance Information Manual and follow its procedure for each step below.

    Until this is clearly stated in the Installation Sequence checklist, it may be worthwhile to install distant sites in the following sequence when time is constrained or two C.E.s are not available:

    Follow the Installation instructions modifying the sequence as follows:

    1. On one site (the secondary site):
    2. On the other site (the main site):

    2.7 Upgrade an Expanded Basic Configuration to Expanded Availability:

    Follow the M.I.M. procedure. Note that if a CEC has an ETR Master card with only one port connected, it will not be possible to restart the "EB" timer as requested at the end of the upgrade in order to provide the proper port configuration to MVS for the next IPL.

    2.8 Distant Sites Upgrade Sequence (E.B. to E.A. Configuration):

    The following refers to the "Upgrade" procedure (MIM page 6-93) and covers a very specific case.

    If the new timer installed in the remote site is NOT to be LAN attached:

    First install it on the main site near the existing E.B. one to have it LAN attached in order to perform the upgrade procedure with the console.

    When the new timer is inserted in the existing ETR network and just before connecting the CPC fibers you'll need to power off the new OFFLINE timer, disconnect the CLO cables (do not disconnect them before powering off), and move it to its destination site.

    Note that moving the timer after the CPC fibers are attached would require an "SCP connectivity" verification to ensure that two OPERATIONAL/ENABLED ports are available for each SCP image and that no disruption will occur to the customer operations. Follow the "Service Impact Analysis" procedure as if you were performing a non hot-plug part replacement.


    2.9 Extended distance (2029/9729):

    See the above "Recommendations" chapter to install repeaters. Be aware of cable length limitations. The CPC Transmit and Receive fibers must have matched lengths within 10 meters (using trunks and distribution panels). Distances supported for the 2029: 50 km for point-to-point configurations; 35 km for hubbed-ring configurations between the hub and the farthest remote site (distances are device and protocol dependent).

    Note that 2029 Sysplex Timer links cannot be configured as a "high availability" channel (not supported); configure them as a "base channel". Sysplex Timer links are not supported in hubbed-ring configurations.

    RPQ 8P1955 (CF/Sysplex Timer Link extension with 9729 /2029): This RPQ is a pricing vehicle that addresses deviations from the allowed link distance, link budget or other physical layer specification for the Sysplex Timer and Coupling Facility Links when used in conjunction with the IBM 9729 Wavelength Division Multiplexor or 2029 (DWDM). With approval of this RPQ, distances can be increased to 40 km for both timer-to-timer and timer-to-CPC connections. It is extremely important to note that the timer-to-timer CLO link distance cannot exceed 40 km without obtaining explicit IBM approval. Extending the CLO link distance beyond 40 km could result in the two timers being out of synchronization with each other, thereby resulting in a potential data integrity exposure. The out-of-synchronization condition is not detectable during normal system operation. Support of this RPQ may require modifications of the standard 9729 product; these modifications may include, but are not limited to, hardware changes such as removal of pluggable optical attenuator and custom card rework on the 9729 laser and receiver cards. Approval is based on modelling and experimental measurements made in the S/390 Poughkeepsie Hardware Development Laboratory.

    9729: The ETR support is provided with the ETR/ESCON card PN 85H7277.

    Please read chapter 3, "IBM 9729 in a Large System Environment", in the 9729 redbook SG242138 below:

    "9729 Redbook"


    2.10 Timer Network display:

    The top box displayed on the Sysplex Timer Network Console window is the timer with the IP address configured in the Timer Network Connection Definition window. The bottom one is the timer attached to it through the CLO link. Be careful when using functions: depending on the Timer IP address given in the Network Connection Definition dialog box, the top box may differ from one console to the other, or after a modification of the definition. Refer to the Unit Id of each box.

    The Unit Operational State ("Dormant", "Starting", "Offline", "Online", "Service") is described in appendix B of the Maintenance Information manual or in the Online Help (PF1).


    2.11 Console configuration:

    If the active console is failing you may reconfigure the standby console as active. Use the "Timer Network Connection definition" dialog box.

    Initial passwords are ADMIN, MAINT, OPERATOR. If a password is deleted for a level, it is automatically deleted for subsequent levels. The highest authority password (Administrative pw) can be used for any level.


    2.12 Consoles switching:

    How to switch between the active local and remote standby consoles?

    When you switch the local active console to standby, a warning indicates that the link will be momentarily dropped. If you then try to open the Sysplex network console window, the Link status indicator stays red with the message "Duplicate console" in the Console type indicator (bottom line). This is a normal situation, as 2 consoles of the same type (Active or Standby) cannot co-exist. When, from the remote site, you switch the former standby console to active, the local console recovers automatically and becomes standby without any other intervention. This may take some 2 to 20 seconds depending on the installation (busy public LAN, HMC, etc.).


    2.13 DCAF control (Remote Access Console):

    Refer to chapter 10 of the MIM, or to the "S/390 Console installation and User's guide" GA22-7291, to set up and use a remote Console.

    A DCAF TCP/IP problem exists when using a 6887KAG HMC as a controller to a 6562 HMC target. This is due to the MATROX video driver v2.14 of the 6562. Fix: V2.21 driver, which will be in the next HMC driver CD ROM (DR14 or above when available).

    Important points:


    2.14 Adding a Port card:

    MES instructions are available for installing Feature ports (FC4040).

    2.15 TIPS (EXTRACT of 9037 model 2 related tips):

    TDR H135009 "PROCEDURE TO MIGRATE FROM A 9037-001 TO 9037 002". Refer to the "9037MIGR" package under MKTTOOLS. To obtain a copy of the planned outage migration procedure use the command "REQUEST 9037MIGR FROM FOCUSPE AT PKEDVM9". A limited outage migration procedure (concurrent) is available.

    TDR H161444 "ALLOW 9037-2 (ETR) MIGRATION ON 9021 711 BASE MODELS". 9021 patches required for the above migration (G status on 10/15/1997)

    TDR H094436 "RESERVED FOR SYSPLEX TIMER CEC PATCHES". 3090/9021/9121 hardware patches and MVS APAR/PTFs

    TDR H163842 "9037 MODEL 1 CONSOLE YEAR 2000 SUPPORT" (Jan 1st 2000 is a Saturday - March 1st 2000 is a Wednesday due to 02/29)

    TDR H107416 "ADDITIONAL BOC/TOD INFORMATION". Guidelines for time changes in a Sysplex environment.

    TDR H015665 "PROCEDURE TO SET CPC TOD ON 9672'S OR 2003"

    TDR H126950 "9672 9673 9674 BOC TOD CLOCK INFO"

    TDR H13617 "SETTING THE DATE AND TIME IN MVS"

    TDR H057867 "TELEPHONE DATE CODE PROVIDERS URL + PHONE (MODEM2 PROTOCOL"

    TDR H022920 "967X MVS/ETR REFCODE DEFINITIONS (753A01XX)"

    TDR H131142 "967X MVS/ETR REFCODE DEFINITIONS (753A01XX)"

    TDR H123826 "IEA261I DURING MVS IPL BUT ETR PORTS OPERATIONAL".

    TDR H012151 "967X CALLS HOME WITH 753A0101"

    TDR H126171 "MVS MESSAGE IEA262I ON IPL / SE REFCODE 753A0102"

    TDR H134542 "RC 753A0100 ON 967X PROCESSORS"

    TDR H00892 "IEA272I RC 042 MVS MESSAGE INFORMATION"

    TDR H16445 "MVS MESSAGE IEA272I RC40 GENERATED ON 9037 CONSOLE SHUTDOWN"

    TDR H131232 (Potential Sysplex down sit. with 9037-001) "9037 MODEL 001 RC040 COMMUNICATIONS LOSS WITH CONSOLE"

    TDR H045772 "ETR PORT NOT OPER: MVS IEA262I MSG+NO CALL HOME RC 753A0102"

    TDR H057928 "RESTORE TIMER AND ESCD CONSOLE FILES AFTER UPGRADE TO DRV 22"

    TDR H166191 "TRAP 000D AFTER 02.00.00 CODE ECA009 INSTALL"

    TDR H057587 "9032/9037 C.A. HMC BACKUP FILE OVERWRITTEN - NOT RESTORED".

    TDR H057553 "HMC UPGRADE DR98 TO DRA2 FAILURE".

    TDR H123632 "9037-002 ALL PORTS FAIL RC32. WRAP TESTS FAIL ALL PORT"

    TDR H057415 "9037002 RC 077/093/006 DURING AN INIT WITH A NEW NETWORK ID"

    TDR H057418 "WSAASYNCSELECT() FUNCTION ERR. USING THE 9037002 C.A. on HMC"

    TDR H057419 "9037002 C.A. FRENCH KEYBOARD CUSTOMIZATION ON HMC AT D92W"

    TDR H057441 "77320118 / 77320100 RESULTING IN A 100% MCM REP. CALL HOME"

    TDR H16656 "AFTER HMC CODE UPGRADE 9032 AND/OR 9037 CONSOLE MISSING"

    TDR H04112 "HMC GETS ANYNET ERROR : ANY0352I WHILE LOADING NWWAIT-SECS"

    TDR H161369 "REPLACING A MOD 002 CTP CARD, POSSIBLE LICC VERSION CHANGE" (RC 014)

    TDR H055797 "9037 MODEL 002 USING EXTERNAL TIME SOURCE (ETS) FROM HOPF"

    TDR H054682 "9037 MODEL 002 USING EXTERNAL TIME SOURCE (ETS) FROM HOPF"

    TDR H135103 "9037 MODEL 002 FEATURE CODE 4910"

    TDR H107214 "9037 USA TIME PHONE NUMBERS" (VOICE-MODEM).

    TDR H136366 "DCAF LEVEL ON THE HMC CAN BE DISPLAYED"

    TDR H132707 "SEAMLESS APPLICATIONS HANG UNDER OS/2 2.11"

    TDR H131749 "NEW MAU'S(8228) MUST BE INITIALIZED BEFORE USE"

    TDR H135769 "PC 330 MODEL 6576 DCAF RESTRICTION FOR 9032 / 9033"

    TDR H135763 "WARNING - TWO UNIQUE RESTORE TAPES FOR 9037"

    TDR H134179 "WORDING IN 9037 MODEL 1 MIM CAN CAUSE CUSTOMER OUTAGE".

    TDR H057790 "INSTALL LATEST MICROCODE.. VERSION 02.00.00.(MODEL 002 ONLY)"

    TDR H057794 "REPLACE CURRENT POWER SUPPLY IN SELECTED 9037 MODEL 002."

    TDR H161516 "CRITICAL EC UPDATES LIC TO VERSION 01.02.01 (MODEL 2 ONLY)". See ECA008 in the EC chapter.

    TDR H057483 "9037-002 TIMER LIC VERSION 01.02.01 IMMEDIATE AVAILABILITY"

    TDR H161550 "CORRECT A POTENTIAL PROBLEM WITH CTP CARD 9037 MOD 2 ONLY". See ECA007 in the EC chapter.

    TDR H162807 "UPDATE CONSOLE/SYSTEM LIC TO VERSION 01.02.00 (MODEL 2 ONLY)". This TIP provides the console application improvements.

    TDR H16482 "9037 MODEL 2 INTERIM FIX LIC" (LIC 01.01.04) (RC044 + shutdown // Two bridges connections // RC032 all ports + POSTS RC089 problems).

    TDR H135763 "INSTALLATION NOTES FOR ECA004 9037 MODEL 2"

    TDR H136497 "ECA004 - GA LEVEL CLO CARD, LIC & DOCUMENTATION"

    TDR H136779 "INSTALLATION NOTES FOR ECA005 9037 MODEL 2"

    TDR H136498 "ECA005 - ECE46347 REPLACE LOGIC BOARD" (see ECA004).

    TDR H132964 "MISSING STEP FOR INSTALL/REMOVE 9037 FEATURE PORTS" (MODEL 1 only).


Chapter 3. Maintenance Information


    3.1 Maintenance guidelines

    No preventive maintenance.

    The current MIM version (SY27-2641-02) is supplemented by the Service Exception Letter T200SR01 (October 30th, 1998). A copy can be obtained from the server with the following VM command: REQUEST T200SR01 PACKAGE FROM FOCUSPE AT PKEDVM9

    The objective of the MIM is to maintain a continuous availability, with NO outage if a unit power off is necessary (E.A. configuration) during an intervention:

    1. A "Service Impact Analysis procedure" must be performed before a 9037-002 is taken out of service, to assess the impact on the Sysplex and avoid an operations outage.
    2. A procedure to "Prepare the 9037-002 For Service" must be performed to isolate one timer from the sysplex configuration when it is necessary to service it.
    Only port cards are hot-pluggable.

    The console is required to:

    1. Install the timer.
    2. Define or change the ETR network definitions (active/standby console - Modify the IP address to communicate with the other LAN attached box if necessary).
    3. Manage and monitor the ETR networks (two networks maximum).
    4. Display/set/adjust the time and date.
    5. Access the logs.
    Under normal timer operations a console outage affects only the management and maintenance capabilities (e.g. scheduled offsets will take place even if the console application is not operational). The maintenance mode can be set on a timer so that a standby console has access to options/functions normally available from the active console, e.g. "Port Control" functions. The following main options/functions are accessible only from the active console; they cannot be used from the standby console in either mode:
    1. Configure Time Offsets.
    2. Initialize Timer Network.
    3. Adjust Time (calculate or adjust).
    Refer to the table 2-1 page 2-179 of the Maintenance Information manual for options/functions available from the standby console ("Console functions available when Maintenance mode is set/cleared" table.)

    More recommendations:

    1. Do not close the Console Application; minimize the window when it is not needed.
    2. Do not leave the Maintenance mode set, critical error reporting could be lost, i.e. not reported to attached CPCs.
    3. The console System Error Indicator should be reset as soon as a System Error event is analyzed/repaired to avoid masking the next one. Refer to the Event Codes in appendix A of the Maintenance Information manual to identify such System Error Light events (SEL). This is not necessarily linked to the severity (2 and above).
    4. Use an ESD wrist strap to remove FRUs.
    5. The handle must only be used to replace the logic board.

    3.2 Problem determination - "Maintenance Information" documentation:

    Event reporting and Reason codes: Appendix A. Events are shown on the operator panel display in the following format: Reason code/severity level/byte 0 of the event data (rrr/s/xx). Refer to the appendix A table for the event report distribution (Console, Operator panel, CPC).
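When events are copied from the operator panel into a PMH record, a small parser helps to keep the three fields apart. The sketch below assumes that rrr is a 3-digit decimal reason code, s a single decimal digit and xx two hexadecimal digits; these field encodings are assumptions made for illustration (Appendix A of the MIM is the reference), and the names `Event` and `parse_panel_event` are hypothetical.

```python
# Sketch only: splits an operator panel event of the form rrr/s/xx.
# Assumed encodings (not confirmed by this document): rrr = 3 decimal
# digits, s = 1 decimal digit, xx = 2 hexadecimal digits.
import string
from collections import namedtuple

Event = namedtuple("Event", ["reason_code", "severity", "data_byte0"])


def parse_panel_event(text):
    """Parse 'rrr/s/xx' into an Event; raise ValueError if malformed."""
    parts = text.strip().split("/")
    if len(parts) != 3:
        raise ValueError("expected three fields separated by '/'")
    rrr, sev, xx = parts
    if len(rrr) != 3 or not rrr.isdigit():
        raise ValueError("reason code must be 3 decimal digits")
    if len(sev) != 1 or not sev.isdigit():
        raise ValueError("severity must be a single digit")
    if len(xx) != 2 or not all(c in string.hexdigits for c in xx):
        raise ValueError("byte 0 must be 2 hexadecimal digits")
    return Event(int(rrr), int(sev), int(xx, 16))


if __name__ == "__main__":
    # The event string below is an invented example.
    print(parse_panel_event("040/2/1F"))
```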

    The MVS Msg IEA100I is generated by informational only reason codes. Any reason code of a higher severity, when reported to the CPC, is logged with the MVS Msg IEA272I and results in a service call. So does an event detected by MVS, e.g. a non-operational port (IEA262I). Every 967X call home is performed under RC 753A010* (See Tip H131142). On 967X, the "A1_record" S.E. log entry reports the reason code when available to MVS along with the MVS message.

    Diagnostic Information: Chapter 2 (MAPS and Utilities).

    "Service Impact Analysis procedure" and "Prepare the 9037-002 For Service" : Chapter 3.

    Offline/Online verification procedure used after the replacement of a FRU: Chapter 3 (Repair Information).

    Operator Panel - Console Messages (Unit status line messages, Error messages, LIC install messages): Appendix B.

    MVS/ESA messages: Appendix C.

    TPF messages: Appendix D.


    3.3 Support request:

    If you request support for a failure, provide the proper information to the support team:
    1. Use the Data Collection Procedure, MIM page 2-187, send the COLLECT.* files to the support VM below, and record it in the PMH. If possible, perform it on both consoles. When the first console COLLECT.* files are saved, rename them to avoid overwriting.
    2. Before uploading both sets of files to VM in binary, rename them with a file name related to the PMH number and type of console, then send them to "MOPLOGS AT MOPVMA" (e.g. 06060ACT.TRM and 06060SBY.TRM).

    3.4 Timer LIC and Console Application levels:

    Whenever a CTP card is replaced, ensure the new card's LIC level matches the one you replaced per TIP H161369.

    The levels are identified as "XX.YY.ZZ" for "Version.Release.Patch".
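Because the levels follow a fixed "Version.Release.Patch" layout, they can be compared field by field, for example to confirm that a replacement CTP card matches the level of the card it replaces, or that a timer is at a given minimum level. The following is a minimal sketch; the function names are hypothetical.

```python
# Sketch only: parses and compares LIC levels written as "XX.YY.ZZ"
# (Version.Release.Patch). Function names are hypothetical.
def parse_lic_level(level):
    """Return (version, release, patch) as integers."""
    parts = level.strip().split(".")
    if len(parts) != 3 or not all(p.isdigit() for p in parts):
        raise ValueError("expected a level of the form XX.YY.ZZ")
    return tuple(int(p) for p in parts)


def lic_levels_match(level_a, level_b):
    """True if both levels are identical."""
    return parse_lic_level(level_a) == parse_lic_level(level_b)


def lic_at_least(level, minimum):
    """True if 'level' is equal to or newer than 'minimum'."""
    # Tuples compare field by field: version first, then release, then patch.
    return parse_lic_level(level) >= parse_lic_level(minimum)


if __name__ == "__main__":
    print(lic_at_least("02.00.00", "01.02.01"))
    print(lic_levels_match("01.02.01", "01.02.00"))
```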

    To check both levels:


    3.5 Console Restore tapes - optical disk -diskettes P/Ns:

    Level 01.02.00 - Console restore tape for 6576: 76H7811. The P/N is unique for each type of console; refer to TIP H135763.

    Level 01.02.00 - Console restore diskettes (3): P/Ns 76H7807, 76H7808 and 76H7809 at EC 3880.

    Level (01.03.00) and OS/2 Warp 4.0 (6589 PC):

    Level 02.00.00 diskettes:

    3.6 Timer LIC install:

    The timer code is pre-loaded. The Timer LIC is loaded on the console for backup purposes during the installation or during an upgrade.

    LIC Install Messages are explained in the appendix page B-9.

    Follow the appropriate procedure to install a new version of the LIC on the timers: "Installing LIC on a Timer Network" in the Maintenance Information (An existing version can't be re-installed).

    Do not perform a code download in a Basic or EB configuration while the timer is ONLINE: this would cause a Sysplex down situation. Instead, plan for a system maintenance slot (timer network outage) and follow the "Service Impact Analysis" procedure (MIM).

    The LIC install is concurrent with customer operations in an E.A. configuration provided some requirements are met: follow the procedure in the Maintenance Information Manual to ensure that a LIC install will not cause a customer outage in an E.A. configuration. During the sequenced timer restart following the code load, it may NOT BE CONCURRENT in some situations:

    For such situations stopping the transmission during the timer restart (the "remote" timer is restarted first - i.e. the one communicating with the console through the CLO links) will cause either a side switching on MPs where it is applicable, which may not be conclusive, or a switch to local mode resulting in a customer outage (Disabled wait for one or more MVS images).

    Do not rely on the Port State indicator (an arrow is up when the port is transmitting normally). This is not a guarantee of the receiver status. Refer to the Service Impact Procedure and verify with the MVS command "D ETR,DATA" that both ports are OPERATIONAL and ENABLED for each MVS image before the LIC install is attempted. The route command "RO *ALL,D ETR,DATA" can be issued to see the ports of all partitions in a row. On MPs only one side status can be displayed (0 or 1, not both).

    Of course, during the Timer code load operation the boxes are restarted (stop transmitting to the CPCs) one after the other to avoid a disruption:


    3.7 MVS Commands:

    To verify that each SCP image using the Sysplex Timer has an OPERATIONAL and ENABLED redundant connection to the other Timer before stopping the Timer requiring service (Power off - Restart - LIC Installation):

    Enter "D ETR,DATA" at the SCP console to display:

    IEA282I 03.50.05 ETR STATUS
    SYNCHRONISATION MODE = ETR       CPC SIDE = 0
    CPC PORT 0 <=== ACTIVE           CPC PORT 1
    OPERATIONAL                      OPERATIONAL
    ENABLED                          ENABLED
    ETR NET ID = 10                  ETR NET ID = 10
    ETR PORT = 02                    ETR PORT = 02
    ETR ID = 02                      ETR ID = 03

    Verify the results against the Service Impact Procedure; the last line displayed is the Unit Address of each box (details of MVS message IEA282I are in Appendix C of the Maintenance Information manual).
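When many images have to be checked, the captured display can be screened by a script before a human reads it. The sketch below assumes the two-column layout of the sample display in this section (port 0 on the left, port 1 on the right, columns separated by at least two blanks); a real console log may be laid out differently, so treat it as an illustration only. The function names are hypothetical, and the result never replaces the Service Impact Analysis procedure.

```python
# Sketch only: screens a captured "D ETR,DATA" display for two
# OPERATIONAL and ENABLED ports. Assumes the two-column layout of the
# sample in this document; names are hypothetical.
import re

SAMPLE = """\
IEA282I 03.50.05 ETR STATUS
SYNCHRONISATION MODE = ETR       CPC SIDE = 0
CPC PORT 0 <=== ACTIVE           CPC PORT 1
OPERATIONAL                      OPERATIONAL
ENABLED                          ENABLED
ETR NET ID = 10                  ETR NET ID = 10
ETR PORT = 02                    ETR PORT = 02
ETR ID = 02                      ETR ID = 03
"""


def port_columns(display):
    """Return two sets holding the entries shown under port 0 and port 1."""
    ports = [set(), set()]
    for line in display.splitlines():
        # Columns are assumed to be separated by two or more blanks.
        cols = re.split(r"\s{2,}", line.strip())
        if len(cols) == 2:
            ports[0].add(cols[0])
            ports[1].add(cols[1])
    return ports


def redundant_connection_ok(display):
    """True only if both ports show OPERATIONAL and ENABLED."""
    required = {"OPERATIONAL", "ENABLED"}
    return all(required <= port for port in port_columns(display))


if __name__ == "__main__":
    print(redundant_connection_ok(SAMPLE))
```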

    Another useful command re-enables a port that has been DISABLED by MVS (use it after analysis and repair, to avoid a recurrence of this DISABLED, not stepping, state):

    Enter "SETETR PORT=n" at the SCP console (n=0 or 1).


    3.8 WAIT CODES 0A2:

    LPAR WAIT codes 0A2 are related to the Sysplex Timer only if the Reason Code is '0C' (MVS V4) or '114' (MVS V5). Refer to the MVS documentation for the Operator and Sys.Programmer responses for this Wait code. Also refer to the IXC220W message details.
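The rule above can be written down as a small lookup, which may help when triaging wait states from several images. This is only a sketch using the two reason codes quoted in this section; the function name and the "V4"/"V5" keys are hypothetical.

```python
# Sketch only: encodes the rule stated above. A WAIT 0A2 is related to
# the Sysplex Timer only for reason code '0C' (MVS V4) or '114' (MVS V5).
# The function name and the version keys are hypothetical.
TIMER_REASON_CODES = {"V4": "0C", "V5": "114"}


def is_sysplex_timer_wait(wait_code, reason_code, mvs_version):
    """True if the wait code/reason code pair points at the Sysplex Timer."""
    if wait_code.strip().upper() != "0A2":
        return False
    expected = TIMER_REASON_CODES.get(mvs_version.strip().upper())
    return expected is not None and reason_code.strip().upper() == expected


if __name__ == "__main__":
    print(is_sysplex_timer_wait("0A2", "114", "V5"))
    print(is_sysplex_timer_wait("0A2", "0C", "V5"))
```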

    3.9 Token ring failure:

    Console-to-timer communication is established as follows:

    Console >---tkrg---> timer "A" >---CLO Links A/B---> timer "B"

    Follow the related MAPs and TIPs. Token ring errors must be investigated before the occurrence of a RC 138 "Token ring permanently disabled", which halts the recursive recovery and disables token ring (tkrg) communications. This situation is recoverable only with an IML of the unit (DISRUPTIVE for non-E.A. configurations). To perform an IML in a non-disruptive way (E.A. configuration only), follow the MAPs to perform a dummy CTP replacement ('Service Impact Analysis' and 'Preparing the Sysplex Timer for service' procedures). If the other box can be accessed from the console, restart the failing unit using the 'utility' menu (CLO link communications must be available). If not, force the IML from the box: power the failing unit off and on. Then continue with the MIM procedure to restore the configuration.

    The CTP card replacement should be attempted only as a last resort, as it is disruptive except for E.A. configurations. If possible, create a 'private' LAN with only a MAU, the console and the timer in the configuration; this will prevent interference with other equipment on a customer 'public' LAN and should facilitate the investigation.

    Isolating parts from the current LAN will show whether the problem comes from the console, from the timer or from another component on the current LAN.


    3.10 Loss of communication -RC 040/041- with the Active (Standby) console:

    Recursive RCs 040/008 added to RC 076 may lead to a Sysplex Down on the model 001 (Ref TIP H131232).

    A loss of communication with the console(s) has no impact on the timer operations.

    Do not close the Console Application; minimize the window when it is not needed. From LIC 01.03.00 a dialog box appears to warn the user not to close the application.

    A RC 40 (41) will appear in the Event log only after reconnection.

    Possible causes of RC 40 or 41 when they are recorded alone:

    If in any doubt, follow the MAPs/TIPs for this RC.


Chapter 4. E.C.s (purpose and contents):


    4.1 LIC HISTORY:

    This chapter gives information on current ECs only (obsolete information is not kept, to avoid confusion). Refer to TIP H164503 for a complete 9037 MODEL 2 LIC HISTORY.

    4.2 ECA009 - Version 02.00.00 information:

    The LIC level 02.00.00 is delivered through ECA009 - EC F11237. This code contains support for the European "TELEPHONE DATE CODE" External Time Service and a fix that helps reduce the possibility of further CTP card data corruption if the PSA fails (see ECA010). Ref TIP H057790.

    4.3 ECA010 - Selective PSA replacement:

    A replacement of some PSA units must be performed through the ECA010 - EC F11236. Refer to the TIP H057794 for the box S/Ns where the PSA must be replaced ASAP.

    4.4 Selected CTP cards replacement:

    The list of box S/Ns impacted by ECA007 can be inspected (ref TIP list). This list contains ALL machines affected (since 01/20/1998).

    The EC E46348 package is released (ECA007):

    The EC E46349 (ECA008) - timer LIC 01.02.01 or above - is a prerequisite for the EC E46348 (ECA007).

    To avoid a RC 014 due to different LIC levels in the two boxes, read the appropriate TIP (ref TIP list).


Chapter 5. External Time Source


    5.1 Sysplex Timer accuracy and E.T.S.:

    The typical Sysplex Timer accuracy is 1 ppm, i.e. about 0.6 s/week. If an out-of-specification drift is observed, switch the primary unit and monitor with the new primary unit (E.A.).
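The weekly figure follows directly from the ppm value: one week is 604800 seconds, and 1 ppm of that is about 0.6 s. The short sketch below reproduces the arithmetic so that an observed drift can be compared with the specification; the function name is hypothetical.

```python
# Sketch only: converts an oscillator accuracy in ppm into the maximum
# expected drift over an interval. The function name is hypothetical.
SECONDS_PER_WEEK = 7 * 24 * 60 * 60  # 604800 seconds


def drift_seconds(accuracy_ppm, elapsed_seconds):
    """Drift in seconds accumulated over 'elapsed_seconds' at 'accuracy_ppm'."""
    return accuracy_ppm * 1e-6 * elapsed_seconds


if __name__ == "__main__":
    # 1 ppm over one week: roughly 0.6 s, as quoted above.
    print(round(drift_seconds(1, SECONDS_PER_WEEK), 4))
```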

    The Sysplex Timer is able to steer its oscillator to maintain an accuracy of 100 microseconds versus an ETS (5 ms for the model 1) if a CLO card is installed (E.B. or E.A. configuration), with the 1 PPS option. Without it, the accuracy is maintained to within 0.001 s of the ETS.

    The ACTS (Boulder, CO) is accurate to within 0.01 s of UTC, with a propagation delay through telephone lines of less than 0.1 s for the U.S.

    The European Telephone Date Code support is provided with the LIC 02.00.00, allowing an alternate modem source.

    The Radio Receivers and Time Code Generators are typically within 0.1 to 0.5 s of UTC.

    A GPS Satellite is typically within 100 nanoseconds of UTC.


    5.2 External Time Source tests:

    For test purpose the ETS can be defined on the console application without a re-initialization of the timer network. See the "How Do I... Define the External Time Source" online help. The S.T. unit will not track the ETS clock as long as you do not select the "Automatic Adjustment" check box.

    5.3 Basic Configuration:

    The attached ETS is accessed to initially set the TOD clock.

    Then, the ETS is only accessed for information, no steering is possible. Time difference can be seen through the ETS results dialog box.


    5.4 Time tracking:

    Refer to the TIP H00892 for the tracking RCs description.

    For the Expanded Basic or Expanded Availability configurations the Automatic Adjustment option must be set (ETS definition dialog box). If not set, the ETS will only be accessed for the Calculate option (Adjust Time dialog box) or initialization.

    To guarantee tracking, two E.T.S. must be used, one of which can be designated as the preferred one.

    How to display the difference between the Timer time and the ETS time?


    5.5 Leap seconds:

    If applicable, the leap seconds offset is 22 as of 99/01/01 00:00:00: a leap second was introduced at the end of December 1998, bringing the total offset to 22.

    No positive leap second will be introduced at the end of December 2000.

    For announcements of the next leap second occurrence, consult "Bulletin C" of the International Earth Rotation Service (IERS) below:

    "Observatoire de Paris - IERS link"


    5.6 Modem Access - US and EMEA telephone numbers:

    Customize the "External Time Source Definition" dialog box according to the selected service (either protocol "modem1" or "modem2"). Note that the 9037-001 supports the EMEA Telephone Date Code from LIC 4.10 (the latest LIC is 4.21).

    5.7 ETS connector pin numbering:

    Here is our 25-pin connector numbering:
    - The Sysplex Timer connector is cabled as DTE (Data Terminal Equipment).
    - It must be determined with the provider whether the ETS device is also cabled as DTE (true for most of them) or as DCE (Data Communication Equipment).
    - If it is cabled as DTE, a 'null modem connection' must be made: TxD of the timer to RxD of the device and vice versa, and RTS of one side to CTS of the other (in both directions).
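    The cabling rule above can be sketched as a small lookup. This is an illustration only: it assumes the standard RS-232 signal assignment on a 25-pin connector (TxD 2, RxD 3, RTS 4, CTS 5, signal ground 7), a straight-through cable for a DCE device, and a ground wire that the text above does not mention. The function name is hypothetical; the actual pin assignment must be checked in the M.I.M. and with the ETS provider.

```python
from typing import List, Tuple

# Assumption: standard RS-232 pin numbers on a 25-pin connector.
DB25_PINS = {"TxD": 2, "RxD": 3, "RTS": 4, "CTS": 5, "GND": 7}

# Timer signal -> device signal when both ends are DTE (null modem).
NULL_MODEM = [("TxD", "RxD"), ("RxD", "TxD"), ("RTS", "CTS"), ("CTS", "RTS"), ("GND", "GND")]

# Timer signal -> device signal when the device is DCE (assumed straight-through).
STRAIGHT = [(name, name) for name in ("TxD", "RxD", "RTS", "CTS", "GND")]


def cable_wiring(device_cabling: str) -> List[Tuple[int, int]]:
    """Return (timer pin, device pin) pairs for a device cabled as 'DTE' or 'DCE'."""
    kind = device_cabling.strip().upper()
    if kind == "DTE":
        pairs = NULL_MODEM
    elif kind == "DCE":
        pairs = STRAIGHT
    else:
        raise ValueError("device cabling must be 'DTE' or 'DCE'")
    return [(DB25_PINS[timer], DB25_PINS[device]) for timer, device in pairs]


if __name__ == "__main__":
    for timer_pin, device_pin in cable_wiring("DTE"):
        print(f"timer pin {timer_pin} -> device pin {device_pin}")
```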

    The manufacturers' URLs are on page F-8 of the M.I.M.


    5.8 GPS/Low freq receiver Protocol 1 documentation:

    Refer to the MIM SY27-2641-02, appendix F.

    As this protocol is common to both sysplex timer models you can also find the protocol 1 description under the 9037 model 1 chapter on the focuspe web site:

    "ETS Protocol 1 Information link"

    For protocols 2 and 3 use the MIM as referenced above.


    Chapter 6. Sysplex disaster recovery procedure


    6.1 9037 model 2 disaster scenario:

    A major problem occurs on the primary unit preventing it from sending the OffLine Sequence (OLS) to the other unit that would cause a normal recovery.

    Internal errors are usually detected (e.g. hardware error or power loss) and an OLS sent through the CLO link to the other unit. In this situation the alive timer will stay online (emitting) without disruption, either because it is already acting as primary or switched to primary on receipt of the OLS.

    A combination of incidents or a major problem may lead to both units not emitting when the OLS is not received by a secondary unit.


    6.2 9037 model 2 disaster prevention:

    For safe operations every timer must be at level 01.02.01 minimum (latest level: 02.00.00).
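    When several timers have to be checked against this minimum, the comparison can be sketched as follows. The sketch assumes that levels are always written as dot-separated numeric fields, as in the values above; the function names are illustrative.

```python
from typing import Tuple

MINIMUM_SAFE_LEVEL = "01.02.01"  # minimum level quoted in this section


def parse_level(level: str) -> Tuple[int, ...]:
    """Turn a level string such as '01.02.01' into the tuple (1, 2, 1)."""
    return tuple(int(field) for field in level.strip().split("."))


def meets_minimum(level: str, minimum: str = MINIMUM_SAFE_LEVEL) -> bool:
    """Return True when the level is at or above the minimum level."""
    # Tuples compare field by field, so 02.00.00 ranks above 01.02.01.
    return parse_level(level) >= parse_level(minimum)


if __name__ == "__main__":
    for level in ("01.02.00", "01.02.01", "02.00.00"):
        status = "OK" if meets_minimum(level) else "below minimum"
        print(level, status)
```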

    The Planning Guide "Recommendations for Continuous Availability" chapter must be followed for the installation of the S.T. network.

    To avoid an accumulation of incidents, do not leave unattended a CLO Reason Code indicating that a single link failure occurred (RC "77" or "78" alone). In this situation, from version 01.02.00 onwards, a message is issued to the host every 8 hours to warn the operator.

    When any intervention is required, follow the M.I.M. procedures to avoid creating such a scenario (e.g. if the CLO cables are disconnected before powering off one box, the secondary will go dormant, not emitting, and you may then power off the primary, emitting, box and cause the disaster). Following the M.I.M. procedures "Service Impact Analysis" then "Preparing the Timer for Service" will guide you to avoid this. Ensure you have the latest version of the M.I.M./Service Exception Letter.

    Have a backup tape of the latest configuration.

    Check in the following procedure which items would cause a delay, and take the proper actions before the 'panic'.


    6.3 9037 model 2 disaster recovery:

    For most of the failure scenarios, the recovery design will keep one timer alive and emitting to the CPC ports.

    The following manual recovery list ONLY APPLIES TO ONE SPECIFIC SITUATION: in an E.A. (Expanded Availability) configuration, one box is definitely out of order and time is needed for a repair action, while the other box is alive but in the 'DORMANT' status, which keeps it from emitting on the CPC ports. In this specific situation it is not possible to recover in E.A. The purpose of this manual recovery is to recover from the Sysplex down situation as soon as possible by restarting the S.T. network in an Expanded Basic (E.B.) configuration until the other box is repaired.

    Recovery sequence:


    Chapter 7. Hardware Management Console Support for the 9037 Model 2 Sysplex Timer


    7.1 Feature / Documentation

    FC 4910

    Documentation PN 620-900012-000

    Documentation sent with the FC4910:


    7.2 Installation Pre-reqs.

    Familiarize yourself with the documentation.

    Installation Time: 30 min

    Pre-req PC:

    A remote LAN containing an HMC can connect to the Timers LAN through a bridge.

    The timer can communicate with two consoles. If you are installing the Console Application on an HMC for testing purposes while two dedicated PCs are already installed as active and backup consoles, shut down one of them before starting the Console Application.


    7.3 Installation Summary

    Review the TIPs relevant to HMC and 9037 in the Installation chapter.

    7.4 Console Application backup:

    To avoid losing a C.A. backup file when the 9032 and 9037 Consoles are loaded on the HMC, follow the directions in TIP H057587 "9032/9037 C.A. HMC BACKUP FILE OVERWRITTEN - NOT RESTORED".

    7.5 Keyboard customization:

    Problem encountered with Driver 92W for French keyboard customization: during the OS/2 customization (Country and Keyboard), KBDFR.DLL is not loaded although SYSTEM.INI is updated with "keyboard.dll=kbdfr.dll". Subsequently the Win/OS2 default keyboard setting is shown as "French" and cannot be modified, although the actual setting remains U.S. through the existing KBDUS.DLL. (From this point, the only way to be prompted for the missing file during this keyboard customization is to go back to an OS/2 keyboard setting of U.S. and then customize Win/OS2 for the French keyboard. Even then, the file cannot be retrieved from the CD-ROM and must be copied from a diskette.)

    Refer to TDR H057419. You can get the file by entering the following command from any VM system command line: "TOOLS SENDTO MOPVMA CETOOLS RSCP GET KBDFR DLLBIN". Then copy the missing file to the C:\OS2\MDOS\WINOS2\SYSTEM\ directory.


    7.6 "Undefined Timer/Director Consoles" Icon (Group view):

    This new auto-discovery function allows a connection with the OS/2 Warp 4.0 timer console PCs (old installations: see ECA009) coming with LIC level 01.03.00 (Sept. 1998). The HMC must be at driver A2 minimum.

    For the appropriate procedures to install and use the PC-330, the PC 365, the HMC, the ESCD consoles, etc., refer to the "S/390 Console Installation and User's Guide - 9032 Models 3 and 5 Escon Director - 9033 Model 4 Escon Director - 9037 Model 2 Sysplex Timer", Ref. GA22-7291, shipped with the unit(s) (delivered from LIC level 01.03.00).

    Operations flow:


    7.7 Miscellaneous

    Shut down the application before shutting down the HWMCA, powering off the PC or activating MCLs. If the HWMCA was shut down, restart the HWMCA before restarting the S.T. Console Application.

    If the HMC hard drive was rebuilt, the programs associated with the "Extra Applications Folder" are restored but the folder is not. Follow the directions to recreate this folder.

    When installing the HMC, do not configure TCP/IP outside of the HWMCA, to avoid interference between the two applications.

    When installing the Sysplex Timer Console Application do not place its icon in the startup folder. (The HWMCA starts TCP/IP during its initialization).


    Chapter 8. ESCON Director Console Support for the 9037 Model 2 Sysplex Timer


    8.1 Feature / Documentation

    FC 4910

    Documentation PN 620-900013-000

    Documentation sent with the FC4910:


    8.2 Installation Pre-reqs (minimum):

    The following applies to the PC-330 with OS/2 Warp Connect 3.0.

    The ESCON Director Console code level must be at 2.1 or higher.

    Familiarize yourself with the documentation.

    Installation Time: 20 min

    Pre-req PC:

    Although there can be two PCs per Timer Network (Active and Standby Console Applications), the Director application will not run if more than one PC is operating as a Console: the 9032-003 can communicate with only one address.

    Do not install the Console Application on a Director Console controlling 16 Directors. The console is able to communicate with up to 14 directors and two timer networks.
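    The limit above can be expressed as a simple planning check. This is only a sketch: the two constants are the figures quoted above, and the function name is hypothetical.

```python
MAX_DIRECTORS_WITH_TIMERS = 14  # directors, when timer networks are also attached
MAX_TIMER_NETWORKS = 2          # timer networks per Director Console


def console_can_host(directors: int, timer_networks: int) -> bool:
    """Return True if a Director Console can host this mix of directors and timer networks."""
    if directors < 0 or timer_networks < 0:
        raise ValueError("counts must not be negative")
    if timer_networks == 0:
        # No Sysplex Timer Console Application involved: this check does not apply.
        return True
    return directors <= MAX_DIRECTORS_WITH_TIMERS and timer_networks <= MAX_TIMER_NETWORKS


if __name__ == "__main__":
    print(console_can_host(14, 2))  # within the quoted limits
    print(console_can_host(16, 1))  # a console already controlling 16 Directors
```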

    A remote LAN containing a Director Console can connect to the Timers LAN through a bridge.

    The timer can communicate with two consoles. If you are installing the Console Application on an ESCD Console for testing purposes while two dedicated PCs are already installed as active and backup consoles, shut down one of them before starting the Console Application.


    8.3 Installation Summary


    8.4 Miscellaneous

    Shut down the application before powering off the PC.