Hard takeover fails in a 3-node cluster: incorrect serial network definitions
ENV: 9076 4.2.1 hacmp 4.2.2
CUSTOMER REP: Valerie Scott & Jack Anderson
PROBLEM: When doing hard failures back and forth, HACMP doesn't recognise
that the failed node went down. Graceful failover is OK, but a
power-off won't fail over. There is no numeric error code, hacmp.out
shows nothing, and clstat is yellow.
This is on nodes 5, 6 and 7. Node 7 still shows its interfaces up,
which is impossible. Node 5 is the backup that nodes 6 and 7
feed into. The customer is running a Lotus Notes application.
The customer found what they think is the problem.
They had cabled the serial connections like this:
6 tty2 --- tty1 7
According to the diagram, the serial networks should be:
serial1 5tty1 - 7tty2
serial2 7tty1 - 6tty2
serial3 6tty1 - 5tty2
However, when the customer defined the networks in HACMP,
they were typed in as:
serial1 Atty2 - Ctty1
serial2 Ctty2 - Btty1
serial3 Btty2 - Atty1
So when system 6 went down and 5 should have taken over, all of the
TCP/IP networks showed down for 6, but the serial2 network
still showed up because it was getting its heartbeat packets
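
The mismatch described above can be checked mechanically. The following is a minimal sketch, not an HACMP tool: it compares the cabling from the diagram with the definitions as typed in. The mapping of labels A, B and C to nodes 5, 6 and 7 is an assumption (the record does not state it), and the function names are hypothetical.

```python
# Cabling according to the diagram: network -> set of (node, tty) endpoints.
cabled = {
    "serial1": {("5", "tty1"), ("7", "tty2")},
    "serial2": {("7", "tty1"), ("6", "tty2")},
    "serial3": {("6", "tty1"), ("5", "tty2")},
}

# Networks as typed into HACMP, using the labels from the record.
defined_raw = {
    "serial1": {("A", "tty2"), ("C", "tty1")},
    "serial2": {("C", "tty2"), ("B", "tty1")},
    "serial3": {("B", "tty2"), ("A", "tty1")},
}

# ASSUMPTION: the record does not say which label is which node.
label_to_node = {"A": "5", "B": "6", "C": "7"}


def translate(networks, mapping):
    """Replace node labels with node numbers in every endpoint."""
    return {
        name: {(mapping[label], tty) for label, tty in ends}
        for name, ends in networks.items()
    }


def find_mismatches(cabled, defined):
    """Return the networks whose defined endpoints differ from the cabling."""
    return sorted(name for name in cabled if cabled[name] != defined.get(name))


def physical_peer(cabled, endpoint):
    """Return the endpoint physically wired to the given (node, tty), if any."""
    for ends in cabled.values():
        if endpoint in ends:
            (peer,) = ends - {endpoint}
            return peer
    return None


defined = translate(defined_raw, label_to_node)
mismatches = find_mismatches(cabled, defined)
peer_of_7_tty2 = physical_peer(cabled, ("7", "tty2"))
print("Mismatched networks:", mismatches)
print("Node 7 tty2 is wired to:", peer_of_7_tty2)
```

Under the assumed mapping, every network has its tty ports swapped, and node 7's tty2, which the definition assigns to serial2 (the link to node 6), is physically wired to node 5. That would be consistent with serial2 still receiving heartbeats after node 6 went down, but it depends on the assumed label mapping.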
Action taken: Talked with the customer; her serial networks are being redefined.
Action Plan: close
Support Line: Hard takeover fails in a 3 nodes cluster. incorrect serial definitions ITEM: FB2401L
Dated: December 1997 Category: N/A