Second cluster member locks up

From: Colin Bull <c.bull_at_videonetworks.com>
Date: Wed, 24 Oct 2001 16:13:35 +0100

We are running Tru64 5.1 PK3 with a 2 server cluster, BM1 - ES40 3GMmemory,
BM2 - DS20E 2GB memory.
Both servers are connected to each other via 2 memory channel adapters to 2
memory channel
hubs. The are connected to a SAN by dual redundant fibre to 2 HSG80s.

We had a previous experience where BM1 had a faulty PCI motherboard changed,
and as it came up after repair the BM2 server locked. Any sessions just
hung, including telnet
and console sessions.

Today, as part of our User Testing, the power was pulled on the BM1 and
again the BM2 locked
solid. All telnet session and the console screen just locked. The direct IP
address
could be pinged, but the cluster and application IPs failed the ping.
The BM1 started rebooting and complained cfs_kgs_submit_join_proposal
cluster_root failed over and over again.

After an hour we reset both servers and they both came up.

Any suggestions ?

Colin Bull
DBA 2nd Floor Icon
VideoNetworks.com Watch over 1000 films AND Premier League football when
ever you want
Tel 01438 363496
Received on Wed Oct 24 2001 - 15:16:56 NZDT

This archive was generated by hypermail 2.4.0 : Wed Nov 08 2023 - 11:53:42 NZDT