Hi managers,
We applied the latest jumbo patch to our 5.1B 3-node cluster. The whole
rolling upgrade went through without any errors up to and including the
clu_upgrade switch stage. That stage also completed cleanly; the next step
is to reboot the nodes (so that the final clu_upgrade clean stage can be
run).
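
For reference, the sequence we ran was roughly the following (the member
reboot order was our own choice; the clu_upgrade stage names are as on
5.1B):

    # on the lead member, after all members completed the install stage
    clu_upgrade switch

    # reboot each member in turn, lead member first
    shutdown -r now

    # after all members are back up on the patched version
    clu_upgrade clean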
The leading node was rebooted, and during its boot the two remaining nodes
crashed as the leading node attempted to run lsmbstartup.
Now we have the following:
A) We are able to boot the 3 nodes into single-user mode from halt. Any
attempt to run lsmbstartup on any of the 3 nodes fails with "LSM: Vold is
not enabled for transactions", but the nodes remain up (the exact commands
are sketched after this list).
B) A subsequent bcheckrc tries to start LSM on one of the nodes and crashes
the others.
C) If we boot only the 2 nodes that crashed during the reboot mentioned
above, we can start up the cluster, but any attempt to boot the leading
node back into the cluster results either in a panic with CLSM messages
such as "clsm_state.distribute is false" or "kgs_respond failed with error
6", or in a crash without a dump on the already-running nodes.
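
To be concrete, this is what we try from single-user mode on each node
(lsmbstartup and bcheckrc are the stock /sbin scripts; the voldctl call is
just our own diagnostic step, assuming LSM's voldctl works as documented):

    # boot to single-user mode from the console halt prompt, then:
    /sbin/lsmbstartup    # fails: "LSM: Vold is not enabled for transactions"
    /sbin/voldctl mode   # shows whether vold is enabled (our own check)
    /sbin/bcheckrc       # starting LSM here crashes the other members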
Anyone with a clue what we should try?
Rudi Gabler