Cluster hung after a NFS service modified

From: Pang Wai Man Raymond <wmpang_at_se.cuhk.edu.hk>
Date: Wed, 29 Apr 1998 17:47:35 +0800

Hi managers,

We have the cluster set up by 2x4100 running DU4.0B (patch up to 16/Dec/97)
and TCR1.4a. The mail service is set up as according to the manual so that
/var/spool/mail and /var/spool/mqueue are NFS loopback mounted to itself on
both members. In /etc/fstab, it is:

/var/spool/mail_at_mail_serv /var/spool/mail nfs rw,fg 0 0
/var/spool/mqueue_at_mail_serv /var/spool/mqueue nfs rw,fg 0 0

One day, the whole cluster was hung after the export list of a NFS service was
modified inside the "asemgr". After the whole cluster was rebooted, I repeated
the whole process again and this time it's OK. Following messages is in
daemon.log.

Apr 20 14:46:51 clust_A ASE: mc40 Agent Notice: stopping service nfs_uac_mail_msc_gds
Apr 20 14:47:26 clust_A ASE: mc40 Agent Error: /var/ase/sbin/ase_mount_action: /
usr/var/ase/mnt/nfs_uac_mail_msc_gds/var/spool: Device busy
Apr 20 14:47:26 clust_A last message repeated 9 times
Apr 20 14:47:26 clust_A ASE: mc40 Agent Error: /var/ase/sbin/ase_mount_action: U
nable to umount /var/ase/mnt/nfs_uac_mail_msc_gds/var/spool
Apr 20 14:47:27 clust_A ASE: mc40 Director Error: can't stop service
Apr 20 14:47:28 clust_A ASE: mc40 AseMgr Error: Stop with force failed - Unable
to stop service with force.

With that and other information reported to DEC, they seemed do not have any
clue and only suggest us to apply the latest patch for DU4.0B.

Does anybody know what's happening and how to prevent it from happening
again?

TIA and I'll summarize.


Regards,
Raymond
Received on Wed Apr 29 1998 - 11:48:35 NZST

This archive was generated by hypermail 2.4.0 : Wed Nov 08 2023 - 11:53:37 NZDT