SUMMARY: Cluster hung after a NFS service modified

From: Pang Wai Man Raymond <wmpang_at_se.cuhk.edu.hk>
Date: Mon, 04 May 1998 11:40:39 +0800

Hi,

Thanks for those who replied:

"Mullin, Stephen (CAP, ITS, US)" <Stephen.Mullin_at_gecits.ge.com>
Bryan Rank <bryan_at_compgen.com>
"Knut Helleb." <Knut.Hellebo_at_nho.hydro.com>

The suggestions are to use fuser or lsof to determine the processes still
using the files in that direcotry. Kill them all before continue. You may
also need to modify the user defined stop/start scripts to handle this.

Regards,
Raymond

> Hi managers,
>
> We have the cluster set up by 2x4100 running DU4.0B (patch up to 16/Dec/97)
> and TCR1.4a. The mail service is set up as according to the manual so that
> /var/spool/mail and /var/spool/mqueue are NFS loopback mounted to itself on
> both members. In /etc/fstab, it is:
>
> /var/spool/mail_at_mail_serv /var/spool/mail nfs rw,fg 0 0
> /var/spool/mqueue_at_mail_serv /var/spool/mqueue nfs rw,fg 0 0
>
> One day, the whole cluster was hung after the export list of a NFS service was
> modified inside the "asemgr". After the whole cluster was rebooted, I repeated
> the whole process again and this time it's OK. Following messages is in
> daemon.log.
>
> Apr 20 14:46:51 clust_A ASE: mc40 Agent Notice: stopping service nfs_uac_mail_msc_gds
> Apr 20 14:47:26 clust_A ASE: mc40 Agent Error: /var/ase/sbin/ase_mount_action:
> /usr/var/ase/mnt/nfs_uac_mail_msc_gds/var/spool: Device busy
> Apr 20 14:47:26 clust_A last message repeated 9 times
> Apr 20 14:47:26 clust_A ASE: mc40 Agent Error:
/var/ase/sbin/ase_mount_action: Unable to umount /var/ase/mnt/nfs_uac_mail_msc_gds/var/spool
> Apr 20 14:47:27 clust_A ASE: mc40 Director Error: can't stop service
> Apr 20 14:47:28 clust_A ASE: mc40 AseMgr Error: Stop with force failed - Unable
> to stop service with force.
>
> With that and other information reported to DEC, they seemed do not have any
> clue and only suggest us to apply the latest patch for DU4.0B.
>
> Does anybody know what's happening and how to prevent it from happening
> again?
>
> TIA and I'll summarize.
>
>
> Regards,
> Raymond
Received on Mon May 04 1998 - 05:41:54 NZST

This archive was generated by hypermail 2.4.0 : Wed Nov 08 2023 - 11:53:37 NZDT