Recoverying the Cluster Root File System

From: Alejandro García Borboroglu <agb_at_aluar.com.ar>
Date: Wed, 07 Nov 2001 09:47:50 -0300

I have problems to Recoverying the Cluster Root File System to a New Disk.
My configuration is 2 AlphaServer COMPAQ
800 5/500 C.2 MB Mem 640 MB and DS20E 500 MHz C. 4 MB Mem1024 MB with RA3000 Cabinets onTru64 Unix 5.1 kit 3
I'm trying this:
First, no disk is available to restore to that is on a shared bus accessible to all
cluster members and was part of the cluster configuration before the
problems with the root file system occurred
This procedure assumes the following:
The vdump command was used to back up the cluster root
(cluster_root, cluster_usr and cluster_var) file system.
If necessary, partition the new disk so that the partition sizes and file
system types will be appropriate once the disk is the cluster root.
Create a new domain for the new cluster root:
# mkfdmn /dev/disk/dsk5d new_root
Make a root fileset in the domain:
# mkfset new_root root
Mount the domain that will become the new cluster root:
# new_root#root /mnt
 Restore cluster root from the backup media
# vrestore -xf /home21/backup_cluster_root -D /mnt
Copy the restored cluster databases to the /etc directory:
# cd /mnt/etc
# cp dec_unid_db dec_hwc_cdb dfsc.dat /etc
Copy the restored databases from the member specific area of the
current member to /etc:
# cd /mnt/cluster/members/member1/etc
# cp dfsl.dat /etc
If one does not already exist, create a domain for the member boot disk:
# cd /etc/fdmns
# mkdir root1_domain
# ln -s /dev/disk/dsk2a
 Mount the member boot partition:
# cd /
# umount /mnt
# mount root1_domain#root /mnt (Thats's the Problem) !!! Is Imposible :(
Copy the databases from the member boot partition:
# cd /mnt/etc
# cp dec_devsw_db dec_hw_db dec_hwc_ldb dec_scsi_db /etc
Unmount member boot disk:
# cd /
# umount /mnt
Update the database .bak backup files:
# cd /etc
# for f in dec_*db ; do cp $f $f.bak ; done
Reboot the system into single-user mode using the same base disk, so
that it will use the databases that you copied to /etc.
After booting to single-user mode, remount the root as writable:
# # mount -u /
Verify and update the device database:
# dsfmgr -v -F
Use hwmgr to learn the current device naming:
# hwmgr -view hier
Do this by going to the appropriate /etc/fdmns directory, deleting
the existing link and creating new links to the current device names.
For example:
# cd /etc/fdmns/root_domain
# rm *
# ln -s /dev/disk/dsk1a
# cd /etc/fdmns/usr_domain
# rm *
# ln -s /dev/disk/dsk1g
# cd /etc/fdmns/root1_domain
# rm *
# ln -s /dev/disk/dsk3a
# cd /etc/fdmns/new_root
# rm *
# ln -s /dev/disk/dsk7b
Run the bcheckrc command to mount local file systems, particularly
/usr:
# bcheckrc
Copy the updated cluster database files onto the cluster root:
# mount new_root#root /mnt
# cd /etc
# cp dec_unid_db* dec_hwc_cdb* dfsc.dat /mnt/etc
# cp dfsl.dat /mnt/cluster/members/member1/etc
Update the cluster_root domain on the new cluster root:
# rm /mnt/etc/fdmns/cluster_root/*
# cd /etc/fdmns/new_root
# tar cf - * | (cd /mnt/etc/fdmns/cluster_root && tar xf -)
Copy the updated cluster database files to the member boot disk
# umount /mnt
# mount root29_domain#root /mnt
# cd /etc
# cp dec_devsw_db* dec_hw_db* dec_hwc_ldb*
dec_scsi_db* /mnt/etc
Use the file command to get the major/minor numbers of the
cluster_root devices. Write down these major/minor numbers.
For example:
# file /dev/disk/dsk7b
/dev/disk/dsk7b: block special (19/221)
# file /dev/disk/dsk9g
/dev/disk/dsk9g: block special (19/227)
Halt the system and reboot interactively, specifying the device major
and minor numbers of the new cluster root:
>>> boot -fl "ia"
(boot dkb200.2.0.7.0 -flags ia)
block 0 of dkb200.2.0.7.0 is a valid boot block
reading 18 blocks from dkb200.2.0.7.0
bootstrap code read in
base = 200000, image_start = 0, image_bytes = 2400
initializing HWRPB at 2000
initializing page table at fff0000
initializing machine state
setting affinity to the primary CPU
jumping to bootstrap code
Tru64 UNIX boot - Mon Jan 4 14:08:41 EDT 2000
Enter kernel_name [option_1 ... option_n]
Press Return to boot default kernel
'vmunix':vmunix cfs:cluster_root_dev1_maj=19 \
cfs:cluster_root_dev1_min=227 cfs:cluster_root_dev2_maj=19 \
cfs:cluster_root_dev1_min=221 Return
Boot the other cluster members.
run the command dsfmgr -v -F.

Any suggestions? Can anyone give me any clues on this, please.

Thanks in advance

Alejandro García Borboroglu
agb_at_aluar.com.ar
System Manager Jr.
Tecnología Informática
ALUAR Aluminio Argentino
Puerto Madryn - Argentina
Received on Wed Nov 07 2001 - 12:49:36 NZDT

This archive was generated by hypermail 2.4.0 : Wed Nov 08 2023 - 11:53:42 NZDT