Thanks to everyone for the super fast responses on this problem.
As you may recall, I was having I/O errors on one domain, affecting both
filesets within that domain.
I was getting messages in the kern.log file like the following:
Feb 13 13:21:59 sharky vmunix: advfs I/O error: setId 0x30d8dfc5.000109d8.2.8001 tag 0x0000000a.82c5u page 0
Feb 13 13:21:59 sharky vmunix: vd 1 blk 7577840 blkCnt 128
Feb 13 13:22:00 sharky vmunix: write error = 6
Feb 13 13:22:00 sharky vmunix: advfs I/O error: setId 0x30d8dfc5.000109d8.2.8001 tag 0x0000000a.82c5u page 8
and when I rebooted I got:
Feb 13 16:53:03 sharky vmunix: ADVFS EXCEPTION
Feb 13 16:53:03 sharky vmunix: Module = 36, Line = 3648
Feb 13 16:53:03 sharky vmunix:
Feb 13 16:53:03 sharky vmunix: panic (cpu 0):
Feb 13 16:53:03 sharky vmunix: syncing disks... done
But uerf was not logging any errors at all.
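
For anyone chasing similar messages, here is a rough sketch in Python of pulling the volume and block numbers out of kern.log text, to see whether the errors cluster on one volume or block range. The function names are made up for illustration and are not part of any AdvFS tool; the pattern assumes the "vd N blk N blkCnt N" layout shown in the messages above.

```python
import re
from collections import Counter

# Assumed record layout, taken from the kern.log excerpt:
#   "... vmunix: vd 1 blk 7577840 blkCnt 128"
VD_BLK = re.compile(r"vmunix:\s+vd\s+(\d+)\s+blk\s+(\d+)\s+blkCnt\s+(\d+)")


def advfs_error_blocks(log_text):
    """Return a list of (vd, blk, blkCnt) tuples, one per matching record."""
    return [tuple(int(g) for g in m.groups()) for m in VD_BLK.finditer(log_text)]


def errors_per_volume(log_text):
    """Count matching records per volume (vd) number."""
    return Counter(vd for vd, _blk, _cnt in advfs_error_blocks(log_text))


sample = """\
Feb 13 13:21:59 sharky vmunix: advfs I/O error: setId 0x30d8dfc5.000109d8.2.8001
tag 0x0000000a.82c5u page 0
Feb 13 13:21:59 sharky vmunix: vd 1 blk 7577840 blkCnt 128
Feb 13 13:22:00 sharky vmunix: write error = 6
"""

print(advfs_error_blocks(sample))
print(errors_per_volume(sample))
```

In practice the text would come from reading the kern.log file rather than from an inline sample.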
Most people suggested that the disk might have a bad block, or that I should
check that LSM was happy.
There was also a suggestion to check the firmware version on the disk.
What I did was dd the whole disk onto a spare disk and then swap the new
disk in for the old one.
The dd went OK, but after I swapped the disks over I still had the same
errors occurring.
By this time I had also received a good collection of ADVFS patches from DEC,
but none were specific to my problem.
Anyway, because only ADVFS was complaining and it no longer looked like a
H/W error, I decided to install all the patches from DEC, then vdump the
filesets, drop the domain and filesets, recreate them, and vrestore the
data back on.
And this, believe it or not, worked great: no more problems. We are back online
and have been working the domain for about 16 hours now with no more errors.
However, if it wasn't a H/W fault, why did ADVFS get lost? You would also think
that there would be an fsck-type utility to fix ADVFS problems, not just
find them.
I sent this message to the list before I went home on Tuesday night.
I came in Wednesday morning and had lots of good ideas from people to check
and do, and by Wednesday afternoon the problem was fixed.
So thanks very much to:
Peter Flack
Andrew Greer
Paul E. Rockwell
Alan Rollow
Jean-Marc VINCENT
--
John Zentveld                         Internet: jzentvel_at_scu.edu.au
UNIX VAX/VMS System Administrator     Room: B2.14
Southern Cross University             Tel: (066) 203859
Lismore NSW Australia                 Fax: (066) 203033
Received on Wed Feb 14 1996 - 23:55:04 NZDT