Strange RAID problem

From: David Gadbois <gadbois_at_cyc.com>
Date: Fri, 7 Apr 1995 00:01-0500

I have a 2100 running OSF/1 3.0 with 4 RZ28-VAs hooked to a SWXCR-EB in
a RAID 0 configuration. On two occasions, the SWXCR has reported read
errors once each on two different drives in the group. Both times the
SWXCR marked the drive as failed and took itself offline. Both times I
powered the system down, pulled the failed drive, reseated it, and used
the SWXCR configuration software to mark the failed drive as optimal.
The first time I did this the filesystem on the SWXCR mounted, and I was
able to do a level 0 vdump of the whole thing. I am in the process of
doing a second level 0 vdump after the second go-round.

Now, the drives are only a couple of months old and have been running
fine for the several weeks I have had the system. It seems awfully
suspicious that lightning would strike twice like this. So, my current
assumption is that something else is wrong, maybe with the SWXCR, maybe
with the SCSI cabling, maybe with the internal StorageWorks shelf the
drives are on. Has anyone seen similar failures and their solutions?
Or suggest potential problems and ways to troubleshoot them? My DEC CE
is not, alas, familiar with the StorageWorks setups, so I am afraid I am
in for a component swap-fest (with its attendant hassles and wastes of
time) unless I can narrow the problem down for him.

Thanks,
--David Gadbois
Received on Fri Apr 07 1995 - 01:02:48 NZST

This archive was generated by hypermail 2.4.0 : Wed Nov 08 2023 - 11:53:45 NZDT