SUMMARY : Scsi cam errors

From: Gary Menna <G.Menna_at_isu.usyd.edu.au>
Date: Mon, 31 May 1999 08:29:00 +1000 (EST)

Question :

Alpha 4100A Ver 4.0D patch#3 HSZ70 rz1df-cb

        We have been getting scsi cam errors that are basically
        resets . They happen on different 'devices' on different
        RAID 5 sets . These raid sets have been partitioned off
        and the logical disks thrown into LSM to use as raw
        (sybase) database devices. The engineer is returning
        to replace the controller , cable etc .
        My question is , could these resets cause a database
        corruption . I am getting NO errors in LSM or anywhere
        else . Our database has corrupted twice in the last week .
        Sybase says yes , it's a hardware problem and this must
        be it . I don't see how .


Thanks to Allan (Borg) Rollow for the following :


This particular reset happened behind the hosts's back and
was completely in the controller. The HSZ must have gotten
pretty desparate to think that a bus reset was the only way
to get the particular port to behave. Whatever happened
before this to result in a bus reset, could certainly have
corrupted data. It could still be an insane device on that
particular bus, or a bad backplane, but a bad controller is
the easiest thing to swap.


It does indeed seem to be caused by the controller .
A swap fixed the resets . The database still corrupts
but that's another problem .



Thanking you ,

Gary Menna E-Mail g.menna_at_isu.usyd.edu.au
Information Technology Services Phone +61 2 9351-6360
University of Sydney (G05) Fax +61 2 9351-7711


        *-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*
        * *
        * I do not want anyone to want for me *
        * I want to want for myself . *
        * - Yevgeny Zamyatin *
        * *
        *-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*
                Bring on the Liquor Pops
Received on Sun May 30 1999 - 22:31:13 NZST

This archive was generated by hypermail 2.4.0 : Wed Nov 08 2023 - 11:53:39 NZDT