new disk, scsi-errors + crash (fwd)

From: Bob Parkinson <rwplists_at_omni.ac.uk>
Date: Fri, 02 Jul 1999 10:57:56 +0100 (BST)

Dear All,

I've added a new disk to my alphaserver 1200 (4.0D), its the RZ1CB-CS in
the list below.

Jul 1 11:03:23 omni vmunix: scsi1 at isp0 slot 0
Jul 1 11:03:23 omni vmunix: rz8 at scsi1 target 0 lun 0 (LID=1) (DEC
RZ1CB-CA (C) DEC LYJ0) (Wide16)
Jul 1 11:03:23 omni vmunix: rz9 at scsi1 target 1 lun 0 (LID=2) (DEC
RZ28M (C) DEC 0616)
Jul 1 11:03:23 omni vmunix: rz10 at scsi1 target 2 lun 0 (LID=3) (DEC
RZ28 (C) DEC D41C)
Jul 1 11:03:23 omni vmunix: rz11 at scsi1 target 3 lun 0 (LID=4) (DEC
RZ28M (C) DEC 0616)
Jul 1 11:03:24 omni vmunix: tz12 at scsi1 target 4 lun 0 (LID=5) (DEC
TLZ07 (C)DEC 553B)
Jul 1 11:03:24 omni vmunix: rz13 at scsi1 target 5 lun 0 (LID=6) (DEC
RZ28D (C) DEC 0010)
Jul 1 11:03:24 omni vmunix: rz14 at scsi1 target 6 lun 0 (LID=7) (DEC
RZ1CB-CS (C) DEC 0844) (Wide16)


I disklabel'd it, and newfs'd it. I then started to generate some data
and write to the new disk.

About 6 hours later the whole machine crashed. I restarted the machine,
without the new disk. THe next day I mounted the new disk by hand, and
started to fsck it. THe whole machine locked up for minutes at a time,
till I was able to kill the fsck, and then the box came back to life
normally.

I'm getting CAM errors reported in the kern.log like this:

Jul 1 12:50:39 omni vmunix: cam_logger: CAM_ERROR packet
Jul 1 12:52:19 omni vmunix: cam_logger: bus 1 target 6 lun 0
Jul 1 12:52:19 omni vmunix: ss_perform_timeout
Jul 1 12:52:19 omni vmunix: timeout on disconnected request
Jul 1 12:52:19 omni vmunix: Active CCB at time of error
Jul 1 12:52:19 omni vmunix: cam_logger: CAM_ERROR packet
Jul 1 12:52:19 omni vmunix: cam_logger: bus 1 target 3 lun 0
Jul 1 12:52:19 omni vmunix: ss_perform_timeout
Jul 1 12:52:19 omni vmunix: timeout on disconnected request
Jul 1 12:52:20 omni vmunix: Active CCB at time of error

and out of uerf like this:

********************************* ENTRY 211.
*********************************

----- EVENT INFORMATION -----

EVENT CLASS ERROR EVENT
OS EVENT TYPE 199. CAM SCSI
SEQUENCE NUMBER 212.
OPERATING SYSTEM DEC OSF/1
OCCURRED/LOGGED ON Thu Jul 1 12:49:05 1999
OCCURRED ON SYSTEM omni
SYSTEM ID x00070016
SYSTYPE x00000000

----- UNIT INFORMATION -----

CLASS x0000 DISK
SUBSYSTEM x0000 DISK
BUS # x0001
                              x0070 LUN x0
                                        TARGET x6


Various TARGETS are reported.

We had a couple of visits form compaq engineers recently, and one of them
mentioned that my SCSI bus was not terminated, and when he came back to
replace a mother borad, he would bring some terminators. Unfortunately, I
got a different chap the next day, and no terminators. Had had no disk
problems so I was not immediately concerned.


Are no terminators for my SCSI bus going to cause problems/symptoms as
above, or may there be another problem.


Thanks,

Bob
Received on Fri Jul 02 1999 - 10:00:54 NZST

This archive was generated by hypermail 2.4.0 : Wed Nov 08 2023 - 11:53:39 NZDT