Dear Managers,
I am, at present quite confused by a problem on an AlphaServer 8200 system
running Digital UNIX 4.0D and TruCluster ASE version 1.5. The system is
configured with 2 GB of memory and shares 160 GB of RAID 0+1 disk storage in a
RAID Array 7000 with with another AlphaServer 8200. The problem I am getting is
that recently the node "sampson" has started making a large series of errorlog
entries. The output of uerf from this looks like the following.
********************************* ENTRY 1. *********************************
----- EVENT INFORMATION -----
EVENT CLASS ERROR EVENT
OS EVENT TYPE 199. CAM SCSI
SEQUENCE NUMBER 55327.
OPERATING SYSTEM DEC OSF/1
OCCURRED/LOGGED ON Thu Jan 14 10:51:16 1999
OCCURRED ON SYSTEM sampson
SYSTEM ID x0000000C CPU TYPE: DEC 7000
SYSTYPE x00000000
PROCESSOR COUNT 2.
PROCESSOR WHO LOGGED x0000000C
----- UNIT INFORMATION -----
CLASS x0000 DISK
SUBSYSTEM x0000 DISK
BUS # x0008
x0209 LUN x1
TARGET x1
----- CAM STRING -----
ROUTINE NAME cdisk_op_spin
----- CAM STRING -----
Unit Reserved
----- CAM STRING -----
ERROR TYPE Information Message Detected
_(recovered)
----- CAM STRING -----
DEVICE NAME DEC HSZ70 V70Z
-----------------------------------------------------------------------------------
It seems that a similar error entry gets logged approximately every second and
they seem to all be singling out the devices at [SCSI Bus#, Target, Lun]
[8,1,1], [8,1,2], [8,2,0], [8,3,0], [8,3,1], [8,2,1], [8,1,1] and so on for each
of the devices attached to that bus.
I was wondering if anyone could tell me what this kind of error entry means.
What is the cdisk_op_spin routine? Should I be alarmed?
Yours sincerely,
Robert Honore.
Received on Fri Jan 15 1999 - 19:36:58 NZDT