Configuration: 2 X 2100, du3.2d, ase1.3, lsm, advfs
My system was originally set up as an nfs file server using ADVfs, with
each ADVfs 'volume' being an LSM "volume". Each of the LSM volumes
was a single (HSZ40) raid disk. I've decided our configuration would be
a lot simpler if we eliminated LSM from our configuration and handed off
the RAID disks directly to ADVfs. I've been able to modify 2 of the ASE
services and now notice some unsettling/disturbing/unnerving behaviour.
Problem 1) "Extra" disks show up when using monitor. My HSZ40 disks are
rzb21-rzg21 for bus 2 and rzb29-rzg29 for bus 3. I'm now
seeing 'rz21' which I don't remember seeing before, and earlier
this week I was seeing an 'rz29' which seems to have vanished.
I *think* the rz29 went away when I did a 'voldisk rm rzb29'
but I'm not certain. The rz21 is especially puzzling since
the 2 changes I made were with rzb29 and rzg29--i.e. nothing
on bus 2 changed.
Problem 2) I used "voldisk list" as a quick troubleshooting command to
tell me which system thought it was controlling which disks.
I'm switching over to "file -f disks.dat" which displays
something along the lines of:
/dev/rrzb21a: character special (8/37952) SCSI #2 HSZ40 disk #169 (SCSI ID #5)
/dev/rrzb29a: character special (8/54336) SCSI #3 HSZ40 disk #233 (SCSI ID #5) offline
/dev/rrzc21a: character special (8/38016) SCSI #2 HSZ40 disk #170 (SCSI ID #5)
/dev/rrzc29a: character special (8/54400) SCSI #3 HSZ40 disk #234 (SCSI ID #5)
/dev/rrzd21a: character special (8/38080) SCSI #2 HSZ40 disk #171 (SCSI ID #5)
/dev/rrzd29a: character special (8/54464) SCSI #3 HSZ40 disk #235 (SCSI ID #5)
/dev/rrze21a: character special (8/38144) SCSI #2 HSZ40 disk #172 (SCSI ID #5) offline
/dev/rrze29a: character special (8/54528) SCSI #3 HSZ40 disk #236 (SCSI ID #5)
/dev/rrzf21a: character special (8/38208) SCSI #2 HSZ40 disk #173 (SCSI ID #5) offline
/dev/rrzf29a: character special (8/54592) SCSI #3 HSZ40 disk #237 (SCSI ID #5) offline
/dev/rrzg21a: character special (8/38272) SCSI #2 HSZ40 disk #174 (SCSI ID #5) offline
/dev/rrzg29a: character special (8/54656) SCSI #3 HSZ40 disk #238 (SCSI ID #5) offline
Anything that shows up as "offline" is being controlled by the other system.
Starting today, I've been seeing some output that looks like:
/dev/rrzb21a: character special (8/37952) SCSI #2 HSZ40 disk #169 (SCSI ID #5) errors = 0/10 offline
Both systems have given clean disk status today and both systems at other times
have reported "errors=n/mm" messages like the above line shows for some of the
disks. When I start seeing things like "CAM_UNEXP_BUSFREE", "CAM_SIM_QFRZDIS",
& "SCSI_STAT_RESERVATION_CONFLICT", I don't get a warm fuzzy feeling inside.
Does anyone have an idea why I'm seeing errors now and know where my "extra"
rz21 drive comes from? TIA Jeff Beck
---------------------------------------------------------------------------
A snapshot of uerf for those interested shows:
uerf version 4.2-011 (122)
********************************* ENTRY 1. *********************************
----- EVENT INFORMATION -----
EVENT CLASS ERROR EVENT
OS EVENT TYPE 199. CAM SCSI
SEQUENCE NUMBER 51.
OPERATING SYSTEM DEC OSF/1
OCCURRED/LOGGED ON Sun Aug 3 13:41:17 1997
OCCURRED ON SYSTEM decfs1
SYSTEM ID x00060009 CPU TYPE: DEC 2100
SYSTYPE x00000000
----- UNIT INFORMATION -----
CLASS x001F UNKNOWN
SUBSYSTEM x0000 DISK
BUS # x0003
x00FF LUN x7
TARGET x7
----- CAM STRING -----
ROUTINE NAME targ_send_comp()
----- CAM STRING -----
Target send failed
----- CAM STRING -----
ERROR TYPE Soft Error Detected (recovered)
----- CAM STRING -----
Active CCB at time of error
ERROR - os_std, os_type = 11, std_type = 10
----- ENT_CCB_SCSIIO -----
*MY ADDR x06E93F28
CCB LENGTH x00C0
FUNC CODE x01
CAM_STATUS x0013 CAM_UNEXP_BUSFREE
PATH ID 3.
TARGET ID 7.
TARGET LUN 7.
CAM FLAGS x00000480
CAM_DIR_OUT
CAM_SIM_QFRZDIS
*PDRV_PTR x06E93C28
*NEXT_CCB x00000000
*REQ_MAP x00000000
VOID (*CAM_CBFCNP)() x0052DC40
*DATA_PTR x03D26B40
DXFER_LEN x0000008C
*SENSE_PTR x06E93C50
SENSE_LEN xA4
CDB_LEN x06
SGLIST_CNT x0000
CAM_SCSI_STATUS x0000 SCSI_STAT_GOOD
SENSE_RESID x00
RESID x00000000
CAM_CDB_IO x000000000000028C0000E00A
CAM_TIMEOUT x00000006
MSGB_LEN x0000
VU_FLAGS x0000
TAG_ACTION x00
----- ENT_SENSE_DATA -----
ERROR CODE x0000 CODE x0
SEGMENT x00
SENSE KEY x0000 NO SENSE
INFO BYTE 3 x00
INFO BYTE 2 x00
INFO BYTE 1 x00
INFO BYTE 0 x00
ADDITION LEN x00
CMD SPECIFIC 3 x00
CMD SPECIFIC 2 x00
CMD SPECIFIC 1 x00
CMD SPECIFIC 0 x00
ASC x00
ASQ x00
FRU x00
SENSE SPECIFIC x000000
ADDITIONAL SENSE
0000: 00000000 00000000 00000000 00000000 *................*
0010: 00000000 00000000 00000000 00000000 *................*
0020: 00000000 00000000 00000000 00000000 *................*
0030: 00000000 00000000 00000000 00000000 *................*
0040: 00000000 00000000 00000000 00000000 *................*
0050: 00000000 00000000 00000000 00000000 *................*
0060: 00000000 00000000 00000000 00000000 *................*
0070: 00000000 00000000 00000000 00000000 *................*
0080: 00000000 00000000 00000000 00000000 *................*
0090: 00000000 00000000 7E250000 00005E3C *..........%~<^..*
00A0: 00000000 *.... *
********************************* ENTRY 2. *********************************
----- EVENT INFORMATION -----
EVENT CLASS ERROR EVENT
OS EVENT TYPE 199. CAM SCSI
SEQUENCE NUMBER 52.
OPERATING SYSTEM DEC OSF/1
OCCURRED/LOGGED ON Sun Aug 3 13:41:21 1997
OCCURRED ON SYSTEM decfs1
SYSTEM ID x00060009 CPU TYPE: DEC 2100
SYSTYPE x00000000
----- UNIT INFORMATION -----
CLASS x001F UNKNOWN
SUBSYSTEM x0000 DISK
BUS # x0003
x00FF LUN x7
TARGET x7
----- CAM STRING -----
ROUTINE NAME targ_send_comp()
----- CAM STRING -----
Target send failed
----- CAM STRING -----
ERROR TYPE Soft Error Detected (recovered)
----- CAM STRING -----
Active CCB at time of error
ERROR - os_std, os_type = 11, std_type = 10
----- ENT_CCB_SCSIIO -----
*MY ADDR x0B74C328
CCB LENGTH x00C0
FUNC CODE x01
CAM_STATUS x0013 CAM_UNEXP_BUSFREE
PATH ID 3.
TARGET ID 7.
TARGET LUN 7.
CAM FLAGS x00000480
CAM_DIR_OUT
CAM_SIM_QFRZDIS
*PDRV_PTR x0B74C028
*NEXT_CCB x00000000
*REQ_MAP x00000000
VOID (*CAM_CBFCNP)() x0052DC40
*DATA_PTR x03D26B40
DXFER_LEN x0000008C
*SENSE_PTR x0B74C050
SENSE_LEN xA4
CDB_LEN x06
SGLIST_CNT x0000
CAM_SCSI_STATUS x0000 SCSI_STAT_GOOD
SENSE_RESID x00
RESID x00000000
CAM_CDB_IO x000000000000028C0000E00A
CAM_TIMEOUT x00000006
MSGB_LEN x0000
VU_FLAGS x0000
TAG_ACTION x00
----- ENT_SENSE_DATA -----
ERROR CODE x0000 CODE x0
SEGMENT x00
SENSE KEY x0000 NO SENSE
INFO BYTE 3 x00
INFO BYTE 2 x00
INFO BYTE 1 x00
INFO BYTE 0 x00
ADDITION LEN x00
CMD SPECIFIC 3 x00
CMD SPECIFIC 2 x00
CMD SPECIFIC 1 x00
CMD SPECIFIC 0 x00
ASC x00
ASQ x00
FRU x00
SENSE SPECIFIC x000000
ADDITIONAL SENSE
0000: 00000000 00000000 00000000 00000000 *................*
0010: 00000000 00000000 00000000 00000000 *................*
0020: 00000000 00000000 00000000 00000000 *................*
0030: 00000000 00000000 00000000 00000000 *................*
0040: 00000000 00000000 00000000 00000000 *................*
0050: 00000000 00000000 00000000 00000000 *................*
0060: 00000000 00000000 00000000 00000000 *................*
0070: 00000000 00000000 00000000 00000000 *................*
0080: 00000000 00000000 00000000 00000000 *................*
0090: 00000000 00000000 7E250000 00005E3C *..........%~<^..*
00A0: 00000000 *.... *
********************************* ENTRY 3. *********************************
----- EVENT INFORMATION -----
EVENT CLASS ERROR EVENT
OS EVENT TYPE 199. CAM SCSI
SEQUENCE NUMBER 53.
OPERATING SYSTEM DEC OSF/1
OCCURRED/LOGGED ON Sun Aug 3 13:41:23 1997
OCCURRED ON SYSTEM decfs1
SYSTEM ID x00060009 CPU TYPE: DEC 2100
SYSTYPE x00000000
----- UNIT INFORMATION -----
CLASS x0000 DISK
SUBSYSTEM x0000 DISK
BUS # x0003
x00E9 LUN x1
TARGET x5
----- CAM STRING -----
ROUTINE NAME cdisk_op_spin
----- CAM STRING -----
Unit Reserved
----- CAM STRING -----
ERROR TYPE Information Message Detected
_(recovered)
----- CAM STRING -----
DEVICE NAME DEC HSZ4
----- CAM STRING -----
Active CCB at time of error
----- CAM STRING -----
CCB request completed with an error
ERROR - os_std, os_type = 11, std_type = 10
----- ENT_CCB_SCSIIO -----
*MY ADDR x0FF38728
CCB LENGTH x00C0
FUNC CODE x01
CAM_STATUS x0004 CAM_REQ_CMP_ERR
PATH ID 3.
TARGET ID 5.
TARGET LUN 1.
CAM FLAGS x000004C0
CAM_DIR_NONE
CAM_SIM_QFRZDIS
*PDRV_PTR x0FF38428
*NEXT_CCB x00000000
*REQ_MAP x00000000
VOID (*CAM_CBFCNP)() x00472470
*DATA_PTR x00000000
DXFER_LEN x00000000
*SENSE_PTR x0FF38450
SENSE_LEN xA0
CDB_LEN x06
SGLIST_CNT x0000
CAM_SCSI_STATUS x0018 SCSI_STAT_RESERVATION_CONFLICT
SENSE_RESID x00
RESID x00000000
CAM_CDB_IO x000000000000000000000000
CAM_TIMEOUT x00000014
MSGB_LEN x0000
VU_FLAGS x0000
TAG_ACTION x00
********************************* ENTRY 4. *********************************
----- EVENT INFORMATION -----
EVENT CLASS ERROR EVENT
OS EVENT TYPE 199. CAM SCSI
SEQUENCE NUMBER 54.
OPERATING SYSTEM DEC OSF/1
OCCURRED/LOGGED ON Sun Aug 3 13:41:23 1997
OCCURRED ON SYSTEM decfs1
SYSTEM ID x00060009 CPU TYPE: DEC 2100
SYSTYPE x00000000
----- UNIT INFORMATION -----
CLASS x0000 DISK
SUBSYSTEM x0000 DISK
BUS # x0003
x00E9 LUN x1
TARGET x5
----- CAM STRING -----
ROUTINE NAME cdisk_complete
----- CAM STRING -----
Retries Exhausted
----- CAM STRING -----
ERROR TYPE Hard Error Detected
----- CAM STRING -----
DEVICE NAME DEC HSZ4
----- CAM STRING -----
Active CCB at time of error
----- CAM STRING -----
CCB request completed with an error
ERROR - os_std, os_type = 11, std_type = 10
----- ENT_CCB_SCSIIO -----
*MY ADDR x0767CB28
CCB LENGTH x00C0
FUNC CODE x01
CAM_STATUS x0004 CAM_REQ_CMP_ERR
PATH ID 3.
TARGET ID 5.
TARGET LUN 1.
CAM FLAGS x00000442
CAM_QUEUE_ENABLE
CAM_DIR_IN
CAM_SIM_QFRZDIS
*PDRV_PTR x0767C828
*NEXT_CCB x00000000
*REQ_MAP x0077D8B8
VOID (*CAM_CBFCNP)() x00472470
*DATA_PTR x8F7B6000
DXFER_LEN x00000200
*SENSE_PTR x0767C850
SENSE_LEN xA0
CDB_LEN x06
SGLIST_CNT x0000
CAM_SCSI_STATUS x0018 SCSI_STAT_RESERVATION_CONFLICT
SENSE_RESID x00
RESID x00000200
CAM_CDB_IO x000000000000000100000008
CAM_TIMEOUT x0000003C
MSGB_LEN x0000
VU_FLAGS x4000
TAG_ACTION x20
********************************* ENTRY 5. *********************************
Received on Mon Aug 04 1997 - 01:38:01 NZST