[Q] Disk errors

From: <mortimer_at_physics.uq.edu.au>
Date: Tue, 14 Oct 1997 08:06:55 +1000

Hi DU Admins

Our AlphaStation 200 4/233 registered 9 CAM SCSI bad block errors
from the same disk in one day. This disk is used for /usr/local
and secondary swap. I've included two error log entries below.

Does this indicate the disk is going bad? Is there some other
diagnostic I can run to check the disk's condition?

I checked the archives and found several questions along the
same lines as this but no summaries.

Thanks
Ian

_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/
_/ Ian Mortimer _/
_/ mortimer_at_physics.uq.edu.au ,-_|\ Department of Physics _/
_/ Tel: +61 7 3365 3436 / *\ University of Queensland _/
_/ Fax: +61 7 3365 1242 \_,-._/ St. Lucia, Brisbane _/
_/ v Queensland, Australia 4072 _/
_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/
Disclaimer: Any opinions expressed are my own.


********************************* ENTRY 3. ********************************
*

----- EVENT INFORMATION -----

EVENT CLASS ERROR EVENT
OS EVENT TYPE 199. CAM SCSI
SEQUENCE NUMBER 7.
OPERATING SYSTEM DEC OSF/1
:OCCURRED/LOGGED ON Wed Oct 8 16:04:26 1997
OCCURRED ON SYSTEM maxwell
SYSTEM ID x0006000D CPU TYPE: DEC 7000
SYSTYPE x00000000

----- UNIT INFORMATION -----

CLASS x0000 DISK
SUBSYSTEM x0000 DISK
BUS # x0000
                              x0018 LUN x0
                                        TARGET x3

----- CAM STRING -----

ROUTINE NAME cdisk_bbr_done

----- CAM STRING -----

                                        cdisk_bbr_reassign: BBR complete bad
                                         _block number: 1307680

----- CAM STRING -----

ERROR TYPE Soft Error Detected (recovered)

----- CAM STRING -----

DEVICE NAME DEC RZ28

----- CAM STRING -----

                                        Active CCB at time of error

----- CAM STRING -----

                                        CCB request completed w/out error
ERROR - os_std, os_type = 11, std_type = 10


----- ENT_CCB_SCSIIO -----

*MY ADDR x0371F328
CCB LENGTH x00C0
FUNC CODE x01
CAM_STATUS x0001 CAM_REQ_CMP
PATH ID 0.
TARGET ID 3.
TARGET LUN 0.
CAM FLAGS x00001C82
                                        CAM_QUEUE_ENABLE
                                        CAM_DIR_OUT
                                        CAM_SIM_QFRZDIS
                                        CAM_SIM_QFREEZE
                                        CAM_SIM_QHEAD
*PDRV_PTR x0371F028
*NEXT_CCB x00000000
*REQ_MAP x00000000
:VOID (*CAM_CBFCNP)() x00470B00
*DATA_PTR x0111D600
DXFER_LEN x00000008
*SENSE_PTR x0371F050
SENSE_LEN x40
CDB_LEN x06
SGLIST_CNT x0000
CAM_SCSI_STATUS x0000 SCSI_STAT_GOOD
SENSE_RESID x00
RESID x00000000
CAM_CDB_IO x000000000000000000000007
CAM_TIMEOUT x0000012C
MSGB_LEN x0000
VU_FLAGS x0000
TAG_ACTION x20

********************************* ENTRY 4. ********************************
*

----- EVENT INFORMATION -----

EVENT CLASS ERROR EVENT
OS EVENT TYPE 199. CAM SCSI
SEQUENCE NUMBER 6.
OPERATING SYSTEM DEC OSF/1
OCCURRED/LOGGED ON Wed Oct 8 16:03:54 1997
OCCURRED ON SYSTEM maxwell
SYSTEM ID x0006000D CPU TYPE: DEC 7000
SYSTYPE x00000000

----- UNIT INFORMATION -----

CLASS x0000 DISK
SUBSYSTEM x0000 DISK
BUS # x0000
                              x0018 LUN x0
                                        TARGET x3

----- CAM STRING -----

ROUTINE NAME cdisk_bbr_done

----- CAM STRING -----

                                        cdisk_bbr: Not ECC Correctable Error
                                         _bad block number: 1129554

----- CAM STRING -----

ERROR TYPE Soft Error Detected (recovered)

----- CAM STRING -----

DEVICE NAME DEC RZ28

----- CAM STRING -----

                                        Active CCB at time of error

:----- CAM STRING -----

                                        CCB request completed with an error
ERROR - os_std, os_type = 11, std_type = 10


----- ENT_CCB_SCSIIO -----

*MY ADDR x05F98728
CCB LENGTH x00C0
FUNC CODE x01
CAM_STATUS x0084 CAM_REQ_CMP_ERR
                                        AUTOSNS_VALID
PATH ID 0.
TARGET ID 3.
TARGET LUN 0.
CAM FLAGS x00000482
                                        CAM_QUEUE_ENABLE
                                        CAM_DIR_OUT
                                        CAM_SIM_QFRZDIS
*PDRV_PTR x05F98428
*NEXT_CCB x00000000
*REQ_MAP x0068B9E0
VOID (*CAM_CBFCNP)() x0046F6B0
*DATA_PTR x87C76000
DXFER_LEN x00002000
*SENSE_PTR x05F98450
SENSE_LEN x40
CDB_LEN x06
SGLIST_CNT x0000
CAM_SCSI_STATUS x0002 SCSI_STAT_CHECK_CONDITION
SENSE_RESID x2E
RESID x00000000
CAM_CDB_IO x00000000000000104F3C110A
CAM_TIMEOUT x0000003C
MSGB_LEN x0000
VU_FLAGS x0000
TAG_ACTION x20

----- CAM STRING -----

                                        Error, exception, or abnormal
                                         _condition

----- CAM STRING -----

                                        RECOVERED ERROR - Recovery action
                                         _performed

----- ENT_SENSE_DATA -----

ERROR CODE x0070 CODE x70
SEGMENT x00
SENSE KEY x0001 RECOVER ERR
INFO BYTE 3 x00
INFO BYTE 2 x11
INFO BYTE 1 x3C
INFO BYTE 0 x52
:ADDITION LEN x0A
CMD SPECIFIC 3 x00
CMD SPECIFIC 2 x00
CMD SPECIFIC 1 x06
CMD SPECIFIC 0 x00
ASC x02
ASQ x00
FRU x02
SENSE SPECIFIC x000000
ADDITIONAL SENSE
0000: 00000000 00000000 00000000 00000000 *................*
0010: 00000000 00000000 00000000 00000000 *................*
0020: 00000000 00000000 00000000 00000000 *................*
0030: 7E250000 00005E3C 00000000 00000000 *..%~<^..........*
Received on Tue Oct 14 1997 - 00:24:06 NZDT

This archive was generated by hypermail 2.4.0 : Wed Nov 08 2023 - 11:53:36 NZDT