Crash on DPW600au

From: mod alliv <bucki_98_at_yahoo.com>
Date: Wed, 04 Aug 1999 08:47:23 -0700 (PDT)

Hi Managers,
After a week on holiday I came back to find that our DPW 600au had
crashed. It seems to be a hardware error during an I/O operation on a
disk, but the disk was /
....

The crash data was:
panic_string:"k_mem_fault: IO error during kernel pagein"
msg_bufc = "Alpha boot: available memory from 0xdee000 to 0xfffe000
Digital UNIX V4.0D (Rev. 878); Fri Aug 7 10:32:02 GMT 1998
physical memory = 256.00 megabytes.
available memory = 242.22 megabytes.
using 975 buffers containing 7.61 megabytes of memory
Digital Personal WorkStation 600au
Firmware revision: 6.8-20
.
.
.
rz16 at scsi2 target 0 lun 0 (LID=1) (DEC RZ1CC-BA (C) DEC 880F)
(Wide16)
rz17 at scsi2 target 1 lun 0 (LID=2) (DEC RZ1CB-BS (C) DEC 0818)
(Wide16)
.
.cam_logger: CAM_ERROR packet
cam_logger: bus 2 target 0 lun 0
ss_perform_timeout
timeout on disconnected request
cam_logger: CAM_ERROR packet
cam_logger: bus 2 target 0 lun 0
cdisk_rec_error
Recovery Failed
Hard Error Detected
DEC RZ1CC-BA (C) DEC880F
Active CCB at time of error
Command timed out
<6>Defering I/O (errno 5) for block(0x65980, 0x65980) on device 8,32774

etc...
----------------------
the device 8,32774 is rz16a .... (/) :-(
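
The link between the cam_logger lines and the rz16 name can be checked
mechanically. What follows is a minimal sketch in Python, assuming the
convention that an rz unit number is bus * 8 + target. That assumption
matches the boot messages above (scsi2 target 0 is rz16, scsi2 target 1
is rz17), but it is not taken from any vendor documentation. The function
names are hypothetical and the sample text is copied from the log above.

```python
import re

# Assumption: rz unit number = bus * 8 + target, consistent with the boot
# messages (scsi2 target 0 -> rz16, scsi2 target 1 -> rz17).
TARGETS_PER_BUS = 8

# Matches lines such as "cam_logger: bus 2 target 0 lun 0".
CAM_LINE = re.compile(r"cam_logger: bus (\d+) target (\d+) lun (\d+)")


def rz_unit(bus: int, target: int) -> str:
    """Return the rz disk name for a SCSI bus/target pair."""
    return f"rz{bus * TARGETS_PER_BUS + target}"


def disks_with_cam_errors(log_text: str) -> dict[str, int]:
    """Count cam_logger bus/target/lun lines per rz disk name."""
    counts: dict[str, int] = {}
    for match in CAM_LINE.finditer(log_text):
        bus, target, _lun = (int(group) for group in match.groups())
        name = rz_unit(bus, target)
        counts[name] = counts.get(name, 0) + 1
    return counts


# Excerpt of the message buffer quoted in this post.
SAMPLE = """\
.cam_logger: CAM_ERROR packet
cam_logger: bus 2 target 0 lun 0
ss_perform_timeout
timeout on disconnected request
cam_logger: CAM_ERROR packet
cam_logger: bus 2 target 0 lun 0
cdisk_rec_error
"""

if __name__ == "__main__":
    print(disks_with_cam_errors(SAMPLE))  # {'rz16': 2}
```

Both CAM_ERROR packets in the excerpt point at the same disk, which is
the one the deferred I/O message refers to.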

Using uerf -R I found the following entries (in time order):

##############ENTRY 9.
----- EVENT INFORMATION -----
EVENT CLASS ERROR EVENT
OS EVENT TYPE 199. CAM SCSI
SEQUENCE NUMBER 84.
OPERATING SYSTEM DEC OSF/1
OCCURRED/LOGGED ON Fri Jul 30 23:31:11 1999
----- UNIT INFORMATION -----
CLASS x0022 DEC SIM
SUBSYSTEM x0000 DISK
BUS # x0002
                              x0080 LUN x0
                                       TARGET x0
##############ENTRY 8
----- EVENT INFORMATION -----

EVENT CLASS ERROR EVENT
OS EVENT TYPE 199. CAM SCSI
OCCURRED/LOGGED ON Fri Jul 30 23:35:35 1999
----- UNIT INFORMATION -----
CLASS x0000 DISK
SUBSYSTEM x0000 DISK
BUS # x0002
                              x0080 LUN x0
                                        TARGET x0

################# ENTRY 7.
   same as the previous entry, at 23:35:35

################## ENTRY 6.
----- EVENT INFORMATION -----
EVENT CLASS ERROR EVENT
OS EVENT TYPE 302. PANIC
SEQUENCE NUMBER 112.
OPERATING SYSTEM DEC OSF/1
OCCURRED/LOGGED ON Fri Jul 30 23:35:35 1999
OCCURRED ON SYSTEM catop-1
SYSTEM ID x0007001E
SYSTYPE x00000000
MESSAGE panic (cpu 0): k_mem_fault: IO error
                             _during kernel pagein
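
The timestamps in the uerf entries give the interval between the first
CAM error and the panic. A small sketch in Python follows. The event list
is transcribed by hand from the entries above, the helper name is
hypothetical, and the format string assumes English day and month names
(the C locale).

```python
from datetime import datetime

# Layout of the "OCCURRED/LOGGED ON" timestamps in the uerf output.
# Assumption: English day/month abbreviations, i.e. the C locale.
UERF_TIME_FORMAT = "%a %b %d %H:%M:%S %Y"

# (entry number, event type, timestamp), transcribed from the entries above.
EVENTS = [
    (9, "CAM SCSI", "Fri Jul 30 23:31:11 1999"),
    (8, "CAM SCSI", "Fri Jul 30 23:35:35 1999"),
    (6, "PANIC", "Fri Jul 30 23:35:35 1999"),
]


def seconds_between(first: str, last: str) -> int:
    """Return the number of whole seconds from `first` to `last`."""
    start = datetime.strptime(first, UERF_TIME_FORMAT)
    end = datetime.strptime(last, UERF_TIME_FORMAT)
    return int((end - start).total_seconds())


if __name__ == "__main__":
    first_error = EVENTS[0][2]
    panic = EVENTS[-1][2]
    print(seconds_between(first_error, panic))  # 264
```

So the panic was logged 4 minutes 24 seconds after the first CAM error,
in the same second as the second one.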

------------------------------------------------------

What do you think?
Is it really a hardware error :-( ?
Do I have to call Digital/Compaq hardware support :-( ?

Since Fri Jul 30 there have been no more hard errors,
but there is "excessive page-in activity" on the system....

Could there be a relation between the crash and the page-in activity?

Any and all help is greatly appreciated.

Domenico Villa E-mail bucki_98_at_yahoo.com
                                cnv_at_ecmwf.int
Italian Met Service

Received on Wed Aug 04 1999 - 15:45:12 NZST

This archive was generated by hypermail 2.4.0 : Wed Nov 08 2023 - 11:53:39 NZDT