4.0b+pl6 / DEC3000/M700 - bad hardware?

From: Brian C Hill <bchill_at_bch.net>
Date: Sat, 12 Sep 1998 10:26:57 -0700 (PDT)

        Do I have some bad memory or a bad CPU? Where can I look
these things up? I have had a few of these crashes now.

   _____________________________________________________________________
  / Brian C. Hill bchill_at_bch.net http://brian.bch.net \
  | Systems Programmer University of California, Davis |
  | Unix Specialist BCH Technical Services |
------------------------------------------------------------------------------
kern.log:
------------------------------------------------------------------------------
Sep 11 02:06:42 dilbert vmunix: MACHINE CHECK type 0x660 Machine check abort
Sep 11 02:06:42 dilbert vmunix: ptr[0-1] = 0000000100000088 0000000000008a5d
Sep 11 02:06:42 dilbert vmunix: ptr[2-3] = 000486f800000004 000000014001d494
Sep 11 02:06:42 dilbert vmunix: ptr[4-5] = 00000001400131a1 000000000
.
.
.
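
Since there have been a few of these crashes, it may help to count the
machine-check lines in kern.log and see whether the type code is always
the same. The following is only a minimal sketch: the function name
machine_checks and the regular expression are illustrative assumptions
based on the single log line quoted above, not on any documented vmunix
log format.

```python
import re
from collections import Counter

# Assumed line shape, taken from the one example in this post:
#   Sep 11 02:06:42 dilbert vmunix: MACHINE CHECK type 0x660 Machine check abort
MCHECK_RE = re.compile(
    r"^(?P<stamp>\w{3}\s+\d+\s+[\d:]{8})\s+(?P<host>\S+)\s+vmunix:\s+"
    r"MACHINE CHECK type (?P<type>0x[0-9a-fA-F]+)"
)

def machine_checks(lines):
    """Return a (timestamp, host, type) tuple for each machine-check line."""
    hits = []
    for line in lines:
        m = MCHECK_RE.match(line)
        if m:
            hits.append((m.group("stamp"), m.group("host"), m.group("type")))
    return hits

# Sample input copied from the kern.log excerpt in this message.
sample = [
    "Sep 11 02:06:42 dilbert vmunix: MACHINE CHECK type 0x660 Machine check abort",
    "Sep 11 02:06:42 dilbert vmunix: ptr[0-1] = 0000000100000088 0000000000008a5d",
]

hits = machine_checks(sample)
by_type = Counter(t for _, _, t in hits)
for stamp, host, mtype in hits:
    print(stamp, host, mtype)
print(dict(by_type))
```

To run it over the real file, pass an open file object for kern.log in
place of the sample list.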
------------------------------------------------------------------------------
uerf -o full
------------------------------------------------------------------------------
                                                  uerf version 4.2-011 (122)


********************************* ENTRY 1. *********************************

----- EVENT INFORMATION -----

EVENT CLASS ERROR EVENT
OS EVENT TYPE 100. CPU EXCEPTION
SEQUENCE NUMBER 1.
OPERATING SYSTEM DEC OSF/1
OCCURRED/LOGGED ON Fri Sep 11 02:02:47 1998
OCCURRED ON SYSTEM dilbert
SYSTEM ID x00060004 CPU TYPE: DEC 3000
SYSTYPE x00000000

----- UNIT INFORMATION -----

UNIT CLASS CPU

----- LEP MACHINE CHECK STACK FRAME -----

PROCESSOR OFFSET x00000110
SYSTEM OFFSET x000001A0
PALTEMP1 x0000000000008A5D
PALTEMP2 x000486F800000004
PALTEMP3 x000000014001D494
PALTEMP4 x00000001400131A1
PALTEMP5 x0000000000000085
PALTEMP6 x0000000140004710
PALTEMP7 x0000000000104000
PALTEMP8 x0000000000000000
PALTEMP9 x0000000000000008
PALTEMP10 xFFFFFC00005081F0
PALTEMP11 x0000000000000000
PALTEMP12 xFFFFFC0000508590
PALTEMP13 xFFFFFC00005085C0
PALTEMP14 xFFFFFC0000508620
PALTEMP15 xFFFFFC0000508390
PALTEMP16 xFFFFFC0000508060
PALTEMP17 x0000000000000027
PALTEMP18 x000000011FFFFDA0
PALTEMP19 xFFFFFFFF85913A38
PALTEMP20 xFFFFFC0000698C90
PALTEMP21 x0000000000000000
PALTEMP22 x6068686C7C7C7C7C
PALTEMP23 x0000000000000000
PALTEMP24 x0000000000000000
PALTEMP25 x0000000000010000
PALTEMP26 x0000000000000000
PALTEMP27 x0000000000000000
PALTEMP28 x00000000036F8000
PALTEMP29 xFFFFFFFC00000000
PALTEMP30 x0000000000000001
PALTEMP31 x00000000026D7A38
EXC_ADDR x0000000020004336
                                        EXCEPTING OR EXECUTING INSTRUCTION DID NOT COMPLETE PC IS x480010CD
EXC_SUM x0000000000000000
EXC_MSK x0000000000000000
ICCSR x0000000000000000
                                        PC0 INT ENABLED AFTER 2**16 EVENTS
                                        PC1 INT ENABLED AFTER 2**12 EVENTS
                                        PC0 COUNTER INPUT TOTAL ISSUES DIVIDED
                                         _BY 2
                                        PC1 COUNTER INPUT DCACHE MISSES
                                        FP INSTRUCTIONS CAUSE FEN EXCEPTIONS
                                        ADDRESS SPACE NUMBER = x0
PAL_BASE x0000000000060000
                                        BASE ADDRESS FOR PALCODE = x18
HIER x00000000000018F0
                                        CORRECTABLE READ ERROR INTERRUPT
                                         _ENABLED
                                        CPU HARDWARE INTERRUPT ENABLED ON PIN
                                         _3
                                        CPU HARDWARE INTERRUPT ENABLED ON PIN
                                         _4
                                        CPU HARDWARE INTERRUPT ENABLED ON PIN
                                         _5
                                        PC1 INTERRUPT DISABLED
                                        PC0 INTERRUPT DISABLED
                                        CPU HARDWARE INTERRUPT ENABLED ON PIN
                                         _1
                                        CPU HARDWARE INTERRUPT ENABLED ON PIN
                                         _2
HIRR x0000000000000000
MM_CSR x0000000000003E01 D-STREAM REFERENCE ERROR CAUSE WAS A
                                         _WRITE
                                        INTEGER REGISTER USED IS R 0.
DC_STAT x0000000000000003
                                        DC_HIT LAST LOAD OR STORE MISSED
                                         _DCACHE
                                        OPCODE RA FIELD - INTEGER REGISTER IS R 0.
DC_ADDR x00000000FFFFFFFF SEO SECOND ERROR OCCURRED
ABOX_CTL x000000000000942E
                                        FUNCTIONS ENABLED - MCHECK ENABLED FOR
                                         _UNCORRECTABLE ERRORS
                                        FUNCTIONS ENABLED - CRD CORRECTED READ
                                         _DATA INTERRUPT ENABLED
                                        FUNCTIONS ENABLED - SINGLE ENTRY ICACHE
                                         _STREAM BUFFER ENABLED
                                        FUNCTIONS ENABLED - DCACHE ENABLED
BIU_STAT x0000000000002140
                                        BIU_CMD CYCLE CLASS IS READ_BLOCK
                                        FILL_ECC PRI. CACHE FILL FROM EXT.
                                         _CACHE HAD ECC ERROR
BIU_ADDR x0000000003593490
                                        PHYSICAL ADDRESS OF CACHE BLOCK WITH ERROR IS x1AC9A4
BIU_CTL x0000000040006557
                                        EXTERNAL CACHE ENABLED
                                        EXTERNAL CACHE ECC ENABLED
                                        EXTERNAL CACHE FORCE HIT FOR
                                         _READ_BLOCK AND WRITE_BLOCK
                                         _TRANSACTIONS
                                        EXTERNAL CACHE READ/WRITE SPEED IN CPU CYCLES IS
                                         _11
                                        EXTERNAL CACHE WRITE ENABLE TIMING BIT FIELD IS x1
FILL_SYNDROME x0000000000003500 SINGLE BIT ERROR IS NO ERRORS
FILL_ADDR x0000000003593494
                                        PHYSICAL ADDRESS OF QUADWORD WITH ERROR x1AC9A4
VA x00000000001011C8 D-STREAM FAULT OR DTB MISS - VIRTUAL ADDRESS IS x1011C8
BC_TAG x0000000000000C12
                                        PARITY FOR DS AND V BITS
                                        V BIT - CACHE BLOCK VALID
                                        TAG ADDRESS IS x60

----- KN15AA CPU SPECIFIC STACK FRAME -----

INT_EXC_IDENT x0000000000000088
                                        INTERRUPT OR EXCEPTION IS NONE
MCR_STAT x0000000011118080 BANK 0 32 MBYTES
                                        BANK 1 32 MBYTES
IOSLOT x0000000000100000
                                        TURBOCHANEL OPTION SLOT 1 PARITY
                                         _DISABLED
                                        TURBOCHANEL OPTION SLOT 2 PARITY
                                         _DISABLED
                                        TURBOCHANEL OPTION SLOT 4 PARITY
                                         _DISABLED
                                        TURBOCHANEL OPTION SLOT 5 PARITY
                                         _DISABLED
                                        TURBOCHANEL OPTION SLOT 6 PARITY
                                         _DISABLED
                                        TC OPTION SCSI ADAPTER PARITY DISABLED
                                        TC OPTION CORE I/O PARITY DISABLED
                                        TC OPTION CXTURBO PARITY DISABLED
TC_CONFIG x0000000000000016 MAGIC # FOR DMA CONTROL IS x16
                                        PAGE SIZE IS 8KBYTES
IR x000000000007FE00
                                        SECOND ERROR OCCURED
                                        DMA BUFFER ERROR - UNDER/OVER FLOW
                                        CROSSED 2K BOUNDARY ON DMA
                                        TC RESET IN PROGRESS
                                        TC PARITY ERROR
                                        TAG ERROR DURING DMA
                                        SINGLE BIT ERROR ON I/O WRITE OR DMA
                                         _READ
                                        DOUBLE BIT ERROR ON I/O WRITE OR DMA
                                         _READ
                                        TC TIMEOUT ON I/O REQUEST

********************************* ENTRY 2. *********************************

----- EVENT INFORMATION -----

EVENT CLASS ERROR EVENT
OS EVENT TYPE 302. PANIC
SEQUENCE NUMBER 2.
OPERATING SYSTEM DEC OSF/1
OCCURRED/LOGGED ON Fri Sep 11 02:02:47 1998
OCCURRED ON SYSTEM dilbert
SYSTEM ID x00060004 CPU TYPE: DEC 3000
SYSTYPE x00000000
MESSAGE panic (cpu 0): Machine check -
                                         _Hardware error

********************************* ENTRY 3. *********************************

----- EVENT INFORMATION -----

EVENT CLASS OPERATIONAL EVENT
OS EVENT TYPE 300. SYSTEM STARTUP
SEQUENCE NUMBER 0.
OPERATING SYSTEM DEC OSF/1
OCCURRED/LOGGED ON Fri Sep 11 02:06:42 1998
OCCURRED ON SYSTEM dilbert
SYSTEM ID x00060004 CPU TYPE: DEC 3000
SYSTYPE x00000000
MESSAGE Alpha boot: available memory from
                                         _0x9ba000 to 0x4000000
                                        Digital UNIX V4.0B (Rev. 564); Fri
                                         _Feb 20 10:27:50 PST 1998
                                        physical memory = 64.00 megabytes.
                                        available memory = 54.27 megabytes.
                                        using 238 buffers containing 1.85
                                         _megabytes of memory
                                        tc0 at nexus
                                        scc0 at tc0 slot 7
                                        tcds0 at tc0 slot 6
                                        scsi0 at tcds0 slot 0
                                        rz0 at scsi0 target 0 lun 0 (LID=0)
                                         _(SEAGATE ST15230N 0638)
                                        rz1 at scsi0 target 1 lun 0 (LID=1)
                                         _(SEAGATE ST34371N 0338)
                                        scsi1 at tcds0 slot 1
                                        tz8 at scsi1 target 0 lun 0 (LID=2)
                                         _(DEC DLT2000 830A)
                                        rz9 at scsi1 target 1 lun 0 (LID=3)
                                         _(SEAGATE ST410800N 0021)
                                        rz10 at scsi1 target 2 lun 0 (LID=4)
                                         _(SEAGATE ST19171N 0024)
                                        rz11 at scsi1 target 3 lun 0 (LID=5)
                                         _(SEAGATE ST39173W 5698)
                                         _(Wide16)
                                        rz12 at scsi1 target 4 lun 0 (LID=6)
                                         _(SEAGATE ST39173W 5698)
                                         _(Wide16)
                                        rz13 at scsi1 target 5 lun 0 (LID=7)
                                         _(SEAGATE ST39173W 5698)
                                         _(Wide16)
                                        tz14 at scsi1 target 6 lun 0 (LID=8)
                                         _(EXABYTE EXB-85058SQANXR1 0781)
                                        bba0 at tc0 slot 7
                                        ln0: DEC LANCE Module Name: PMAD-BA
                                        ln0 at tc0 slot 7
                                        ln0: DEC LANCE Ethernet Interface,
                                         _hardware address: 08-00-2B-9A-37-06
                                        fb0 at tc0 slot 0
                                         1280X1024
                                        DEC 3000 - M700 system
                                        Firmware revision: 7.0
                                        PALcode: OSF version 1.45
                                        lvm0: configured.
                                        lvm1: configured.
                                        dli: configured
                                        ATM Subsystem configured with 1
                                         _restart threads
                                        ATM UNI 3.x signalling: configured
                                        ATM IP interface: configured
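
To compare one crash with the next, the register lines in the uerf
machine check frame can be pulled into a table so that values such as
BIU_ADDR and FILL_SYNDROME are easy to put side by side. This is a rough
sketch under stated assumptions: parse_registers is a hypothetical
helper, and the line shape it expects (one upper-case name, then "x" and
16 hex digits) is inferred only from the dump above. It extracts values
and does not diagnose anything.

```python
import re

# Assumed register line shape, inferred from the dump in this post:
#   BIU_ADDR x0000000003593490
# Indented description lines and 8-digit fields are deliberately skipped.
REG_RE = re.compile(r"^\s*([A-Z][A-Z0-9_]*)\s+x([0-9A-Fa-f]{16})\b")

def parse_registers(text):
    """Map register name to integer value for one uerf ENTRY."""
    regs = {}
    for line in text.splitlines():
        m = REG_RE.match(line)
        if m:
            regs[m.group(1)] = int(m.group(2), 16)
    return regs

# Sample lines copied from ENTRY 1 above.
sample = """\
BIU_STAT x0000000000002140
                    BIU_CMD CYCLE CLASS IS READ_BLOCK
BIU_ADDR x0000000003593490
FILL_SYNDROME x0000000000003500 SINGLE BIT ERROR IS NO ERRORS
FILL_ADDR x0000000003593494
"""

regs = parse_registers(sample)
for name in ("BIU_STAT", "BIU_ADDR", "FILL_SYNDROME", "FILL_ADDR"):
    print(f"{name:14s} 0x{regs[name]:x}")
```

Feed it one ENTRY at a time, since a later entry with the same register
names would overwrite the earlier values. If several crashes report the
same BIU_ADDR or FILL_ADDR, that would be worth mentioning when asking
about the hardware.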
Received on Sat Sep 12 1998 - 17:28:33 NZST
