I would like some help interpreting the attached DECevent entry. This
event occurred while booting the server, causing the system to panic and
return to the SRM prompt. The very next boot succeeded without error.
The configuration layout is:
Two AS4100s in a TruCluster ASE 1.5 configuration, running DU4.0D, Patch
Kit #3.
They each have 3 KZPSA scsi controllers connected for shared busses.
One bus is to a Raid Array 7000. Another is to a BA356 shelf containing
a TZ89 tape drive. The third set of controllers currently have no scsi
devices connected to them, they are just terminated at the servers. All
KZPSAs have firmware revs of R01,A11. Also, they are set to scsi id 6
in one server and id 7 in the other.
Each server has a KZPSC raid controller managing its mirrored system
disks.
At the time this server crashed, the other cluster member was powered
on, sitting at the SRM prompt.
The nearest I can tell, the panic occurred on IOD #1. With all 4
controllers signaling an error? Could this be bad hardware? Which
piece(s)? Any input will be appreciated.
Also, the dia man page references a "DECevent Translation and Reporting
Utility for OSF User and Reference Guide". Anyone know where this
manual can be obtained? I checked the DU Documentation CD but did not
find it there. I'd really like to better prepare myself for
interpreting the DECevent messages.
Thanks,
Kevin Erickson
kevine_at_ncrinc.com
Received on Wed Jan 27 1999 - 15:18:05 NZDT