A 2100 running DU 3.2c crashes frequently (only several hours of uptime) with
two processors, but runs longer, for days or weeks, with one processor. DEC has
tried replacing the processors, memory, I/O module, CPU backplane and power
supplies, to no avail.
Machine check info is printed on the console printer and the system just
hangs at that point, so it is impossible to get a crash dump; a hard
reset is necessary to get the thing going again.
The application is innd (7 x RZ29 LSM news spool). There are a few Netscape
clients (output to X terminals) running from time to time, and DECnet is running
but not terribly active except when backing up / and /usr to VMS. I use
tkined for network monitoring, so occasionally that is running too. The 3
PCI slots hold an FDDI controller and 2 plain disk controllers. There are no
external terminators on the PCI disk controllers' rear bulkhead connectors,
but they are strapped for active termination. The disks are internal. The
standard I/O module does have an external rear bulkhead terminator.
DEC has the console logs and has been asked several times whether they
think the problem is software; the answer is no.
Any ideas other than moving the applications to another machine?
Thanks.
John Nebel
Received on Fri Dec 06 1996 - 17:12:23 NZDT