Helpful suggestions received from:
Dr. Tom Blinn
Christian Klein
David (davegu1_at_...)
Johan Brusche
Roberto Mackun
Dr. Kieran Lynch
After a lot of in-depth analysis and debugging by Dr. Tom Blinn
and Christian Klein at HP, I managed to isolate the problem to a
device driver (dnb.mod) used for the HP DNBE1-BQ interface card.
It looks like the driver doesn't unload correctly under Tru64 5.1b-3
and corrupts the kernel stack of adjacent
drivers in memory (in our case, AdvFS and the ace console driver).
Fortunately, the problem only seems to occur when the driver is being
unloaded, so it doesn't cause any ongoing stability issue.
Thanks again for your help.
-----Original Message-----
From: tru64-unix-managers-owner_at_ornl.gov
[mailto:tru64-unix-managers-owner_at_ornl.gov]On Behalf Of Iain Barker
Sent: Monday, 06 June, 2005 14:46
To: tru64-unix-managers_at_ornl.gov
Subject: Kernel panic during shutdown/reboot with Tru64 5.1b-3 (PK5)
Hi Managers,
Has anyone experienced kernel panics during shutdown/reboot when using the latest Tru64 5.1b-3 (PK5)?
It doesn't seem to be related to the PK5 installation itself and doesn't impact system stability; the panic seems to occur later, when the system is subsequently shut down (shutdown -h now) or during a scheduled reboot.
Thanks.
Shutdown at 19:07 (in 0 minutes) [pid 8052]
System shutdown time has arrived
/proc: Invalid argument
Rebooting . . .
trap: invalid memory read access from kernel mode
faulting virtual address: 0x000000010000000c
pc of faulting instruction: 0xffffffff000ddb94
ra contents at time of fault: 0xffffffff000ddb94
sp contents at time of fault: 0xfffffe0450737690
panic (cpu 0): kernel memory fault
syncing disks... done
DUMP: blocks available: 12000000
DUMP: blocks wanted: 115682 (partial compressed dump) [OKAY]
DUMP: Device Disk Blocks Available
DUMP: ------ ---------------------
DUMP: 0x1300007 9004095 - 11999997 (of 11999998) [primary swap]
DUMP.prom: Open: dev 0x5100003, block 4004096: SCSI 0 8 0 0 0 0 0
DUMP: Writing header... [1024 bytes at dev 0x1300007, block 11999998]
DUMP: Writing data........ [8MB]
DUMP: Writing header... [1024 bytes at dev 0x1300007, block 11999998]
DUMP: crash dump complete.
halted CPU 0
halt code = 5
HALT instruction executed
PC = ffffffff0048be30
CPU 0 booting
resetting all I/O buses
Received on Thu Jun 09 2005 - 15:04:17 NZST