Hi,
I restarted my system (Alpha LX164, TRU64 5.0) today at an uptime of more
than 40 days, after 2 hours it crashed with a kernel memory fault (I
attached the crash data at the end of this mail)
After the panic the system booted without any problems. However the system
crashed once more after a period of around 2 hours. So it did after that
one...
I'm not sure what's causing this, the hardware should be alright as the
system is brand new. The ADVFS domains and filesets should be OK according
to ADVFS verify.
Does sombody have any idea on what's causing this crash?
Thanks in advance for any help and sorry for the rather long attachment!
cheers,
Horst Reiterer
---------------------------------
#
# Crash Data Collection (Version 1.4)
#
_crash_data_collection_time: Wed Mar 29 20:26:34 CEST 2000
_current_directory: /
_crash_kernel: /var/adm/crash/vmunix.7
_crash_core: /var/adm/crash/vmzcore.7
_crash_arch: alpha
_crash_os: Digital UNIX
_host_version: V5.0 (Rev. 910)
Digital UNIX V5.0 (Rev. 910); Fri Feb 18 16:28:42 CET 2000
_crash_version: V5.0 (Rev. 910)
Digital UNIX V5.0 (Rev. 910); Fri Feb 18 16:28:42 CET 2000
thread 0xfffffc000ff2cc00 stopped at [boot:2492 ,0xfffffc00004b37e0]
Source not available
_crashtime: struct {
tv_sec = 954354262
tv_usec = 650016
}
_boottime: struct {
tv_sec = 954348043
tv_usec = 147376
}
_config: struct {
sysname = "OSF1"
nodename = "onyx.xxxxxxxxxxxx.xxx"
release = "V5.0"
version = "910"
machine = "alpha"
}
_cpu: 46
_system_string: 0xffffffffff8008b0 = "Digital AlphaPC 164LX 599 MHz"
_ncpus: 1
_avail_cpus: 1
_partial_dump: 1
_physmem(MBytes): 255
_panic_string: 0xfffffc00006838d0 = "kernel memory fault"
_paniccpu: 0
_panic_thread: 0xfffffc000ff2cc00
_preserved_message_buffer_begin:
struct {
hdr = struct {
msg_magic = 0x880524
msg_bufx = 0x669
msg_bufr = 0x45d
msg_size = 0x3fe0
}
msg_bufc = "Alpha boot: available memory from 0x101c000 to 0xffee000
Digital UNIX V5.0 (Rev. 910); Fri Feb 18 16:28:42 CET 2000
physical memory = 256.00 megabytes.
available memory = 239.82 megabytes.
using 975 buffers containing 7.61 megabytes of memory
Firmware revision: 5.6-2
PALcode: UNIX version 1.23-2
Digital AlphaPC 164LX 599 MHz
pci0 (primary bus:0) at nexus
vga0 at pci0 slot 5
640x480 VGA, 16 colors
vga0: generic VGA driver
psiop_pci_initialize: Warning - Using unsupported 53c875 scsi chip
Loading SIOP: script c0000000, reg 82061000, data 4104c000
scsi1 at psiop0 slot 0
tu1: DECchip 21041: Revision: 1.1
tu1 at pci0 slot 7
tu1: DEC TULIP (10Mbps) Ethernet Interface, hardware address:
00-00-F8-01-32-4F
tu1: console mode: selecting 10BaseT (UTP) port: half duplex
isa0 at pci0
gpc0 at isa0
ace0 at isa0
ace1 at isa0
lp0 at isa0
fdi0 at isa0
fd0 at fdi0 unit 0
ata0 at pci0 slot 11
ata0: CMD PCI0646
scsi0 at ata0 slot 0
scsi2 at ata0 slot 1
kernel console: vga0
dli: configured
NetRAIN configured.
ADVFS: using 2321 buffers containing 18.13 megabytes of memory
vm_swap_init: swap is set to eager allocation mode
trap: invalid memory read access from kernel mode
faulting virtual address: 0x0000000000000018
pc of faulting instruction: 0xfffffc00005cf43c
ra contents at time of fault: 0xfffffc00005a2b5c
sp contents at time of fault: 0xfffffe0413e879c0
panic (cpu 0): kernel memory fault
syncing disks... 37 9 47 47 device string for dump = SCSI 0 6 0 5 500 0 0.
DUMP.prom: dev SCSI 0 6 0 5 500 0 0, block 4194304
device string for dump = SCSI 0 6 0 5 500 0 0.
DUMP.prom: dev SCSI 0 6 0 5 500 0 0, block 4194304
"
}
_preserved_message_buffer_end:
_kernel_process_status_begin:
PID COMM
00000 kernel idle
00001 init
...
_kernel_process_status_end:
_current_pid: 0
_current_tid: 0xfffffc000ff2cc00
warning: Files compiled -g3: parameter values probably wrong
_kernel_thread_list_begin:
...
_kernel_thread_list_end:
_savedefp: (nil)
_kernel_memory_fault_data_begin:
struct {
fault_va = 0x18
fault_pc = 0xfffffc00005cf43c
fault_ra = 0xfffffc00005a2b5c
fault_sp = 0xfffffe0413e879c0
access = 0x0
status = 0x4
cpunum = 0x0
count = 0x1
pcb = 0xfffffe0413e87a00
thread = 0xfffffc000ff2cc00
task = 0xfffffc000fefc000
proc = 0xfffffc000fefc200
}
_kernel_memory_fault_data_end:
_uptime: 1.72 hours
thread 0xfffffc000ff2cc00 stopped at [boot:2492 ,0xfffffc00004b37e0]
Source not available
paniccpu: 0x0
machine_slot[paniccpu]: struct {
...
cpu_panicstr = 0xfffffc00006838d0 = "kernel memory fault"
cpu_panic_thread = 0xfffffc000ff2cc00
}
tset machine_slot[paniccpu].cpu_panic_thread:
Begin Trace for machine_slot[paniccpu].cpu_panic_thread:
thread 0xfffffc000ff2cc00 stopped at [boot:2492 ,0xfffffc00004b37e0]
Source not available
> 0 boot(0x0, 0x1, 0x4, 0xfffffc000026d190, 0xfffffc00006257e0)
["../../../../src/kernel/arch/alpha/machdep.c":2492, 0xfffffc00004b37e0]
1 panic(s = 0xfffffc0000641778 = "panic stuck syncing disks")
["../../../../src/kernel/bsd/subr_prf.c":1225, 0xfffffc000028e230]
2 hardclock(pc = 0xfffffc0000458d28 = "\320Wc ", ps = (unallocated -
symbol optimized away)) ["../../../../src/kernel/bsd/kern_clock.c":1213,
0xfffffc000025d700]
3 _XentInt(0x2, 0xfffffc0000458d28, 0xfffffc00006257e0,
0xfffffc0000677c68, 0xfffffc00006cafb8)
["../../../../src/kernel/arch/alpha/locore.s":1664, 0xfffffc00004af7f4]
4 getnewvnode(tag = (unallocated - symbol optimized away), vops =
0xfffffc0000676348, vpp = 0xfffffe0413e874e0)
["../../../../src/kernel/vfs/vfs_subr.c":1817, 0xfffffc0000458d24]
5 vdealloc() ["../../../../src/kernel/vfs/vfs_subr.c":1528,
0xfffffc000045879c]
6 vrele(vp = 0x28a7) ["../../../../src/kernel/vfs/vfs_subr.c":2490,
0xfffffc00004598c8]
7 mntbusybuf(mountp = 0xfffffc000f440f00)
["../../../../src/kernel/vfs/vfs_bio.c":1614, 0xfffffc0000451150]
8 boot(0x1, 0xfffffc000ff2cc00, 0x0, 0x2600000026, 0x80000005d)
["../../../../src/kernel/arch/alpha/machdep.c":2430, 0xfffffc00004b3674]
9 panic(s = 0xfffffc00006838d0 = "kernel memory fault")
["../../../../src/kernel/bsd/subr_prf.c":1310, 0xfffffc000028e434]
10 trap(a0 = (...), a1 = (...), a2 = (...), code = 0xfffffc00006c8298,
exc_frame = 0xfffffe0413e878b8)
["../../../../src/kernel/arch/alpha/trap.c":2057, 0xfffffc00004bbac4]
11 _XentMM(0x4, 0xfffffc00005cf43c, 0xfffffc00006257e0, 0x6, 0x0)
["../../../../src/kernel/arch/alpha/locore.s":2187, 0xfffffc00004afbd4]
12 ws_enter_hot_swap_event(0xfffffc00006257e0, 0x6, 0x0, 0x1,
0xfffffc00005a2b5c) ["../../../../src/kernel/io/dec/ws/ws_driver.c":4352,
0xfffffc00005cf438]
13 pcxa_thread(0x0, 0xfffffc00006c4940, 0x0, 0x15, 0xfffffc000ff2cc00)
["../../../../src/kernel/io/dec/eisa/gpc.c":1138, 0xfffffc00005a2b58]
End Trace for machine_slot[paniccpu].cpu_panic_thread:
_crash_data_collection_finished:
---------------------------------
Received on Wed Mar 29 2000 - 19:03:04 NZST