[Q] Advfs Panics (N1=5)

From: Clare West <clare_at_cs.auckland.ac.nz>
Date: Tue, 22 Aug 1995 16:19:31 +1200

Thank you for your help with my last question. Maybe you can all help me
out again now.

We have been having a lot of crashes with an ADVFS EXCEPTION panic N1 = 5.
It turns out that there is a file (or maybe several files) on one of our advfs
filesets that causes these crashes when you try to "rm" it. We will be
doing some more experiments this evening (when the system is less loaded
and crashes will inconvenience fewer people). I have three questions:

1. how do we delete these files and stop the crashes?
2. how do we detect other files that may have similar problems?
3. how do we prevent these problems from occurring in the future?

any answers gratefully accepted.

some extracts from the Crash-Data for the last crash are included in case
they are of any use. I have looked in the archives and tried the
msfsck/vchkdir solution, but it did not help.

thanks in advance,

clare

#
# Crash Data Collection (Version 1.4)
#
_crash_data_collection_time: Tue Aug 22 15:26:40 NZST 1995
_current_directory: /
_crash_kernel: /var/adm/crash/vmunix.27
_crash_core: /var/adm/crash/vmcore.27
_crash_arch: alpha
_crash_os: DEC OSF/1
_host_version: DEC OSF/1 V3.2A (Rev. 17); Mon Aug 14 10:21:32 NZST 1995
_crash_version: DEC OSF/1 V3.2A (Rev. 17); Mon Aug 14 10:21:32 NZST 1995

thread 0xffffffff81a1af00 stopped at [boot:1620 ,0xfffffc00004df390]
Source not available
_crashtime: struct {
    tv_sec = 809061572
    tv_usec = 446292
}
_boottime: struct {
    tv_sec = 809057803
    tv_usec = 244000
}
_config: struct {
    sysname = "OSF1"
    nodename = "cs26.cs.auckland.ac.nz"
    release = "V3.2"
    version = "17"
    machine = "alpha"
}
_cpu: 35
_system_string: 0xffffffffff801048 = "AlphaServer 2100 4/275"
_ncpus: 2
_avail_cpus: 2
_partial_dump: 1
_physmem(MBytes): 511
_panic_string: 0xffffffffb619ade0 = "N1 = 5"
_preserved_message_buffer_begin:
struct {

[most of the message buffer deleted -- this was the standard boot up messages]

advfs I/O error: setId 0x2f4e9138.00055363.1.8001 tag 0x00000001.8001u
page 5392
        vd 1 blk 1338256 blkCnt 128
        read error = 5
ADVFS EXCEPTION
Module = 2, Line = 1346
N1 = 5
panic (cpu 0): N1 = 5
syncing disks... device string for dump = SCSI 0 1 0 4 400 0 0 .
device string for dump = SCSI 0 1 0 4 400 0 0 .
"
}
_preserved_message_buffer_end:

[more stuff deleted]
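
For anyone reading the message buffer above: the "read error = 5" line comes immediately before the panic's "N1 = 5", so it seems likely (this is an inference from the log, not something the crash data states) that N1 is simply the error number returned by the failed read. A minimal Python sketch that pulls the numeric fields out of the report and looks up error 5 in the local errno table is shown below. The function name and field names are made up for illustration, and the errno table consulted is that of the machine running the script, not of the crashed host.

```python
import errno
import os
import re

# Lines copied from the preserved message buffer above.
MESSAGE_BUFFER = """\
advfs I/O error: setId 0x2f4e9138.00055363.1.8001 tag 0x00000001.8001u
page 5392
        vd 1 blk 1338256 blkCnt 128
        read error = 5
ADVFS EXCEPTION
Module = 2, Line = 1346
N1 = 5
"""


def parse_advfs_error(text):
    """Pull the numeric fields out of an advfs I/O error report.

    Hypothetical helper: the field names are chosen here for readability
    and are not part of any AdvFS tool.
    """
    patterns = {
        "page": r"page\s+(\d+)",
        "vd": r"\bvd\s+(\d+)",
        "blk": r"\bblk\s+(\d+)",
        "blk_cnt": r"blkCnt\s+(\d+)",
        "read_error": r"read error = (\d+)",
        "n1": r"N1 = (\d+)",
    }
    fields = {}
    for name, pattern in patterns.items():
        match = re.search(pattern, text)
        if match:
            fields[name] = int(match.group(1))
    return fields


fields = parse_advfs_error(MESSAGE_BUFFER)
code = fields["read_error"]

print(fields)
# Symbolic name and description of the error number on the local system.
print(errno.errorcode.get(code), "-", os.strerror(code))
```

On the usual Unix errno tables 5 is EIO, a device-level I/O error, which would fit a failed read of blocks 1338256 onwards on virtual disk 1 rather than a purely logical inconsistency in the fileset.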

_dump_begin:
> 0 boot(0x0, 0x4, 0x200, 0x3, 0x0)
>["../../../../src/kernel/arch/alpha/machdep.c":1620, 0xfffffc00004df390]

   1 panic(s = 0xfffffc000060d5c8 = "event_timeout: panic request")
["../../../../src/kernel/bsd/subr_prf.c":668, 0xfffffc000043fa64]
pcpu = 0x1
i = -1259292352
bootopt = -1536403216
mycpu = 3852512
prevcc = 2147483647
nextcc = 4655468
timer = 1
limit = -4398039922672

   2 event_timeout(func = 0xfffffc00004eaec0, arg = 0xfffffc000060f5e8,
timeout = 5142088) ["../../../../src/kernel/arch/alpha/cpu.c":698,
0xfffffc00004d9938]
prevcc = 18446744072449752380
nextcc = 18446739675668197328
timer = 1
limit = 18446739675669961440

   3 pmap_update_send(0xfffffffdff7fdff8, 0x40000000000000,
0xfffffffdffed41d0, 0x1119, 0xfffffc0000675740)
["../../../../src/kernel/arch/alpha/pmap_update.c":201, 0xfffffc00004eb018]

   4 pmap_tbsync(pmap = 0xfffffc0000675740, va = 1, siz =
18446739675668182700) ["../../../../src/kernel/arch/alpha/pmap.c":3166,
0xfffffc00004e7b9c]

   5 pmap_mmu_unload(va = 18446739675669974600, sz = 24576, tbop = 4)
["../../../../src/kernel/arch/alpha/pmap.c":3021, 0xfffffc00004e76a8]
end = 33
pte = 0xfffffc000069ccb8
s = 1
other = (nil)

   6 bs_osf_complete(bp = 0xffffffffa4bd7e00)
["../../../../src/kernel/msfs/osf/msfs_io.c":644, 0xfffffc0000403538]
iop = 0xffffffff00000000
next = 0xffffffffb4f44b10
vdp = 0xffffffffa477c008
s = 4452032
sts = 0
pages = 3
len = 0
setp = 0xfffffc0000000007
taddr = 0xffffffffb5074000l3 address 0xffffffffb5074000 not mapped, pte 0x0

offset = 8

   7 msfs_async_iodone_lwc()
["../../../../src/kernel/msfs/osf/msfs_io.c":698, 0xfffffc00004036d4]
s = 1
bp = 0xfffffc000069cc68

   8 lwc_schedule(0xfffffc0000643e10, 0xfffffc000053ff70,
0xfffffc0000000001, 0x1, 0xfffffc0000471254)
["../../../../src/kernel/bsd/lwc.c":220, 0xfffffc000024bdec]

   9 thread_block() ["../../../../src/kernel/kern/sched_prim.c":1710,
0xfffffc0000470ff0]
thread = 0xffffffff81a1af00
new_thread = 0x540068
mycpu = 0
myprocessor = (nil)
s = 1
pset = 0xfffffc000053ff70
prev = 0xfffffc00006bce90

  10 xpt_callback_thread() ["../../../../src/kernel/io/cam/xpt.c":2250,
0xfffffc0000540114]
xpt_ws = 0xfffffc00006d0d18
s = -2119902208
thread = 0xfffffc0000540118

_dump_end:

[rest of crash-data deleted]
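
The _boottime and _crashtime values in the crash data are seconds since the Unix epoch, so the time the system stayed up before this panic can be worked out directly. A small Python sketch of that arithmetic follows; the constant and function names are made up for illustration, and the conversion is done in UTC to avoid depending on the local timezone of whoever runs it.

```python
from datetime import datetime, timedelta, timezone

BOOT_TV_SEC = 809057803   # _boottime.tv_sec from the crash data
CRASH_TV_SEC = 809061572  # _crashtime.tv_sec from the crash data


def uptime_before_crash(boot_sec, crash_sec):
    """Return the interval between boot and crash as a timedelta."""
    return timedelta(seconds=crash_sec - boot_sec)


uptime = uptime_before_crash(BOOT_TV_SEC, CRASH_TV_SEC)
crash_utc = datetime.fromtimestamp(CRASH_TV_SEC, tz=timezone.utc)

print("uptime before panic:", uptime)
print("crash time (UTC):   ", crash_utc.isoformat())
```

The difference is 3769 seconds, so the machine had been up for a little under 63 minutes when it panicked.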

--
Clare West, Rm 107, Ext 8266
clare_at_cs.auckland.ac.nz
Received on Tue Aug 22 1995 - 06:35:03 NZST
