Summary: the zombie walks, the sequel

From: Chua Koon Teck <koonteck_at_singnet.com.sg>
Date: Wed, 27 Mar 1996 14:05:04 +0800 (SST)

Hi

My summary for the above is to get patch OSF360-350113 for V3.2D.

My local Digital do not seems to have the patch.

Does anyone have the above patch ?


---------- Forwarded message ----------
Date: Tue, 26 Mar 96 12:14:35 -0500
From: Dr. Tom Blinn, 603-881-0646 <tpb_at_zk3.dec.com>
To: Chua Koon Teck <koonteck_at_singnet.com.sg>
Subject: Re: the zombie walks, the sequel

> I have a DEC7000 with 2 CPUs and running Digital UNIX 3.2D. Yesterday,
> it crashes and the output of the crash-data shows :
>
> _panic_string: 0xfffffc000050c330 = "the zombie walks, the sequel"
> _paniccpu: 1
> _panic_thread: 0xfffffc001b6f9b80
>
> 1 panic(s = 0xfffffc000053e778 = "event_timeout: panic request")
> ["../../../../
> src/kernel/bsd/subr_prf.c":673, 0xfffffc000040cdd8]
>
> Anyone got any idea what is this "the zombie walk, the sequel" error ??

I think it's a bug in the kernel. The error condition is detected in the
module kern_exit.c in src/kernel/bsd in a section of code that looks like
this:


        /*
         * We used to call thread_halt_self() here, but that left
         * a window prior to this point where our state could be yanked
         * before we were safely halted. We are setup to suspend, so
         * blocking here will allow this thread to be deallocated by
         * a waiting parent.
         */
        thread_block();
        panic("the zombie walks, the sequel");
        /*NOTREACHED*/

which is part of a routine called "exit" that is supposed to allow a user
process to exit. As the process is exiting, a bunch of cleanup is done and
the process is supposed to go into what's called the "zombie" state, where
it is never supposed to run again.

As you can see from the comments, the kernel is supposed to NEVER return
from the call the thread_block() and if it does, something is wrong so it
aborts the system at that point.

I have the name of the developer who MIGHT have worked on this, and I'll go
talk to him. In the meantime, you should file a problem report with youre
local Digital services organization, and be sure to save the vmcore and
vmunix files from the panic so that if we need to look at them they'll be
available.

Tom
 
 Dr. Thomas P. Blinn, UNIX Software Group, Digital Equipment Corporation
  110 Spit Brook Road, MS ZKO3-2/U20 Nashua, New Hampshire 03062-2698
   Technology Partnership Engineering Phone: (603) 881-0646
    Internet: tpb_at_zk3.dec.com Digital's Easynet: alpha::tpb

  Worry kills more people than work because more people worry than work.

     My favorite palindrome is: Satan, oscillate my metallic sonatas.
                                         -- Phil Agre, pagre_at_ucsd.edu

  Opinions expressed herein are my own, and do not necessarily represent
  those of my employer or anyone else, living or dead, real or imagined.
 
Received on Wed Mar 27 1996 - 07:23:36 NZST

This archive was generated by hypermail 2.4.0 : Wed Nov 08 2023 - 11:53:46 NZDT