SUMMARY: ASE 1.4 + DU 4.0b hangs at times

From: Gunther Feuereisen <gunther_at_ibm.net>
Date: Sat, 14 Jun 1997 23:40:06 +1000

Thanks to:

Dave Cherkus <cherkus_at_homerun.unimaster.com>
"Jenkins, Gary C." <gcjenkins_at_escocorp.com>
"Eric Z. Ayers" <eric_at_compgen.com>

I have also logged a call with Digital, and they are looking at it. When I
get an "official" response, I'll post it. Gary mentioned that patches are
available for DU 4.0b and ASE 1.4; I'm also trying to get more information
about these.

gunther
--
My original question:
>Hi,
>
>I'm currently having a problem with DU 4.0b and ASE 1.4:
>
>My config:
>
>2 x 4000's
>2 x KZPSA's designated pza0 and pza1
>On each bus, a BA356 with disks.
>I mirror each disk across the shared bus using LSM. (No HSZ40's)
>I have a private network on DE450's configured at tu0 - BNC
>I have my LAN cards (DE500's) configured at tu1 - UTP
>I have tu0 to be 192.0.0.1 and 192.0.0.2 for each machine (private)
>I have tu1 to be 172.19.200.41 and 172.19.200.42 for each machine (LAN)
>I have a static route in /etc/routes: default 172.19.200.1, which is my gateway
>I can set up everything, my services, disks, floating IP and everything
>works fine.
>I can failover, crash a machine etc. and DECSafe works fine.
>
>The problem:
>-----------
>When I tried to modify a service by creating a start or stop script
>which does something as simple as:
>
>date > /tmp/TEST.is.running
>
>ASE stops the service, deletes it, adds it, and when it tries to start it,
>it hangs the machine.
>
>Also, it seems to clobber my primary network interface that I configured with
>ASE, so that the machines can no longer talk to each other via the network.
>
>The only way out is to reboot both machines, and then everything starts
>normally, as it was before I added the start script.
>
>I seem to be getting errors about gated. I don't ever remember seeing that
>I needed gated - I've never used it before. If I fire up gated manually, it also
>kills the private network interfaces. So that seems to be the culprit.
>
>I've run ASE 1.3 on DU 3.2C, D-2 and G with no problems whatsoever. I've
>run Production Server with multiple ASE's on 4.0b and 1.4 (on 4100's) and I
>didn't have any problems either.
>
>Any help or advice appreciated - all I can think is that I've done
>something silly along the way and not realised it.
>
>many thanks,
>gunther
>--
>Excerpts from the logs are:
>
>daemon.log: from the time I said yes to change the service
>--
>Jun  5 10:10:38 pofrea ASE: pofrea Agent Notice: stopping service popfre
>Jun  5 10:10:41 pofrea ASE: pofrea Director Notice: stopped popfre on pofrea
>Jun  5 10:10:44 pofrea ASE: pofrea Agent Notice: deleting service popfre
>Jun  5 10:10:44 pofrea ASE: pofreb Agent Notice: deleting service popfre
>Jun  5 10:10:55 pofrea gated[2010]: Start gated[2010] version R3_5Alpha_11 built Fri Nov 15 21:45:16 EST 1996
>Jun  5 10:10:55 pofrea ASE: pofrea Director Notice: deleted service
>Jun  5 10:10:55 pofrea gated[2010]: trace_on: tracing to "/usr/tmp/gated.log" started
>Jun  5 10:10:55 pofrea gated[2010]: inet_init: *WARNING* IP forwarding disabled!
>Jun  5 10:10:56 pofrea gated[2010]: KRT READ REMNANT 0.0.0.0         mask 255.255.255     router (null) flags <UP>1: ignoring
>Jun  5 10:10:56 pofrea gated[2010]: KRT READ REMNANT 128.169         mask 255.255         router 172.19.200.2 flags <UP GW DYN>13: queuing delete for redirect
>Jun  5 10:10:56 pofrea gated[2010]: rt_add: MARTIAN will not be propagated 192/255.255.255 gw 192.0.0.1 Direct
>Jun  5 10:10:56 pofrea gated[2010]: Commence routing updates
>Jun  5 10:10:56 pofrea gated[2010]: if_rtup: UP route for interface tu0 192.0.0.1/255.255.255
>Jun  5 10:10:56 pofrea gated[2010]: rt_add: MARTIAN will not be propagated 192/255.255.255 gw 192.0.0.1 Direct
>Jun  5 10:11:06 pofrea ASE: local HSM Warning: Can't ping pofreb over the network
>Jun  5 10:11:06 pofrea ASE: local HSM ***ALERT: HSM_PATH_STATUS:192.0.0.2:DOWN
>Jun  5 10:11:06 pofrea ASE: local HSM Warning: member pofreb is DOWN
>
>gated.log
>--
>Jun  5 10:10:55 trace_on: Tracing to "/usr/tmp/gated.log" started
>Jun  5 10:10:55
>Jun  5 10:10:55 Tracing flags enabled: general
>Jun  5 10:10:55
>Jun  5 10:10:55 inet_init: *WARNING* IP forwarding disabled!
>Jun  5 10:10:55 inet_routerid_notify: Router ID: 172.19.200.41
>Jun  5 10:10:55
>Jun  5 10:10:55
>Jun  5 10:10:55 krt_rtread: Initial routes read from kernel (radix tree via kmem):
>ADD      0.0.0.0          0.0.0.0         gw 172.19.200.1    Kernel   pref 254/0 metric 0/0 tu1 <NoAdvise Int Active Gateway>
>KRT READ REMNANT 0.0.0.0         mask 255.255.255     router (null) flags <UP>1: ignoring
>ADD      127.0.0.1        255.255.255.255 gw 127.0.0.1       Kernel   pref -254/0 metric 0/0 lo0 <NotInstall NoAdvise Int Hidden Gateway>
>ADD      127.0.0.1        255.255.255.255  Kernel   pref 0/0 metric 0/0 <NoAdvise Active Gateway>
>CHANGE   127.0.0.1        255.255.255.255  Kernel   pref 0/0 metric 0/0 <NoAdvise Delete Gateway>
>RELEASE  127.0.0.1        255.255.255.255 gw 127.0.0.1       Kernel   pref -254/0 metric 0/0 lo0 <NotInstall NoAdvise Int Release Gateway>
>KRT READ REMNANT 128.169         mask 255.255         router 172.19.200.2 flags <UP GW DYN>13: queuing delete for redirect
>ADD      172.19.200       255.255.255     gw 172.19.200.41   Kernel   pref -254/0 metric 0/0 tu1 <NotInstall NoAdvise Int Hidden Gateway>
>ADD      172.19.200       255.255.255      Kernel   pref 0/0 metric 0/0 <NoAdvise Active Gateway>
>CHANGE   172.19.200       255.255.255      Kernel   pref 0/0 metric 0/0 <NoAdvise Delete Gateway>
>RELEASE  172.19.200       255.255.255     gw 172.19.200.41   Kernel   pref -254/0 metric 0/0 tu1 <NotInstall NoAdvise Int Release Gateway>
>ADD      192              255.255.255     gw 192.0.0.1       Kernel   pref -254/0 metric 0/0 tu0 <NotInstall NoAdvise Int Hidden Gateway>
>ADD      192              255.255.255      Kernel   pref 0/0 metric 0/0 <NoAdvise Active Gateway>
>CHANGE   192              255.255.255      Kernel   pref 0/0 metric 0/0 <NoAdvise Delete Gateway>
>RELEASE  192              255.255.255     gw 192.0.0.1       Kernel   pref -254/0 metric 0/0 tu0 <NotInstall NoAdvise Int Release Gateway>
>Jun  5 10:10:55 rt_close: 13 routes proto KRT
>Jun  5 10:10:55
>Jun  5 10:10:55 if_ifachange:	192.0.0.1
>Jun  5 10:10:55 if_ifachange:		index: 1  name: tu0  state: <Up Broadcast Multicast Simplex NoAge>
>Jun  5 10:10:55 if_ifachange:		change: <>  metric: 0  route: not installed
>Jun  5 10:10:55 if_ifachange:		preference: 0  down: 120  refcount: 2  mtu: 1436
>Jun  5 10:10:55 if_ifachange:		broadaddr: 192.0.0.255
>Jun  5 10:10:55 if_ifachange:		subnet: 192  subnetmask: 255.255.255
>Jun  5 10:10:55
>Jun  5 10:10:55 if_rtup: ADD route for interface tu0 192.0.0.1/255.255.255
>ADD      192              255.255.255     gw 192.0.0.1       Direct   pref 0/0 metric 0/0 tu0 <NotInstall NoAdvise Int Active Retain>
>Jun  5 10:10:55 rt_add: MARTIAN will not be propagated 192/255.255.255 gw 192.0.0.1 Direct
>Jun  5 10:10:55 rt_close: 1 route proto IF
>Jun  5 10:10:55
>Jun  5 10:10:55 if_ifachange:	172.19.200.41
>Jun  5 10:10:55 if_ifachange:		index: 2  name: tu1  state: <Up Broadcast Multicast Simplex NoAge>
>Jun  5 10:10:55 if_ifachange:		change: <>  metric: 0  route: not installed
>Jun  5 10:10:55 if_ifachange:		preference: 0  down: 120  refcount: 3  mtu: 1436
>Jun  5 10:10:55 if_ifachange:		broadaddr: 172.19.200.255
>Jun  5 10:10:55 if_ifachange:		subnet: 172.19.200  subnetmask: 255.255.255
>Jun  5 10:10:55
>Jun  5 10:10:55 if_rtup: ADD route for interface tu1 172.19.200.41/255.255.255
>ADD      172.19.200       255.255.255     gw 172.19.200.41   Direct   pref 0/0 metric 0/0 tu1 <Int Active Retain>
>Jun  5 10:10:55 rt_close: 1 route proto IF
>Jun  5 10:10:55
>Jun  5 10:10:55 if_ifachange:	127.0.0.1
>Jun  5 10:10:55 if_ifachange:		index: 4  name: lo0  state: <Up Loopback Multicast Simplex NoAge>
>Jun  5 10:10:55 if_ifachange:		change: <>  metric: 0  route: not installed
>Jun  5 10:10:55 if_ifachange:		preference: 0  down: 120  refcount: 2  mtu: 1472
>Jun  5 10:10:55 if_ifachange:		subnetmask: 255.255.255.255
>Jun  5 10:10:55
>Jun  5 10:10:55 if_rtup: ADD route for interface lo0 127.0.0.1/255.255.255.255
>ADD      127.0.0.1        255.255.255.255 gw 127.0.0.1       Direct   pref 0/0 metric 0/0 lo0 <NoAdvise Active Retain>
>Jun  5 10:10:55 rt_close: 1 route proto IF
>Jun  5 10:10:55
>Jun  5 10:10:55
>Jun  5 10:10:55 ***Routes are being installed in kernel
>Jun  5 10:10:55
>Jun  5 10:10:55
>Jun  5 10:10:55 Commence routing updates
>Jun  5 10:10:55
>Jun  5 10:10:56 inet_routerid_notify: Router ID: 172.19.200.41
>Jun  5 10:10:56
>Jun  5 10:10:56 if_ifachange:	192.0.0.1
>Jun  5 10:10:56 if_ifachange:		index: 1  name: tu0  state: <Up Broadcast Multicast Simplex NoAge>
>Jun  5 10:10:56 if_ifachange:		change: <>  metric: 0  route: installed
>Jun  5 10:10:56 if_ifachange:		preference: 0  down: 120  refcount: 3  mtu: 1436
>Jun  5 10:10:56 if_ifachange:		broadaddr: 192.0.0.255
>Jun  5 10:10:56 if_ifachange:		subnet: 192  subnetmask: 255.255.255
>Jun  5 10:10:56
>Jun  5 10:10:56 if_rtup: UP route for interface tu0 192.0.0.1/255.255.255
>ADD      192              255.255.255     gw 192.0.0.1       Direct   pref 0/0 metric 0/0 tu0 <NotInstall NoAdvise Int Retain>
>Jun  5 10:10:56 rt_add: MARTIAN will not be propagated 192/255.255.255 gw 192.0.0.1 Direct
>CHANGE   192              255.255.255     gw 192.0.0.1       Direct   pref 0/0 metric 0/0 tu0 <NotInstall NoAdvise Int Active Retain>
>RELEASE  192              255.255.255     gw 192.0.0.1       Direct   pref 0/0 metric 0/0 tu0 <NotInstall NoAdvise Int Release Retain>
>Jun  5 10:10:56 rt_close: 2 routes proto IF
>Jun  5 10:10:56
>ADD      127              255             gw 127.0.0.1       Static   pref 0/0 metric 0/0 lo0 <NoAdvise Int Active Reject>
>Jun  5 10:10:56 rt_close: 1 route proto RT
>Jun  5 10:10:56
>Jun  5 10:10:56 if_ifachange:	172.19.200.41
>Jun  5 10:10:56 if_ifachange:		index: 2  name: tu1  state: <Up Broadcast Multicast Simplex NoAge>
>Jun  5 10:10:56 if_ifachange:		change: <>  metric: 0  route: installed
>Jun  5 10:10:56 if_ifachange:		preference: 0  down: 120  refcount: 4  mtu: 1436
>Jun  5 10:10:56 if_ifachange:		broadaddr: 172.19.200.255
>Jun  5 10:10:56 if_ifachange:		subnet: 172.19.200  subnetmask: 255.255.255
>Jun  5 10:10:56
>Jun  5 10:10:56 if_ifachange:	127.0.0.1
>Jun  5 10:10:56 if_ifachange:		index: 4  name: lo0  state: <Up Loopback Multicast Simplex NoAge>
>Jun  5 10:10:56 if_ifachange:		change: <>  metric: 0  route: installed
>Jun  5 10:10:56 if_ifachange:		preference: 0  down: 120  refcount: 4  mtu: 1472
>Jun  5 10:10:56 if_ifachange:		subnetmask: 255.255.255.255
>Jun  5 10:10:56
>Jun  5 10:10:56
>Jun  5 10:10:56 rt_new_policy: new policy started with 5 entries
>RELEASE  127.0.0.1        255.255.255.255  Kernel   pref 0/0 metric 0/0 <NoAdvise Delete Release Gateway>
>RELEASE  172.19.200       255.255.255      Kernel   pref 0/0 metric 0/0 <NoAdvise Delete Release Gateway>
>RELEASE  192              255.255.255      Kernel   pref 0/0 metric 0/0 <NoAdvise Delete Release Gateway>
>Jun  5 10:10:56 rt_new_policy: new policy ended with 5 entries
>Jun  5 10:10:56
>Jun  5 10:10:56 stdio: bootname /vmunix
>ADD      203.102.144      255.255.255     gw 172.19.200.2    RIP      pref 100/0 metric 2/0 tu1 <Int Active Gateway>
>ADD      203.12.186       255.255.255     gw 172.19.200.2    RIP      pref 100/0 metric 2/0 tu1 <Int Active Gateway>
>Jun  5 10:10:56 rt_close: 2/3 routes proto RIP.0.0.0.0+520 from 172.19.200.2
>Jun  5 10:10:56
>Jun  5 10:10:56
>Jun  5 10:10:56 rt_flash_update: updating kernel with 2 entries
>Jun  5 10:10:56
>Jun  5 10:10:56 rt_flash_update: flash update started with 2 entries
>Jun  5 10:10:56 rt_flash_update: flash update ended with 2 entries
>Jun  5 10:10:56
>CHANGE   0.0.0.0          0.0.0.0         gw 172.19.200.1    Kernel   pref 254/0 metric 0/0 tu1 <NoAdvise Int Delete Gateway>
>Jun  5 10:13:55 rt_close: 1 route proto KRT
>Jun  5 10:13:55
>Jun  5 10:13:55
>Jun  5 10:13:55 rt_flash_update: updating kernel with 1 entries
>Jun  5 10:13:55
>Jun  5 10:13:55 rt_flash_update: flash update started with 1 entries
>Jun  5 10:13:55 rt_flash_update: flash update ended with 1 entries
>Jun  5 10:13:55
>RELEASE  0.0.0.0          0.0.0.0         gw 172.19.200.1    Kernel   pref 254/0 metric 0/0 tu1 <NoAdvise Int Delete Release Gateway>
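
In the daemon.log excerpt, gated starts at 10:10:55 and ASE reports the path to the other member DOWN at 10:11:06. If you want to check your own logs for the same sequence, something like the following sketch might help. The function name is made up for illustration, and the two patterns are simply copied from the log lines quoted above, so they may need adjusting for other versions.

```shell
#!/bin/sh
# Sketch: report ASE path-down alerts that follow a gated start in a
# syslog-style file. gated_then_down is a hypothetical helper name; the
# patterns come from the daemon.log excerpt in this post.
gated_then_down() {
    awk '
        # Remember the time (third field) of the most recent gated start.
        /gated\[[0-9]+\]: Start gated/ { started = $3; next }
        # Print any path-down alert seen after a gated start.
        /HSM_PATH_STATUS:.*:DOWN/ && started != "" {
            print "gated started at " started ", path down at " $3
        }
    ' "$1"
}
```

Run it as `gated_then_down daemon.log`, pointing it at wherever your syslog writes daemon messages. It only shows that the two events occurred in that order, not that one caused the other.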
Eric wrote:
>I have had problems with gated on ASE on several installations with
>multiple network cards installed.  I have had no problems on an
>installation with only 1 network card.
>
>I think the problem is that you are running 2 network cards w/o using
>gated or routed.  Somehow, this gets ASE confused.  It decides to start
>gated, even when you don't have gated configured!  It is normal behavior
>for gated to delete routes to an interface if it doesn't hear any
>routing messages over the interface (unless you tell it otherwise).
>
>There are 2 things you can do.
>
>1) Configure gated with a minimal configuration script.
>
># begin /etc/gated.conf
>interfaces {
>   # keep gated from deleting routes on tu interfaces w/ no routing
>   # information passing over them.
>   interface tu passive ;
>}
>
>rip no ;
># end /etc/gated.conf
>
>
>2) Disable all gated stuff in ase.
>                         **********************************
>      If you have ASE configured with a primary and backup network, do not
>      disable gated!
>                         **********************************
>I did this by modifying /var/ase/sbin/ase_route_gated:
>
>            # *** EDITED BY CGI ***
>    'start')
>            # Disable gated startup
>            ;;
>    'old_start')
>            # *** END CGI EDITS ***
>
>
>            # *** EDITED BY CGI ***
>    'stop')
>            # Disable gated shutdown
>            ;;
>    'old_stop')
>            # *** END CGI EDITS ***
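
The edit Eric describes relies on how a shell case statement works: only the branch whose label matches the argument runs, so an empty 'start' branch placed ahead of the original code, with the original label renamed to 'old_start', leaves that code in the file but unreachable. The sketch below is a stand-in, not the real ase_route_gated. The function name and the echo lines are hypothetical, and it assumes the script is called with "start" and "stop", which the labels suggest but the post does not state.

```shell
#!/bin/sh
# Sketch of the branch-renaming technique; route_gated is a hypothetical
# stand-in for the case statement inside ase_route_gated.
route_gated() {
    case "$1" in
    'start')
        # Edited branch: intentionally empty, so nothing is started.
        ;;
    'old_start')
        # Original start logic, kept but no longer reached by "start".
        echo "would start gated"
        ;;
    'stop')
        # Edited branch: intentionally empty, so nothing is stopped.
        ;;
    'old_stop')
        # Original stop logic, kept but no longer reached by "stop".
        echo "would stop gated"
        ;;
    *)
        echo "usage: route_gated {start|stop}" >&2
        return 1
        ;;
    esac
}
```

Calling `route_gated start` or `route_gated stop` prints nothing, while the renamed labels still reach the old code. Undoing the change is a matter of deleting the empty branches and restoring the original labels.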
>
Received on Sun Jun 15 1997 - 07:45:51 NZST
