Q: ASE 1.4 + DU 4.0b hangs at times

From: Gunther Feuereisen <gunther_at_ibm.net>
Date: Thu, 05 Jun 1997 14:52:01 -1000

Hi,

I'm currently having a problem with DU 4.0b and ASE 1.4:

My config:

2 x 4000's
2 x KZPSA's designated pza0 and pza1
On each bus, a BA356 with disks.
I mirror each disk across the shared bus using LSM. (No HSZ40's)
I have a private network on dka450's configured at tu0 - BNC
I have my lan cards (dka500's) configured at tu1 - UTP
I have tu0 to be 192.0.0.1 and 192.0.0.2 for each machine (private)
I have tu1 to be 172.19.200.41 and 172.19.200.42 for each machine (LAN)
I have a static route in /etc/routes (default 172.19.200.1), which is my
gateway.
I can set up everything (my services, disks, floating IP) and it all
works fine.
I can fail over, crash a machine, etc., and DECsafe works fine.

The problem:
-----------
When I try to modify a service by adding or creating a start or stop
script which does something as simple as:

date > /tmp/TEST.is.running

ASE stops the service, deletes it, adds it again, and when it tries to
start it, it hangs the machine.
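
To be concrete, the start script is nothing more elaborate than the sketch
below. The marker path is the one quoted above; the MARKER variable, the
function name and the explicit status check are illustrative assumptions
only, not anything ASE is known to require.

```sh
#!/bin/sh
# Sketch of a trivial start action script (names are illustrative).
# MARKER defaults to the file used in the example above.
MARKER=${MARKER:-/tmp/TEST.is.running}

start_action() {
    # Record the start time; report failure if the marker cannot be written.
    date > "$MARKER" || return 1
    return 0
}

# Exit non-zero only on failure, so the caller can see the result.
start_action || exit 1
```

Nothing in it touches the network or runs in the background, which is why
the hang is so surprising.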

Also, it seems to clobber the primary network interface that I configured
with ASE, so that the machines can no longer talk to each other over the
network.

The only way out is to reboot both machines; then everything starts
normally, as it did before I added the start script.

I seem to be getting errors about gated. I don't remember ever seeing that
I needed gated, and I've never used it before. If I start gated manually,
it also kills the private network interfaces, so that seems to be the
culprit.
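
A rough way to line up the gated start against the ASE path-down alerts is
to pull both kinds of message out of daemon.log in order. This is only a
sketch: the function name is made up, and the log file name is an
assumption (pass the real daemon.log as the first argument).

```sh
#!/bin/sh
# Sketch: list gated and ASE HSM messages from a syslog file in order,
# to see whether the HSM alerts follow a gated start.
gated_timeline() {
    # Two basic-regex patterns: gated's "gated[pid]" tag, and ASE HSM lines.
    grep -e 'gated\[' -e ' HSM ' "$1"
}

# Assumed default: a daemon.log in the current directory.
LOG=${1:-daemon.log}
if [ -r "$LOG" ]; then gated_timeline "$LOG"; fi
```

On the excerpt below, the gated start at 10:10:55 is followed about ten
seconds later by the HSM warnings about the 192.0.0.2 path.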

I've run ASE 1.3 on DU 3.2C, D-2 and G with no problems whatsoever, and
I've run Production Server with multiple ASEs on 4.0b and 1.4 (on 4100s)
without any problems either.

Any help or advice appreciated. All I can think is that I've done
something silly along the way and not realised it.

many thanks,
gunther
--
Excerpts from the logs are:
daemon.log: from the time I said yes to change the service
--
Jun  5 10:10:38 pofrea ASE: pofrea Agent Notice: stopping service popfre
Jun  5 10:10:41 pofrea ASE: pofrea Director Notice: stopped popfre on pofrea
Jun  5 10:10:44 pofrea ASE: pofrea Agent Notice: deleting service popfre
Jun  5 10:10:44 pofrea ASE: pofreb Agent Notice: deleting service popfre
Jun  5 10:10:55 pofrea gated[2010]: Start gated[2010] version R3_5Alpha_11
built Fri Nov 15 21:45:16 EST 1996
Jun  5 10:10:55 pofrea ASE: pofrea Director Notice: deleted service
Jun  5 10:10:55 pofrea gated[2010]: trace_on: tracing to
"/usr/tmp/gated.log" started
Jun  5 10:10:55 pofrea gated[2010]: inet_init: *WARNING* IP forwarding
disabled!
Jun  5 10:10:56 pofrea gated[2010]: KRT READ REMNANT 0.0.0.0         mask
255.255.255     router (null)
 flags <UP>1: ignoring
Jun  5 10:10:56 pofrea gated[2010]: KRT READ REMNANT 128.169         mask
255.255         router 172.19.200.2
 flags <UP GW DYN>13: queuing delete for redirect
Jun  5 10:10:56 pofrea gated[2010]: rt_add: MARTIAN will not be propagated
192/255.255.255 gw 192.0.0.1 Direct
Jun  5 10:10:56 pofrea gated[2010]: Commence routing updates
Jun  5 10:10:56 pofrea gated[2010]: if_rtup: UP route for interface tu0
192.0.0.1/255.255.255
Jun  5 10:10:56 pofrea gated[2010]: rt_add: MARTIAN will not be propagated
192/255.255.255 gw 192.0.0.1 Direct
Jun  5 10:11:06 pofrea ASE: local HSM Warning: Can't ping pofreb over the
network
Jun  5 10:11:06 pofrea ASE: local HSM ***ALERT: HSM_PATH_STATUS:192.0.0.2:DOWN
Jun  5 10:11:06 pofrea ASE: local HSM Warning: member pofreb is DOWN
gated.log
--
Jun  5 10:10:55 trace_on: Tracing to "/usr/tmp/gated.log" started
Jun  5 10:10:55 
Jun  5 10:10:55 Tracing flags enabled: general
Jun  5 10:10:55 
Jun  5 10:10:55 inet_init: *WARNING* IP forwarding disabled!
Jun  5 10:10:55 inet_routerid_notify: Router ID: 172.19.200.41
Jun  5 10:10:55 
Jun  5 10:10:55 
Jun  5 10:10:55 krt_rtread: Initial routes read from kernel (radix tree via
kmem):
ADD      0.0.0.0          0.0.0.0         gw 172.19.200.1    Kernel   pref
254/0 metric 0/0 tu1 <NoAdvise Int Active Gateway>
KRT READ REMNANT 0.0.0.0         mask 255.255.255     router (null)
 flags <UP>1: ignoring
ADD      127.0.0.1        255.255.255.255 gw 127.0.0.1       Kernel   pref
-254/0 metric 0/0 lo0 <NotInstall NoAdvise Int Hidden Gateway>
ADD      127.0.0.1        255.255.255.255  Kernel   pref 0/0 metric 0/0
<NoAdvise Active Gateway>
CHANGE   127.0.0.1        255.255.255.255  Kernel   pref 0/0 metric 0/0
<NoAdvise Delete Gateway>
RELEASE  127.0.0.1        255.255.255.255 gw 127.0.0.1       Kernel   pref
-254/0 metric 0/0 lo0 <NotInstall NoAdvise Int Release Gateway>
KRT READ REMNANT 128.169         mask 255.255         router 172.19.200.2
 flags <UP GW DYN>13: queuing delete for redirect
ADD      172.19.200       255.255.255     gw 172.19.200.41   Kernel   pref
-254/0 metric 0/0 tu1 <NotInstall NoAdvise Int Hidden Gateway>
ADD      172.19.200       255.255.255      Kernel   pref 0/0 metric 0/0
<NoAdvise Active Gateway>
CHANGE   172.19.200       255.255.255      Kernel   pref 0/0 metric 0/0
<NoAdvise Delete Gateway>
RELEASE  172.19.200       255.255.255     gw 172.19.200.41   Kernel   pref
-254/0 metric 0/0 tu1 <NotInstall NoAdvise Int Release Gateway>
ADD      192              255.255.255     gw 192.0.0.1       Kernel   pref
-254/0 metric 0/0 tu0 <NotInstall NoAdvise Int Hidden Gateway>
ADD      192              255.255.255      Kernel   pref 0/0 metric 0/0
<NoAdvise Active Gateway>
CHANGE   192              255.255.255      Kernel   pref 0/0 metric 0/0
<NoAdvise Delete Gateway>
RELEASE  192              255.255.255     gw 192.0.0.1       Kernel   pref
-254/0 metric 0/0 tu0 <NotInstall NoAdvise Int Release Gateway>
Jun  5 10:10:55 rt_close: 13 routes proto KRT
Jun  5 10:10:55 
Jun  5 10:10:55 if_ifachange:	192.0.0.1
Jun  5 10:10:55 if_ifachange:		index: 1  name: tu0  state: <Up Broadcast
Multicast Simplex NoAge>
Jun  5 10:10:55 if_ifachange:		change: <>  metric: 0  route: not installed
Jun  5 10:10:55 if_ifachange:		preference: 0  down: 120  refcount: 2  mtu:
1436
Jun  5 10:10:55 if_ifachange:		broadaddr: 192.0.0.255
Jun  5 10:10:55 if_ifachange:		subnet: 192  subnetmask: 255.255.255
Jun  5 10:10:55 
Jun  5 10:10:55 if_rtup: ADD route for interface tu0 192.0.0.1/255.255.255
ADD      192              255.255.255     gw 192.0.0.1       Direct   pref
0/0 metric 0/0 tu0 <NotInstall NoAdvise Int Active Retain>
Jun  5 10:10:55 rt_add: MARTIAN will not be propagated 192/255.255.255 gw
192.0.0.1 Direct
Jun  5 10:10:55 rt_close: 1 route proto IF
Jun  5 10:10:55 
Jun  5 10:10:55 if_ifachange:	172.19.200.41
Jun  5 10:10:55 if_ifachange:		index: 2  name: tu1  state: <Up Broadcast
Multicast Simplex NoAge>
Jun  5 10:10:55 if_ifachange:		change: <>  metric: 0  route: not installed
Jun  5 10:10:55 if_ifachange:		preference: 0  down: 120  refcount: 3  mtu:
1436
Jun  5 10:10:55 if_ifachange:		broadaddr: 172.19.200.255
Jun  5 10:10:55 if_ifachange:		subnet: 172.19.200  subnetmask: 255.255.255
Jun  5 10:10:55 
Jun  5 10:10:55 if_rtup: ADD route for interface tu1 172.19.200.41/255.255.255
ADD      172.19.200       255.255.255     gw 172.19.200.41   Direct   pref
0/0 metric 0/0 tu1 <Int Active Retain>
Jun  5 10:10:55 rt_close: 1 route proto IF
Jun  5 10:10:55 
Jun  5 10:10:55 if_ifachange:	127.0.0.1
Jun  5 10:10:55 if_ifachange:		index: 4  name: lo0  state: <Up Loopback
Multicast Simplex NoAge>
Jun  5 10:10:55 if_ifachange:		change: <>  metric: 0  route: not installed
Jun  5 10:10:55 if_ifachange:		preference: 0  down: 120  refcount: 2  mtu:
1472
Jun  5 10:10:55 if_ifachange:		subnetmask: 255.255.255.255
Jun  5 10:10:55 
Jun  5 10:10:55 if_rtup: ADD route for interface lo0 127.0.0.1/255.255.255.255
ADD      127.0.0.1        255.255.255.255 gw 127.0.0.1       Direct   pref
0/0 metric 0/0 lo0 <NoAdvise Active Retain>
Jun  5 10:10:55 rt_close: 1 route proto IF
Jun  5 10:10:55 
Jun  5 10:10:55 
Jun  5 10:10:55 ***Routes are being installed in kernel
Jun  5 10:10:55 
Jun  5 10:10:55 
Jun  5 10:10:55 Commence routing updates
Jun  5 10:10:55 
Jun  5 10:10:56 inet_routerid_notify: Router ID: 172.19.200.41
Jun  5 10:10:56 
Jun  5 10:10:56 if_ifachange:	192.0.0.1
Jun  5 10:10:56 if_ifachange:		index: 1  name: tu0  state: <Up Broadcast
Multicast Simplex NoAge>
Jun  5 10:10:56 if_ifachange:		change: <>  metric: 0  route: installed
Jun  5 10:10:56 if_ifachange:		preference: 0  down: 120  refcount: 3  mtu:
1436
Jun  5 10:10:56 if_ifachange:		broadaddr: 192.0.0.255
Jun  5 10:10:56 if_ifachange:		subnet: 192  subnetmask: 255.255.255
Jun  5 10:10:56 
Jun  5 10:10:56 if_rtup: UP route for interface tu0 192.0.0.1/255.255.255
ADD      192              255.255.255     gw 192.0.0.1       Direct   pref
0/0 metric 0/0 tu0 <NotInstall NoAdvise Int Retain>
Jun  5 10:10:56 rt_add: MARTIAN will not be propagated 192/255.255.255 gw
192.0.0.1 Direct
CHANGE   192              255.255.255     gw 192.0.0.1       Direct   pref
0/0 metric 0/0 tu0 <NotInstall NoAdvise Int Active Retain>
RELEASE  192              255.255.255     gw 192.0.0.1       Direct   pref
0/0 metric 0/0 tu0 <NotInstall NoAdvise Int Release Retain>
Jun  5 10:10:56 rt_close: 2 routes proto IF
Jun  5 10:10:56 
ADD      127              255             gw 127.0.0.1       Static   pref
0/0 metric 0/0 lo0 <NoAdvise Int Active Reject>
Jun  5 10:10:56 rt_close: 1 route proto RT
Jun  5 10:10:56 
Jun  5 10:10:56 if_ifachange:	172.19.200.41
Jun  5 10:10:56 if_ifachange:		index: 2  name: tu1  state: <Up Broadcast
Multicast Simplex NoAge>
Jun  5 10:10:56 if_ifachange:		change: <>  metric: 0  route: installed
Jun  5 10:10:56 if_ifachange:		preference: 0  down: 120  refcount: 4  mtu:
1436
Jun  5 10:10:56 if_ifachange:		broadaddr: 172.19.200.255
Jun  5 10:10:56 if_ifachange:		subnet: 172.19.200  subnetmask: 255.255.255
Jun  5 10:10:56 
Jun  5 10:10:56 if_ifachange:	127.0.0.1
Jun  5 10:10:56 if_ifachange:		index: 4  name: lo0  state: <Up Loopback
Multicast Simplex NoAge>
Jun  5 10:10:56 if_ifachange:		change: <>  metric: 0  route: installed
Jun  5 10:10:56 if_ifachange:		preference: 0  down: 120  refcount: 4  mtu:
1472
Jun  5 10:10:56 if_ifachange:		subnetmask: 255.255.255.255
Jun  5 10:10:56 
Jun  5 10:10:56 
Jun  5 10:10:56 rt_new_policy: new policy started with 5 entries
RELEASE  127.0.0.1        255.255.255.255  Kernel   pref 0/0 metric 0/0
<NoAdvise Delete Release Gateway>
RELEASE  172.19.200       255.255.255      Kernel   pref 0/0 metric 0/0
<NoAdvise Delete Release Gateway>
RELEASE  192              255.255.255      Kernel   pref 0/0 metric 0/0
<NoAdvise Delete Release Gateway>
Jun  5 10:10:56 rt_new_policy: new policy ended with 5 entries
Jun  5 10:10:56 
Jun  5 10:10:56 stdio: bootname /vmunix
ADD      203.102.144      255.255.255     gw 172.19.200.2    RIP      pref
100/0 metric 2/0 tu1 <Int Active Gateway>
ADD      203.12.186       255.255.255     gw 172.19.200.2    RIP      pref
100/0 metric 2/0 tu1 <Int Active Gateway>
Jun  5 10:10:56 rt_close: 2/3 routes proto RIP.0.0.0.0+520 from 172.19.200.2
Jun  5 10:10:56 
Jun  5 10:10:56 
Jun  5 10:10:56 rt_flash_update: updating kernel with 2 entries
Jun  5 10:10:56 
Jun  5 10:10:56 rt_flash_update: flash update started with 2 entries
Jun  5 10:10:56 rt_flash_update: flash update ended with 2 entries
Jun  5 10:10:56 
CHANGE   0.0.0.0          0.0.0.0         gw 172.19.200.1    Kernel   pref
254/0 metric 0/0 tu1 <NoAdvise Int Delete Gateway>
Jun  5 10:13:55 rt_close: 1 route proto KRT
Jun  5 10:13:55 
Jun  5 10:13:55 
Jun  5 10:13:55 rt_flash_update: updating kernel with 1 entries
Jun  5 10:13:55 
Jun  5 10:13:55 rt_flash_update: flash update started with 1 entries
Jun  5 10:13:55 rt_flash_update: flash update ended with 1 entries
Jun  5 10:13:55 
RELEASE  0.0.0.0          0.0.0.0         gw 172.19.200.1    Kernel   pref
254/0 metric 0/0 tu1 <NoAdvise Int Delete Release Gateway>
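
One thing that may be worth checking in the trace above is the MARTIAN
messages for the 192.0.0 network on tu0. A minimal sketch for summarising
them follows; the default path is the one gated reports in its trace_on
message, the function name is made up, and it assumes each message sits on
a single line in the real file (the copies above were wrapped by the
mailer).

```sh
#!/bin/sh
# Sketch: summarise gated's MARTIAN messages from a trace file.
martian_routes() {
    # Drop the timestamp and "rt_add: " prefix, then count distinct messages.
    grep 'MARTIAN' "$1" | sed 's/^.*rt_add: //' | sort | uniq -c
}

# Default taken from the trace_on line in the logs; override with $1.
LOG=${1:-/usr/tmp/gated.log}
if [ -r "$LOG" ]; then martian_routes "$LOG"; fi
```

This only reports what gated logged; whether the martian handling is what
takes the private interface down is still an open question.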
Received on Thu Jun 05 1997 - 07:33:35 NZST
