Dear group,
After using Networker for more than 4 years without many problems, we
now have serious problems.
Networker V4.3 is running on an AlphaStation with DU 4.0B and all
patches installed.
We have about 60 clients, which are backed up with 6 parallel streams to a
DLT7000. Almost every night Networker loses all active connections at
the same time (at least within a few minutes) and continues after a few
hours with a new set of savesets. During this period pings from the
backup server to any host on our network show normal response times. All
the other hosts in our network show no problems.
Has anyone seen this situation?
Here's a fragment of /nsr/logs/messages:
...
Feb 20 01:10:52 nsrhost root: * sun01:/usr2 has been inactive for 12
minutes since Thu Feb 19 21:40:57 1998.
Feb 20 01:10:52 nsrhost root: * sun01:/usr2 is being abandoned by
asavegrp.
Feb 20 01:10:52 nsrhost root:
Feb 20 01:10:52 nsrhost root: * zalmet.eurog.org:/ has been inactive for
12 minutes since Thu Feb 19 21:38:58 1998.
Feb 20 01:10:52 nsrhost root: * zalmet.eurog.org:/ is being abandoned by
asavegrp.
...
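To show how long a save set had actually been stalled when it was abandoned, the two timestamps in the first log entry can be subtracted. This is only a rough sketch: the syslog timestamp carries no year, so 1998 is assumed, and the helper name stall_duration is made up for illustration.

```python
from datetime import datetime

def stall_duration(logged_at, last_active):
    """Return the time between the last activity and the log entry.

    logged_at   -- syslog timestamp with the year appended (assumption: 1998)
    last_active -- the "since ..." timestamp from the savegrp message
    """
    logged = datetime.strptime(logged_at, "%b %d %H:%M:%S %Y")
    active = datetime.strptime(last_active, "%a %b %d %H:%M:%S %Y")
    return logged - active

# Values taken from the sun01:/usr2 entry above.
gap = stall_duration("Feb 20 01:10:52 1998", "Thu Feb 19 21:40:57 1998")
print(gap)
```

For the sun01:/usr2 entry this gives the gap between the last recorded activity and the moment the save set was abandoned.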
Here's the output from /usr/sbin/netstat -I tu1 -p tcp
and /usr/sbin/netstat -I tu1 -p udp
before and after a backup (counters are zeroed first):
=======================================================
tu1 Ethernet counters at Tue Mar 17 21:30:02 1998
65535 seconds since last zeroed
4294967284 bytes received
389581512 bytes sent
37062850 data blocks received
6647510 data blocks sent
50902041 multicast bytes received
204486 multicast blocks received
436644 multicast bytes sent
3171 multicast blocks sent
0 blocks sent, initially deferred
0 blocks sent, single collision
0 blocks sent, multiple collisions
0 send failures
0 collision detect check failure
0 receive failures
0 unrecognized frame destination
0 data overruns
0 system buffer unavailable
0 user buffer unavailable
tcp:
44177855 packets sent
31840776 data packets (146670357 bytes)
62 data packets (71966 bytes) retransmitted
621515 ack-only packets (589401 delayed)
0 URG only packets
2211 window probe packets
11705558 window update packets
7733 control packets
74394341 packets received
5763039 acks (for 146744257 bytes)
10725 duplicate acks
0 acks for unsent data
68527594 packets (1085274019 bytes) received in-sequence
1223 completely duplicate packets (1245641 bytes)
2 packets with some dup. data (1280 bytes duped)
17765 out-of-order packets (20353191 bytes)
3906 packets (21685 bytes) of data after window
3889 window probes
259580 window update packets
8 packets received after close
20 discarded for bad checksums
0 discarded for bad header offset fields
0 discarded because packet too short
2543 connection requests
3377 connection accepts
5900 connections established (including accepts)
6407 connections closed (including 118 drops)
20 embryonic connections dropped
5762492 segments updated rtt (of 5762542 attempts)
31 retransmit timeouts
0 connections dropped by rexmit timeout
2286 persist timeouts
335 keepalive timeouts
314 keepalive probes sent
1 connection dropped by keepalive
udp:
194387 packets sent
200667 packets received
0 incomplete headers
0 bad data length fields
0 bad checksums
0 full sockets
2105 for no port (2101 broadcasts, 0 multicasts)
0 input packets missed pcb cache
Netstat-info at Wed Mar 18 01:20:53 MET 1998
tcp:
52267410 packets sent
36222386 data packets (1723024921 bytes)
66 data packets (75209 bytes) retransmitted
773018 ack-only packets (717032 delayed)
0 URG only packets
2212 window probe packets
15258100 window update packets
11628 control packets
87483728 packets received
6588490 acks (for 1723104710 bytes)
16834 duplicate acks
0 acks for unsent data
80787499 packets (1046674267 bytes) received in-sequence
4231 completely duplicate packets (5411033 bytes)
4 packets with some dup. data (2316 bytes duped)
32796 out-of-order packets (40428711 bytes)
5320 packets (31743 bytes) of data after window
5251 window probes
289966 window update packets
16 packets received after close
20 discarded for bad checksums
0 discarded for bad header offset fields
0 discarded because packet too short
3747 connection requests
5070 connection accepts
8711 connections established (including accepts)
9659 connections closed (including 222 drops)
105 embryonic connections dropped
6588415 segments updated rtt (of 6588559 attempts)
40 retransmit timeouts
0 connections dropped by rexmit timeout
2289 persist timeouts
393 keepalive timeouts
371 keepalive probes sent
2 connections dropped by keepalive
udp:
221196 packets sent
228066 packets received
0 incomplete headers
0 bad data length fields
0 bad checksums
0 full sockets
2336 for no port (2332 broadcasts, 0 multicasts)
0 input packets missed pcb cache
========================================================================
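Since the interesting numbers are the differences between the two snapshots, here is a small sketch that computes them. It assumes both snapshots list the same counters in the same order, as they do above, and pairs them by position, because some labels (for example "window update packets") occur under both the sent and the received groups. The function names parse_snapshot and deltas are made up for illustration.

```python
import re

# A counter line is an integer followed by a description,
# e.g. "62 data packets (71966 bytes) retransmitted".
COUNTER_RE = re.compile(r"^(\d+)\s+(\S.*)$")

def parse_snapshot(text):
    """Return an ordered list of (section, label, value) tuples.

    Only lines after a section header such as "tcp:" or "udp:" are read,
    so the Ethernet counters before the first header are skipped.
    """
    rows, section = [], None
    for line in text.splitlines():
        line = line.strip()
        if re.fullmatch(r"[a-z]+:", line):
            section = line[:-1]
            continue
        m = COUNTER_RE.match(line)
        if section and m:
            rows.append((section, m.group(2), int(m.group(1))))
    return rows

def deltas(before, after):
    """Pair counters by position and return (section, label, increase)."""
    b, a = parse_snapshot(before), parse_snapshot(after)
    if len(a) != len(b):
        raise ValueError("snapshots do not have the same number of counters")
    # The label is taken from the later snapshot; the byte counts embedded
    # in some labels differ between the two.
    return [(sec, label, va - vb)
            for (_, _, vb), (sec, label, va) in zip(b, a)]

if __name__ == "__main__":
    # A few lines copied from the two snapshots above.
    before = "tcp:\n44177855 packets sent\n20 embryonic connections dropped"
    after = "tcp:\n52267410 packets sent\n105 embryonic connections dropped"
    for section, label, increase in deltas(before, after):
        print(section, increase, label)
```

Feeding it the complete before and after listings gives one line per counter, which makes it easier to spot the counters that moved during the backup window.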
--
Kees Bol
=================================================================
mailto:bol_at_axp1.iend.wau.nl
Department for Information Management and Datacommunication (I&D)
Wageningen Agricultural University, The Netherlands
Phone: +31 (0)317 484715 Fax: +31 (0)317 484731
=================================================================
Received on Wed Mar 18 1998 - 14:39:51 NZST