Serious problems with Networker Save and Restore

From: C.J.Bol <bol_at_Axp1.IenD.wau.nl>
Date: Wed, 18 Mar 1998 14:36:34 +0100

Dear group,

After using Networker for more then 4 years now without many problems we
now have serious problems.
Networker V4.3 is running on an AlphaStation with DU 4.0B and all
patches installed.
We have about 60 clients which are backuped with 6 parallel streams to a
DLT7000. Almost every night Networker looses all active connections at
the same time (at least within a few minutes) and continues after a few
hours with a new set of savesets. During this period pings from the
backupserver to any host on our network show normal responsetimes. All
the other hosts in our network show no problems.
Anyone seen this situation?


Here's a fragment of /nsr/logs/messages
...
Feb 20 01:10:52 nsrhost root: * sun01:/usr2 has been inactive for 12
minutes since Thu Feb 19 21:40:57 1998.
Feb 20 01:10:52 nsrhost root: * sun01:/usr2 is being abandoned by
asavegrp.
Feb 20 01:10:52 nsrhost root:
Feb 20 01:10:52 nsrhost root: * zalmet.eurog.org:/ has been inactive for
12 minutes since Thu Feb 19 21:38:58 1998.
Feb 20 01:10:52 nsrhost root: * zalmet.eurog.org:/ is being abandoned by
asavegrp.
...


Here's the output from /usr/sbin/netstat -I tu1 -p tcp
and /usr/sbin/netstat -I tu1 -p udp
before and after a backup (counters are zeroed first)
=======================================================

tu1 Ethernet counters at Tue Mar 17 21:30:02 1998

       65535 seconds since last zeroed
  4294967284 bytes received
   389581512 bytes sent
    37062850 data blocks received
     6647510 data blocks sent
    50902041 multicast bytes received
      204486 multicast blocks received
      436644 multicast bytes sent
        3171 multicast blocks sent
           0 blocks sent, initially deferred
           0 blocks sent, single collision
           0 blocks sent, multiple collisions
           0 send failures
           0 collision detect check failure
           0 receive failures
           0 unrecognized frame destination
           0 data overruns
           0 system buffer unavailable
           0 user buffer unavailable
tcp:
        44177855 packets sent
                31840776 data packets (146670357 bytes)
                62 data packets (71966 bytes) retransmitted
                621515 ack-only packets (589401 delayed)
                0 URG only packets
                2211 window probe packets
                11705558 window update packets
                7733 control packets
          74394341 packets received
                5763039 acks (for 146744257 bytes)
                10725 duplicate acks
                0 acks for unsent data
                68527594 packets (1085274019 bytes) received in-sequence
                1223 completely duplicate packets (1245641 bytes)
                2 packets with some dup. data (1280 bytes duped)
                17765 out-of-order packets (20353191 bytes)
                3906 packets (21685 bytes) of data after window
                3889 window probes
                259580 window update packets
                8 packets received after close
                20 discarded for bad checksums
               0 discarded for bad header offset fields
                0 discarded because packet too short
        2543 connection requests
        3377 connection accepts
        5900 connections established (including accepts)
        6407 connections closed (including 118 drops)
        20 embryonic connections dropped
        5762492 segments updated rtt (of 5762542 attempts)
        31 retransmit timeouts
                0 connections dropped by rexmit timeout
        2286 persist timeouts
        335 keepalive timeouts
                314 keepalive probes sent
                1 connection dropped by keepalive
udp:
        194387 packets sent
        200667 packets received
        0 incomplete headers
        0 bad data length fields
        0 bad checksums
        0 full sockets
        2105 for no port (2101 broadcasts, 0 multicasts)
        0 input packets missed pcb cache

Netstat-info at Wed Mar 18 01:20:53 MET 1998
tcp:
        52267410 packets sent
                36222386 data packets (1723024921 bytes)
                66 data packets (75209 bytes) retransmitted
                773018 ack-only packets (717032 delayed)
                0 URG only packets
                2212 window probe packets
                15258100 window update packets
                11628 control packets
         87483728 packets received
                6588490 acks (for 1723104710 bytes)
                16834 duplicate acks
                0 acks for unsent data
                80787499 packets (1046674267 bytes) received in-sequence
                4231 completely duplicate packets (5411033 bytes)
                4 packets with some dup. data (2316 bytes duped)
                32796 out-of-order packets (40428711 bytes)
                5320 packets (31743 bytes) of data after window
              5251 window probes
                289966 window update packets
                16 packets received after close
                20 discarded for bad checksums
                0 discarded for bad header offset fields
                0 discarded because packet too short
        3747 connection requests
        5070 connection accepts
        8711 connections established (including accepts)
        9659 connections closed (including 222 drops)
        105 embryonic connections dropped
        6588415 segments updated rtt (of 6588559 attempts)
        40 retransmit timeouts
                0 connections dropped by rexmit timeout
        2289 persist timeouts
        393 keepalive timeouts
                371 keepalive probes sent
                2 connections dropped by keepalive
udp:
        221196 packets sent
        228066 packets received
        0 incomplete headers
        0 bad data length fields
        0 bad checksums
        0 full sockets
        2336 for no port (2332 broadcasts, 0 multicasts)
        0 input packets missed pcb cache

========================================================================

-- 
Kees Bol
=================================================================
mailto:bol_at_axp1.iend.wau.nl
Department for Information Management and Datacommunication (I&D) 
Wageningen Agricultural University,  The Netherlands
Phone: +31 (0)317 484715                   Fax: +31 (0)317 484731
=================================================================
Received on Wed Mar 18 1998 - 14:39:51 NZST

This archive was generated by hypermail 2.4.0 : Wed Nov 08 2023 - 11:53:37 NZDT