My original problem:
>Every once in awhile, one of our 2100a machines running 4.0d will drop
>off the network. It's very strange. The system can ping itself, but
>nothing out on the net and nothing out on the net can ping it. This
>usually happens during heavy network loads (like backups). Some
>details:
>
> AlphaServer 2100a
> 256mb RAM
> ~40gb disk space
> Primary job is NFS service
> 100mb interface (/dev/tu0) Tulip card
> 4.0d, unknown ECO level (how do I find this out???)
> NetBackup 3.1.5 client installed, Solaris 2.5.1 backup server
>
>Right now we have a script that runs out of cron every 5 minutes and
>pings a system on the outside that we know will be alive. It tries 4
>times and then shoots itself in the head. Clearly, this is not the way
>to handle this problem. But I'm the Sun and HP guy and I'm just
>starting to pickup the Dec stuff so I'm baffled...
>
>Any direction in solving this problem would be greatly appreciated.
>Thanks!
Thanks to Jie Gao, John Losey, Larye D. Parkin and T. S. Horsnell for
responding.
The general consensus was to check for a cabling or switch problem.
I plan to replace the cable first and see if that reduces the incidence
of this problem. Then I'll move the switch port if it does it again.
Unfortunately this only happens once a month or so so it's a hard problem
to debug. But I'll let everyone know what the ultimate solution is (in 6
months or so... :-) )
-Phil
--
GREYMOUSER CONSULTING
System, Network and Security Architecture and Administration
for Central Virginia (http://www.greymouser.com)
* S o l a r i s * H P - U X * L I N U X * W i n d o w s N T *
Received on Wed Dec 15 1999 - 18:58:18 NZDT