![]() |
![]() HP OpenVMS Systemsask the wizard |
![]() |
The Question is: It seems we are getting a number of decnet event 4.18 Adjacency down when we begin certain backups. These are at a time when the system is not busy (VAX 4106 64megs RAM, TZ87 DLT IV SCSI tape drive.) Backup account has the following: WSQUOTA 16384 WSEXTENT 16384 PGFLQUOTA 32768 FILLM 128 DIOLM 4096 ASTLM 4096 BIOLM 128 BYTLM 65536 ENQLM 256 The system and network are fine and dandy all through the day but almost the instant we begin one of our nightly backups we begin to get a flury of decnet events. We have multiple HSD10s with SCSI Bus Adapters, the 4106 is part of a two node cluster with a Vax4505 and the above mentioned DLT drive is hanging off the Scsi port on the VAX 4106. The decnet events do not always occur. The question is simply where else can we look for the cause of this after having insured network connectivity is not the is sue? Is the DLT IV dirve perhaps ging VMS backup issues? We have noticed errors on devices on the VAX not causing the decnetevents after they occur. Error Count PAB0: 2 PAA0: 3 PEA0: 2 Is this a matter of some sysgen paramaters? Hoping this isn't to vauge. Thank you The Answer is : The recommended BACKUP process quotas are listed in the current OpenVMS documentation. This could be a process quota error, or (more likely) this is a network congestion or system parameter error. As a first step, the OpenVMS Wizard would clean out MODPARAMS.DAT of any absolete settings (when ADD_ or MIN_ or MAX_ variants are available), and the OpenVMS Wizard would also remove any entries for products or versions no longer relevent), and AUTOGEN the system with FEEDBACK. Of central interest would be the DECnet counters -- see the OpenVMS FAQ for how to zero these values on recent OpenVMS versions. You will probably find retransmission and back-off (retransmission) errors reported by the DECnet counters. Assuming DECnet Phase IV, you can use the NCP command SHOW KNOWN LINE COUNTERS to see the current values of the DECnet communications lines. Also assuming DECnet Phase IV, the NCP command ZERO KNOWN LINE COUNTERS will clear the counters. You may also find that the particular device(s) targeted by the BACKUP are central to OpenVMS system or DECnet operations, and that the BACKUP is saturating the I/O capabilities of these devices. MONITOR DISK/ITEM=QUEUE_LENGTH can be used to determine I/O rates, and any values above 0.5 effectively indicate I/O saturation; that half of all I/O operations to the disk are waiting. This can point to I/O loading, or to insufficient physical memory for I/O caching, or severely fragmented disks, poorly-tuned or internally-fragmented RMS (indexed) files, or other related I/O problems. You might also find that there is bus contention -- your VAX systems are comparatively old, and your I/O buses correspondingly slow. Newer DLT tapes are particularly good at driving the SCSI bus quickly, and you might be encountering bus-level contention on other devices sharing the same SCSI. The obvious approach here involves additional or (better) additional faster) buses. Please see the Performance manual in the OpenVMS documentation set. This manual has information on a systematic performance investigation. Also potentially of interest will be disk defragmentation and RMS indexed file conversion (and/or RMS file tuning). Also potentially the addition of physical memory. The performance manual should help you identify the system bottleneck(s) you are encountering here.
|