Software Product Description
________________________________________________________________

PRODUCT NAME: DECwatchdog/Autopilot for OSF/1®, Version 2.1
                                                    SPD 47.74.01

DIGITAL                    June 1994                 AE-Q0CNB-TE

DESCRIPTION

DECwatchdog/Autopilot is a software product designed to improve
the availability of client/server applications through failure
detection and automated application recovery.

A copy of the DECwatchdog/Autopilot software runs on each
computer in a group. A group is composed of 2 to 16 computers
communicating across a local area network and collaborating to
perform a critical application. A critical application, together
with the description of the group of computers on which it runs,
forms a DECwatchdog/Autopilot mission.

DECwatchdog/Autopilot can manage several missions simultaneously
on several overlapping or disjoint groups of computers to
automate failover of several independent applications.

Used on top of redundant or partially redundant hardware,
DECwatchdog/Autopilot provides unattended operation and mission
continuity.

DECwatchdog/Autopilot performs four different roles:

o  Mutual watch
o  Distributed lock management
o  Master/Slave management
o  Automated recovery

Features

Mutual Watch

For each mission, DECwatchdog/Autopilot periodically watches the
computers of the group. It detects CPU failures and network
failures. The detection delay can be set from 5 seconds to
2 minutes.

When the group uses redundant networks, DECwatchdog/Autopilot
differentiates network failures from computer failures. When the
group includes more than 2 computers, DECwatchdog/Autopilot
localizes network failures by deduction.

Distributed Lock Management

For each mission, the group of computers can share a set of up
to 24 resources.
A resource represents either:

o  A physical device identifier such as a disk, tape, line, or
   peripheral
o  An access path to data or NFS files
o  Any virtual resource, flag, or event

Master/Slave Management

A computer can play one of three roles in a mission: master,
slave, or satellite. DECwatchdog/Autopilot automatically assigns
the master and slave roles for each mission, according to the
user's preferences. A computer that is neither master nor slave
is called a satellite computer.

Automated Recovery

DECwatchdog/Autopilot can automatically initiate standard
command files on each computer upon any state transition of:

o  computers
o  networks
o  resources

Command files can include any application recovery action.

Other Features

DECwatchdog/Autopilot is specifically designed to manage
heterogeneous groups of computers. The computers do not
necessarily run the same operating system, but a compatible
version of DECwatchdog/Autopilot must be present on each of
them.

DECwatchdog/Autopilot supports planned service interruptions for
hardware maintenance or operating system upgrades. It allows
on-line reconfiguration of applications and groups.

DECwatchdog/Autopilot does not prevent erroneous access
conflicts on shared peripherals. Additional safe locking
mechanisms should be invoked in command files when necessary.
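
As an illustration of the Mutual Watch feature described above,
the following Python sketch shows one way a watcher could tell a
network failure from a computer failure when a group uses
redundant networks. It is not taken from the product. The
function name classify_peer, the Diagnosis values, and the idea
of tracking a last-heard timestamp per network are all
assumptions made for this example. Only the 5-second to 2-minute
detection delay range comes from this description.

```python
from enum import Enum


class Diagnosis(Enum):
    """Hypothetical outcomes of watching one peer computer."""
    HEALTHY = "healthy"
    NETWORK_FAILURE = "network failure"
    COMPUTER_FAILURE = "computer failure"
    UNDETERMINED = "undetermined"


# Detection delay bounds stated in this description (5 seconds to 2 minutes).
MIN_DELAY_S = 5.0
MAX_DELAY_S = 120.0


def classify_peer(last_heard, now, detection_delay):
    """Classify one peer from the time it was last heard on each network.

    last_heard: dict mapping a network name to the timestamp (seconds)
                of the last message received from the peer on that network.
    now:        current timestamp in seconds.
    detection_delay: silence longer than this counts as a failure.

    Returns (Diagnosis, list of silent network names).
    """
    if not MIN_DELAY_S <= detection_delay <= MAX_DELAY_S:
        raise ValueError("detection delay must be between 5 and 120 seconds")
    if not last_heard:
        raise ValueError("at least one network is required")

    # Networks on which the peer has been silent for too long.
    silent = [net for net, seen in last_heard.items()
              if now - seen > detection_delay]

    if not silent:
        return Diagnosis.HEALTHY, silent
    if len(silent) < len(last_heard):
        # The peer still answers on another link, so it is alive:
        # the silent link is the suspect.
        return Diagnosis.NETWORK_FAILURE, silent
    if len(last_heard) >= 2:
        # Silent on every redundant link: assume the computer failed.
        return Diagnosis.COMPUTER_FAILURE, silent
    # A single network cannot distinguish the two cases.
    return Diagnosis.UNDETERMINED, silent
```

For example, a peer last heard 3 seconds ago on one network and
13 seconds ago on the other, with a 5-second detection delay, is
classified as a network failure on the second network. With a
single network the sketch reports an undetermined diagnosis,
which matches the statement that differentiation relies on
redundant networks.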

HARDWARE REQUIREMENTS

Processors Supported

Alpha AXP:   DEC 150 AXP,
             DEC 2000 Model 300 AXP Server,
             DEC 2000 Model 500 AXP Server

             DEC 2100 Server A500MP

             DEC 3000 Model 300 AXP Workstation,
             DEC 3000 Model 300L AXP Workstation,
             DEC 3000 Model 400 AXP Workstation,
             DEC 3000 Model 400 AXP Server,
             DEC 3000 Model 500 AXP Workstation,
             DEC 3000 Model 500 AXP Server,
             DEC 3000 Model 500X AXP Server,
             DEC 3000 Model 600 AXP Workstation,
             DEC 3000 Model 600S AXP Server,
             DEC 3000 Model 800 AXP Workstation,
             DEC 3000 Model 800S AXP Server

             DEC 4000 Model 610 AXP System,
             DEC 4000 Model 710 AXP System

             DEC 7000 Model 610 AXP System

             DEC 10000 Model 610 AXP System

Disk Space Requirements

Disk space required for installation/use:

Root file system:              615 KB

Other file systems:    usr      40 KB
                       var       4 KB

These counts refer to the disk space required on the system
disk. The sizes are approximate; actual sizes may vary depending
on the user's system environment, configuration, and software
options.

OPTIONAL HARDWARE

The full DECwatchdog functionality is available on redundant
DECsystems communicating through two independent Ethernet links.

SOFTWARE REQUIREMENTS

DEC OSF/1 AXP Operating System Version 2.0

GROWTH CONSIDERATIONS

The minimum hardware/software requirements for any future
version of this product may be different from the requirements
for the current version.

DISTRIBUTION MEDIA

9-track 1600 BPI Magtape, TK50 Streaming Tape.

ORDERING INFORMATION

Software Licenses:            QL-0ZMA*-**
Software Media:               QA-0ZMA*-**
Software Documentation:       QA-0ZMAA-GZ
Software Product Services:    QT-0ZMA*-**

*  Denotes variant fields. For additional information on
   available licenses, services, and media, refer to the
   appropriate price book.

SOFTWARE LICENSING

This software is furnished only under a license.
For more information about Digital's licensing terms and
policies, contact your local Digital office.

License Management Facility Support:

This layered product supports the DEC OSF/1 License Management
Facility.

License units for this product are allocated on a CPU-capacity
basis.

For more information on the DEC OSF/1 License Management
Facility, refer to the DEC OSF/1 Operating System Software
Product Description (SPD 41.61.06) or documentation.

SOFTWARE PRODUCT SERVICES

Digital can help the user design a redundant system integrating
SCSI disk multiplexors, uninterruptible power supplies, and
communications servers that can be advantageously combined with
DECwatchdog/Autopilot.

For more information on these or other services, please contact
your local Digital office.

SOFTWARE WARRANTY

Warranty for this software product is provided by Digital with
the purchase of a license for the product as defined in the
Software Warranty Addendum of this SPD.

The above information is valid at time of release. Please
contact your local Digital office for the most up-to-date
information.

© 1994 Digital Equipment Corporation. All rights reserved.

®    OSF/1 is a registered trademark of Open Software
     Foundation.

[TM] The DIGITAL Logo, Alpha AXP, AXP, DEC, DECstation,
     DECsystem, MicroVAX, VAX, VAXcluster, VAXft, VAXserver,
     VAXstation, and VMS are trademarks of Digital Equipment
     Corporation.