DIGITAL Software Product Description ___________________________________________________________________ PRODUCT NAME: Intelligent Peripheral Fault Manager SPD 60.35.00 For Digital UNIX, Version 1.0 DESCRIPTION Intelligent Peripheral Fault Manager For Digital UNIX (IPFM), a layered software product, provides fault management services for the AlphaServer Intelligent Peripheral Platform product. The IP Fault Manager basic services consist of event monitoring and control of the AlphaServer IP Platform. IP Fault Manager works together with and depends upon POLYCENTER System Watchdog (PSW), another Digital UNIX layered product, to pro- vide the full range of these IP services. The AlphaServer IP Platform, a rackmounted product designed by Computer Special Systems, provides resources for sophisticated voice process- ing, FAX, voice recognition, and voice messaging services for telecom- munication service providers in the Intelligent Network marketplace. The AlphaServer IP Platform is available in two basic variations; a simplex system and a duplex system. A simplex variation consists of: o An AlphaServer 1000 system with an EISA/ISA bus o An ISA bus expansion chassis (Dialogic Telco Platform with IP alarm panel) o A StorageWorks storage shelf with SCSI disks o A DEChub 90 communication hub The duplex IP variation replicates every major hardware component of the simplex system to provide a highly available system. To enhance its availability, the IP Fault Manager also utilizes DECsafe Available Server to provide failover services if an AlphaServer system fails. The simplex IP system requires a single IP Fault Manager license with March 1996 AE-QN0GA-TE POLYCENTER System Watchdog, whereas the duplex IP system requires two sets of licenses. Each simplex or duplex IP system also requires an AlphaServer IP Platform console workstation, that is, an AlphaStation system running the Digital UNIX operating system. It acts as a console device for the AlphaServer system as well as a management operations center to display the fault status of the AlphaServer IP Platform in a window dedicated for fault management. FUNCTIONAL DESCRIPTION The IP Fault Manager executes on an AlphaServer 1000 system running the Digital UNIX operating system. The purpose of the IP Fault Manager is to monitor and control the fault status of the AlphaServer IP Platform. It sends messages to the IP alarm panel to perform activities such as setting or clearing the alarm status indicator LEDs or modifying the system status indicator LEDs. It also monitors the SCSI disks in the AlphaServer 1000 system and the StorageWorks storage shelf. For the IP Fault Manager detected events, messages are sent to the POLYCENTER System Watchdog to indicate what alarms should be set or cleared, as well as the name of the event that triggered the alarm action. The IP alarm panel monitors the internal fault status of the ISA bus expansion chassis and sends it by means of a serial line to the IP Fault Manager, which then forwards it to the POLYCENTER System Watchdog for logging and displaying. Other events on the AlphaServer system are mon- itored by POLYCENTER System Watchdog. These monitored events are then logged and their existence indicated on the IP alarm panel by the IP Fault Manager. 2 POLYCENTER System Watchdog consists of two components: a Consolidator and an Agent. An Agent is installed on each AlphaServer system that requires monitoring in the AlphaServer IP Platform subnetwork and pe- riodically scans the system, the devices, and data structures attached to the system for events and problems. When an event is detected, this information is sent to the AlphaServer system executing the Consolidator, which then carries out the predetermined action and notification. The Consolidator, aside from receiving and handling Agent reported events, also periodically attempts to connect through the network to the Agent nodes to detect any network node availability, or Agent events in the process. The Consolidator also reports on specified events for either the simplex or duplex IP systems by sending them to the IP Platform console workstation to be displayed in a workstation window for use by the system operator responsible for IP fault management. The IP Fault Manager has interfaces to other components of the IP sys- tem as explained below: o IP alarm panel The IP alarm panel displays alarm and system status for the AlphaServer Intelligent Peripheral Platform. It is connected to the AlphaServer 1000 system through two serial ports. One serial port is used by the IP Fault Manager to perform activities such as setting or clearing the alarm status indicator (Critical, Major, Minor) or modifying the system status indicator (Active, Standby, Out of Service, Unavailable). Another serial port, intended for the maintenance center, is used by the alarm panel to send status messages to the IP Fault Manager, and by the latter to send alarm cutoff messages to the alarm panel. o AlphaServer 1000 system management registers The AlphaServer 1000 system has two system management registers, a server management alarm register and a PCI interrupt alarm reg- ister. The IP Fault Manager polls the status of these registers by means of a special driver and reports them to the POLYCENTER System Watchdog Agent. 3 o AlphaServer 1000 system log files These log files include the standard Digital UNIX system log files and the PSW external event log file. The Digital UNIX log files collect information from various AlphaServer subsystems such as hardware devices. The PSW external event log file is used to com- municate user-specified events to the PSW Agent. It can also be written to by an IP application to communicate application events to the PSW Agent for processing. o IP Platform console workstation The console workstation has a multifunction purpose. It provides the IP system operator with a terminal window to: - Use as a system console for the AlphaServer 1000 system. - Control manually the state of the IP alarm panel by setting or clearing alarm indicators or modifying system status by means of a menu interface. - Display alarm messages about the fault management status of the IP system. The window is driven by POLYCENTER System Watchdog Consolidator. o POLYCENTER System Watchdog (PSW) Agents Agents run on each AlphaServer 1000 processor; one for the simplex version and two for the duplex AlphaServer IP Platform version. Agents receive event monitoring parameters from the POLYCENTER System Watchdog Consolidator, which displays event messages on the IP Platform console workstation display. PSW Agents also mon- itor the system log files for user-specified events as defined in the configuration files. PSW Agents then notify the PSW Consolidator when monitored events occur. o POLYCENTER System Watchdog (PSW) Consolidator One Consolidator runs on each AlphaServer 1000 system; one version for the simplex and two for the duplex IP version. A Consolidator sends event collection instructions to the PSW Agents and an- alyzes the events collected by the Agents. Based upon the event monitoring parameters and their corresponding actions (defined 4 in the PSW configuration file) the Consolidator instructs the Actor-Manager to perform the corresponding action. o POLYCENTER System Watchdog (PSW) Actor-Manager The PSW Actor-Manager is part of the PSW Consolidator. It per- forms the action associated with a specific event. For certain events, the PSW Consolidator action is to communicate with the IP Fault Manager to send messages to the IP alarm panel. o IP Alarm log file This log file records the occurrences of all IP events and is accessible by the IP system operator. It serves as a permanent record of all events that are also displayed on the IP Platform console workstation display. User Interfaces The IP Fault Manager offers four user interfaces: 1. A user application API that enables application developers to mon- itor and control application-oriented fault events to be integrated with the ones already handled by the IPFM. The IPFM can then send messages to the IP alarm panel to set and clear alarms on behalf of the application. 2. A user menu located on a window on the IP Platform console work- station that allows the system operator to manually set and clear alarms as well as system status LEDs on the IP alarm panel. 3. An IP alarm log file that provides a recorded fault event history of all events monitored by the IP alarm panel, the IP Fault Manager itself, and the user application. 4. An IP Platform console workstation window that displays all of the events recorded in the IP alarm log file. The POLYCENTER System Watchdog Consolidator is responsible for sending messages to this window for either the simplex or duplex IP systems. 5 HARDWARE REQUIREMENTS Processors Supported AlphaServer:1000 Model 4/200 1000 Model 4/233 1000 Model 4/266 The IP Fault Manager has been designed to execute on AlphaServer 1000 4/2xx processors to take advantage of their system management regis- ters for IP fault management. Other Hardware ISA bus expansion chassis with IP alarm panel (Dialogic Telco Platform) Disk Space Requirements for AlphaServer and Digital UNIX Systems (Block Cluster Size = 1) Disk space required for 1.5 MB installation: This count refers to the disk space required on the system disk. The size is approximate; actual size may vary depending on the user's sys- tem environment, configuration, and software options. SOFTWARE REQUIREMENTS Digital UNIX V3.2C POLYCENTER System Watchdog V2.2 SOFTWARE LICENSING This software is furnished only under a license. For more information about Digital's licensing terms and policies, contact your local Digital office. License Management Facility Support This layered product supports the License Management Facility. 6 License units for this product are allocated on a Traditional or Un- limited System Use basis. For more information on the License Management Facility, refer to the Digital UNIX Operating System Software Product Description (SPD 41.61.xx) or the License Management Facility manual, which is part of the Digital UNIX Operating System documentation set. GROWTH CONSIDERATIONS The minimum hardware/software requirements for any future version of this product may be different from the requirements for the current version. DISTRIBUTION MEDIA CD-ROM This product is also available as part of the Digital UNIX Consoli- dated Software Distribution on CD-ROM. ORDERING INFORMATION Intelligent Peripheral Fault Manager For Digital UNIX Software Licenses: QL-4K4A*-** Software Media: QA-4K4AA-H8 Software Documentation: QA-4K4AA-GZ * Denotes variant fields. For additional information on available li- censes, services, and media, refer to the appropriate price book. The above information is valid at time of release. Please contact your local Digital office for the most up-to-date information. 7 SOFTWARE PRODUCT SERVICES A variety of service options are available from Digital. For more in- formation, contact your local Digital office. SOFTWARE WARRANTY Warranty for this software product is provided by Digital with the pur- chase of a license for the product as defined in the Software Warranty Addendum of this SPD. © 1996 Digital Equipment Corporation. All rights reserved. [TM] AlphaServer, AlphaStation, DEC, DECsafe, Digital, Digital UNIX, POLYCENTER, StorageWorks, and the DIGITAL Logo are trademarks of Digital Equipment Corporation. [R] Dialogic is a registered trademark of Dialogic Corporation. [R] UNIX is a registered trademark in the United States and other countries, licensed exclusively through X/Open Company Limited. 8