DIGITAL Software Product Description ___________________________________________________________________ PRODUCT NAME: DIGITAL AlphaServer Intelligent Peripheral Fault Man- ager SPD 60.35.02 for DIGITAL UNIX, Version 2.1 DESCRIPTION Intelligent Peripheral Fault Manager for DIGITAL UNIX (IPFM) Version 2.1, a layered software product, provides fault management services for the AlphaServer Intelligent Peripheral Platform. It monitors events within the platform and provides visual and audible alarms. In addi- tion, it provides a software API (the IP Fault Manager Application Pro- gramming Interface) to allow applications or other layered software to integrate into the same alarm subsystem. IPFM V2.1 provides the fol- lowing new functionality: o Year 2000 compliance o Symmetrical Multiprocessing (SMP) support for AlphaServer 4100 o Support for DIGITAL UNIX 4.0d FUNCTIONAL DESCRIPTION The IP Fault Manager components are: 1. Event Detector - Event detection software that runs on each sys- tem in the Intelligent Peripheral. Monitors the DIGITAL UNIX op- erating system for detected errors, and monitors the system for user- specified events as defined in the configuration files. February 1998 AE-QN0GC-TE 2. Event Manager - The manager of the event database. The Event Man- ager coordinates the combination of all outstanding events on the local system. The Event Manager logs all event activity in the lo- cal event log file as well as insures that the local event database is kept up to date. It also manages the visual and audible alarms on the Indicator Module to ensure it is consistent with the Event Database. 3. Event Database - The repository of outstanding events. This event database contains an entry for each outstanding event that has been detected or presented to a system. The events can be added (SET), acknowledged (ACK), and removed (CLEAR) from the event database. 4. IPFM Application Programmer's Interface (API) - A programming in- terface that third-party applications can use to set, acknowledge and clear alarmed events within the AlphaServer IP Platform. 5. IPFM Operator Interface - An interface that can be used to display outstanding events , as well as to set, acknowledge, and clear alarms manually on the local system. 6. IP Event Log File - An event log file on each system in the Intel- ligent Peripheral Platform. The log files contain every event that is detected or generated. 7. Configuration Files - Files that are used to allow user customiza- tion of certain details of the IPFM configuration. The user can mod- ify all periodic timers (used for repeated actions), as well as the event-to-alarm severity definitions. 8. SNMP subagent / IPFM MIB - Maintains the MIB (SNMP database) to be consistent with the Event Database and sends traps to a network man- agement station (such as ServerWORKS) to trigger visual indicators and provide data for the alarm database within the network manage- ment station. Any network management station with an SNMP inter- face can manage IPFM. 2 EVENT DETECTION The Event Detector is responsible for detecting any events that oc- cur on a system, and reporting the events to the IPFM Event Managers. The IPFM Event Detector detects several categories of events: processor- based events (hardware events detected within the DIGITAL UNIX oper- ating system), external events, and storage events. When events are detected, they are checked against the list of reportable events (listed in the configuration file). An alarmed event is categorized by sever- ity: Critical: Severe, service-affecting condition requiring immediate corrective action. Major: Serious disruption of service or the malfunctioning or fail- ure of important functions or components. Less immediate or impend- ing effect on system performance than Critical. Minor: Trouble that does not have a serious effect on service, or that occurs in functions or components that are not essential for providing service. If the event passes the event filter, it is passed to the Event Man- ager. Since the Event Detector and the Event Manager reside in dif- ferent processes, the information is passed using common interprocess communication techniques. INTERFACES The IP Fault Manager has interfaces to other components of the AlphaServer IP Platform: o The IP Alarm Control Module An IP alarm control module must be present in order to support fault management for the chassis and for user defined events. The alarm control module has interfaces to the AlphaServer IP Platform sub- assemblies. When the alarm control module detects an event, it gen- erates an interrupt to system software in order to update the event database. 3 o Alarm Control Module Device Driver The Alarm Board Device Driver has an interface with the alarm con- trol module in the AlphaServer IP Platform. The alarm control mod- ule is the board that the device driver directly accesses. The de- vice driver presents an internal interface to the IPFM code, which allows the IPFM Fault Coordinator to set and clear minor, major, and critical alarms on the alarm indicator panel and to receive in- formation through the alarm control module. The information from the alarm control module can be mapped by the end users to indi- cate alarms and alarm priorities as they see fit. This device driver is dynamically loadable. The setting and clearing of alarms is ac- complished through ioctl calls to the device driver. This driver also supports ioctl calls, which reset the expansion board and the alarm board. When an event condition occurs or ceases to occur, the device driver is interrupted by the alarm control module, which reads the Status Register (SREG) to determine what change has occurred in the event conditions monitored. This event condition is then passed up to the event detector, and the appropriate action is taken as with any other event condition. The device driver has a diagnos- tics section that runs the hardware supplied self-test through the alarm control module and performs any additional hardware diagnos- tics that require software assist. The results of the diagnostics are reported to the operator. o IP Alarm Indicator Panel The IP alarm indicator panel displays alarm and system status for the AlphaServer Intelligent Peripheral Platform. It is connected to the AlphaServer 1000A system through a cable to the IP alarm con- trol module. Setting, clearing, and acknowledging alarm status (Crit- ical, Major, Minor) can be accomplished through the IPFM software menu. Clearing and acknowledging alarm status (Critical, Major, Mi- nor) can be accomplished from an SNMP-compliant network management station. o The Console The console has a multifunction purpose. It provides the AlphaServer IP Platform operator with a terminal window to: Use as a system console for the AlphaServer 1000A system. 4 Control manually the state of the IP alarm indicator panel by set- ting or clearing alarm indicators or modifying system status by means of the IPFM menu interface, if an NMS such as ServerWORKS is not being used for this purpose. o IP Alarm Log File This log file records the occurrences of all IP events and is ac- cessible by the AlphaServer IP Platform operator. It serves as a permanent record of all events that are displayed on the AlphaServer IP Platform console, operator workstation, or SNMP network manage- ment station. OPERATIONS SUPPORT The AlphaServer IP Platform can be managed locally or remotely. Fault Management is provided via the IPFM operator interface, ServerWORKS, or other SNMP-compliant NMS. Operator interfaces to the fault manage- ment capabilities of the AlphaServer IP Platform V2.0 include a ba- sic screen interface to each processor, specific to the IP fault man- agement capabilities and accessible from a character terminal. In ad- dition, a consolidated view of the fault management state of all pro- cessors in a distributed system can be provided via the ServerWORKS GUI operator interface. Finally, IPFM supports an SNMP interface al- lowing management using an SNMP-compliant network management station (NMS). USER INTERFACES The IP Fault Manager offers user interfaces as follows: 1. A user application (API) that enables applications to monitor and control application-oriented fault events to be integrated with the ones already handled by the IPFM. The IPFM can then send messages to the IP alarm indicator panel to set and clear alarms on behalf of the application. 5 2. A user menu located on a window on the AlphaServer IP Platform that allows the system operator to manu- ally set and clear alarms as well as system status LEDs on the IP alarm indicator panel. 3. An IP alarm log file that provides a recorded fault event history of all events monitored by the IP alarm indicator panel, the IP Fault Manager itself, and the user application. 4. An SNMP interface that allows an SNMP-compliant network management station to monitor, clear, and acknowledge alarms from a remote lo- cation. HARDWARE REQUIREMENTS Processors Supported AlphaServer Model 4/200 1000: Model 4/233 Model 4/266 Model 5/300 AlphaServer Model 5/400 1000A: AlphaServer Model 5/400 4100: Model 5/466 Other Hardware o ISA bus expansion chassis with IP sensor module: 2T-VC220-IB; 2T- IPSEN-AA o IP alarm control module: 2T-IPCON-AA o IP alarm indicator panel: 2T-IPAIP-AA, 2T-IPAIP-AB 6 Disk Space Requirements for AlphaServer and DIGITAL UNIX Systems Disk space required for 1.5 MB installation: Block cluster size = 1 These counts refer to the disk space required on the system disk. The sizes are approximate; actual sizes may vary depending on the user's system environment, configuration, and software options. SOFTWARE REQUIREMENTS o DIGITAL UNIX, V4.0d o DECevent, V2.6 SOFTWARE LICENSING This software is furnished only under a license. For more information about DIGITAL's licensing terms and policies, contact your local DIG- ITAL office. License Management Facility Support This layered product supports the License Management Facility. The li- cense units for this product are allocated on a Concurrent Use basis. For more information on the License Management Facility, refer to the DIGITAL UNIX Operating System Software Product Description or the Li- cense Management Facility manual, which is part of the DIGITAL UNIX operating system documentation set. GROWTH CONSIDERATIONS The minimum hardware/software requirements for any future version of this product may be different from the requirements for the current version. 7 DISTRIBUTION MEDIA CD-ROM ORDERING INFORMATION Intelligent Peripheral Fault Manager for DIGITAL UNIX Software Licenses: QL-4K4A9-AA Software Media: QA-4K4AA-H8 Software Documentation: QA-4K4AA-GZ The above information is valid at time of release. Please contact your local DIGITAL office for the most up-to-date information. SOFTWARE PRODUCT SERVICES A variety of service options are available from DIGITAL. For more in- formation, contact your local DIGITAL office. SOFTWARE WARRANTY Warranty for this software product is provided by DIGITAL with the purchase of a license for the product as defined in the Software Warranty Addendum of this SPD. © 1998 Digital Equipment Corporation. All rights reserved. [TM] AlphaServer, AlphaStation, DEC, DECevent, DECsafe, DIGI- TAL, DIGITAL UNIX, ServerWORKS, and the DIGITAL logo are trademarks of Digital Equipment Corporation. [R] Dialogic is a registered trademark of Dialogic Corporation. [R] UNIX is a registered trademark in the United States and other countries, licensed exclusively through X/Open Company Limited. DIGITAL UNIX is an X/Open UNIX 95 branded product. 8