Guidelines for OpenVMS Cluster Configurations
 
 8.6.2 Configuring Two LAN Segments
Figure 8-2 shows a sample configuration for an OpenVMS Cluster
system connected to two different LAN segments. The configuration
includes Alpha and VAX nodes, satellites, and two bridges.
 
Figure 8-2 Two-LAN Segment OpenVMS Cluster Configuration
 
   
The figure illustrates the following points:
 
   Connecting critical nodes to multiple LAN segments provides
  increased availability in the event of segment or adapter failure. Disk
  and tape servers can use some of the network bandwidth provided by the
  additional network connection. Critical satellites can be booted using
  the other LAN adapter if one LAN adapter fails.
   Connecting noncritical satellites to only one LAN segment helps to
  balance the network load by distributing systems equally among the LAN
  segments. These systems communicate with satellites on the other LAN
  segment through one of the bridges.
  Only one LAN adapter per node can be used for DECnet and MOP
  service, to prevent duplication of LAN addresses (see the sketch after
  this list).
  LAN adapters providing MOP service (Alpha or VAX, as appropriate)
  should be distributed among the LAN segments to ensure that LAN
  failures do not prevent satellite booting.
   Using redundant LAN bridges prevents the bridge from being a
  single point of failure.
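 
The following DCL lines sketch how MOP (downline-load) service might be
enabled on only one DECnet Phase IV circuit per node, as the list above
recommends. The circuit name SVA-0 is a placeholder for whatever LAN
adapter a given system actually uses; treat this as an illustrative
sketch rather than a prescribed procedure.
 
   $! Enable MOP (downline-load) service on the one circuit that DECnet uses:
   $ MCR NCP DEFINE CIRCUIT SVA-0 SERVICE ENABLED
   $! Verify the permanent database entry:
   $ MCR NCP LIST CIRCUIT SVA-0 CHARACTERISTICS
   $! The node's second LAN adapter is left out of the DECnet configuration;
   $! it carries only cluster (SCS) traffic.
 
Distributing the service-enabled circuits across LAN segments, as noted
above, keeps a single segment failure from blocking satellite booting.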
 8.6.3 Configuring Three LAN Segments
Figure 8-3 shows a sample configuration for an OpenVMS Cluster
system connected to three different LAN segments. The configuration
also includes both Alpha and VAX nodes and satellites and multiple
bridges.
 
Figure 8-3 Three-LAN Segment OpenVMS Cluster Configuration
 
   
The figure illustrates the following points:
 
   Connecting disk and tape servers to two or three LAN segments can
  help provide higher availability and better I/O throughput.
   Connecting critical satellites to two or more LAN segments can
  also increase availability. If any of the network components fails,
  these satellites can use the other LAN adapters to boot and still have
  access to the critical disk servers.
   Distributing noncritical satellites equally among the LAN segments
  can help balance the network load.
  A MOP server (Alpha or VAX, as appropriate) is provided for each
  LAN segment.
 
Reference: See Section 11.2.4 for more information
about boot order and satellite dependencies in a LAN. See OpenVMS Cluster Systems
for information about LAN bridge failover.
8.7 Availability in a DSSI OpenVMS Cluster 
Figure 8-4 shows an optimal configuration for a medium-capacity,
highly available DSSI OpenVMS Cluster system. Figure 8-4 is followed
by an analysis of the configuration that includes:
 
  Analysis of its components
  Advantages and disadvantages
  Key availability strategies implemented
 
Figure 8-4 DSSI OpenVMS Cluster System
 
   
 8.7.1 Components
The DSSI OpenVMS Cluster configuration in Figure 8-4 has the
following components: 
  
    | Part | Description |  
    | 1 | Two DSSI interconnects with two DSSI adapters per node. 
      Rationale: For redundancy, use at least two
      interconnects and attach all nodes to all DSSI interconnects.
     |  
    | 2 | Two to four DSSI-capable OpenVMS nodes. 
      Rationale: Three nodes are recommended to maintain
      quorum. A DSSI interconnect can support a maximum of four OpenVMS nodes.
       
      Alternative 1: Two-node configurations require a
      quorum disk to maintain quorum if a node fails.
       
      Alternative 2: For more than four nodes, configure two
      DSSI sets of nodes connected by two LAN interconnects.
     |  
    | 3 | Two Ethernet interconnects. 
      Rationale: The LAN interconnect is required for
      DECnet--Plus communication. Use two interconnects for redundancy. For
      higher network capacity, use FDDI instead of Ethernet.
     |  
    | 4 | System disk.  Shadow the system disk across DSSI interconnects.
       
      Rationale: Shadow the system disk across interconnects
      so that the disk and the interconnect do not become single points of
      failure.
     |  
    | 5 | Data disks.  Shadow essential data disks across DSSI interconnects.
       
      Rationale: Shadow the data disk across interconnects
      so that the disk and the interconnect do not become single points of
      failure.
     |  
 8.7.2 Advantages
The configuration in Figure 8-4 offers the following advantages:
 
  The DSSI interconnect gives all nodes shared, direct access to all
  storage.
  Moderate potential for growth in size and performance.
  There is only one system disk to manage.
 8.7.3 Disadvantages
This configuration has the following disadvantages:
 
  Applications must be shut down in order to swap DSSI cables; this
  procedure is referred to as a "warm swap." The DSSI adapter, cable,
  and node are all warm swappable.
  A node's location on the DSSI affects the recoverability of the
  node. If the adapter fails on a node located at the end of the DSSI
  interconnect, the OpenVMS Cluster may become unavailable.
 8.7.4 Key Availability Strategies
The configuration in Figure 8-4 incorporates the following
strategies, which are critical to its success:
 
  This configuration has no single point of failure.
  Volume shadowing provides multiple copies of system and essential
  data disks across separate DSSI interconnects.
  All nodes have shared, direct access to all storage.
  At least three nodes are used for quorum, so the OpenVMS Cluster
  continues if any one node fails (see the sketch after this list).
  There are no satellite dependencies.
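 
As a sketch only, the quorum scheme described above is controlled by the
VOTES and EXPECTED_VOTES system parameters, normally set in
SYS$SYSTEM:MODPARAMS.DAT and applied with AUTOGEN. The values below
assume the three-node configuration of Figure 8-4 with no quorum disk;
because quorum is computed as roughly (EXPECTED_VOTES + 2)/2, the
resulting quorum of 2 lets the cluster continue when any one node fails.
 
   ! Illustrative MODPARAMS.DAT entries on each of the three nodes:
   VOTES = 1               ! each node contributes one vote
   EXPECTED_VOTES = 3      ! total votes expected in the full cluster
   !
   ! For the two-node alternative with a quorum disk, entries along these
   ! lines would be used instead (the device name is a placeholder):
   ! DISK_QUORUM = "$1$DIA0"
   ! QDSKVOTES = 1
 
After the nodes reboot, the SHOW CLUSTER command can be used to confirm
that the vote and quorum values are what was intended.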
 8.8 Availability in a CI OpenVMS Cluster
Figure 8-5 shows an optimal configuration for a large-capacity,
highly available CI OpenVMS Cluster system. Figure 8-5 is followed
by an analysis of the configuration that includes:
 
  Analysis of its components
  Advantages and disadvantages
  Key availability strategies implemented
 
Figure 8-5 CI OpenVMS Cluster System
 
 8.8.1 Components
The CI OpenVMS Cluster configuration in Figure 8-5 has the following
components:
 
  
    | Part | Description |  
    | 1 | Two LAN interconnects. 
      Rationale: A LAN interconnect is required for
      DECnet--Plus communication. Having two LAN
      interconnects---Ethernet or FDDI---increases redundancy. For higher
      network capacity, use FDDI instead of Ethernet.
     |  
    | 2 | Two to 16 CI-capable OpenVMS nodes. 
      Rationale: Three nodes are recommended to maintain
      quorum. A CI interconnect can support a maximum of 16 OpenVMS nodes.
       
      Reference: For more extensive information about the
      CIPCA, see Appendix C.
       
      Alternative: Two-node configurations require a quorum
      disk to maintain quorum if a node fails.
     |  
    | 3 | Two CI interconnects with two star couplers. 
      Rationale: Use two star couplers to allow for
      redundant connections to each node.
     |  
    | 4 | Critical disks are dual ported between CI storage controllers. 
      Rationale: Connect each disk to two controllers for
      redundancy. Shadow and dual port system disks between CI storage
      controllers. Periodically alternate the primary path of dual-ported
      disks to test hardware.
     |  
    | 5 | Data disks. 
      Rationale: Nonessential data disks can be single ported
      because they do not need the redundancy that dual porting provides.
     |  
    | 6 | Essential data disks are shadowed across controllers. 
      Rationale: Shadow essential disks and place shadow set
      members on different HSCs to eliminate a single point of failure.
     |  
 8.8.2 Advantages
This configuration offers the following advantages:
 
  All nodes have direct access to all storage.
  This configuration has a high growth capacity for processing and
  storage.
  The CI is inherently dual pathed, unlike other interconnects.
 8.8.3 Disadvantages
This configuration has the following disadvantage:
 
  Higher cost than the other configurations.
 8.8.4 Key Availability Strategies
The configuration in Figure 8-5 incorporates the following
strategies, which are critical to its success:
 
  This configuration has no single point of failure.
  Dual porting and volume shadowing provide multiple copies of
  essential disks across separate HSC or HSJ controllers (see the sketch
  after this list).
  All nodes have shared, direct access to all storage.
  At least three nodes are used for quorum, so the OpenVMS Cluster
  continues if any one node fails.
  There are no satellite dependencies.
  The uninterruptible power supply (UPS) ensures availability in case
  of a power failure.
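 
As an illustration only, the following DCL commands show how a shadow
set whose members sit behind different storage controllers might be
mounted with Volume Shadowing for OpenVMS. The virtual unit DSA10:, the
member device names, and the volume label are placeholders, not devices
from Figure 8-5, and host-based shadowing must already be enabled (for
example, through the SHADOWING system parameter).
 
   $! Mount a two-member shadow set with members on different controllers:
   $ MOUNT/SYSTEM DSA10: /SHADOW=($1$DUA10:, $2$DUA10:) DATA1
   $! Check shadow set membership and copy/merge status:
   $ SHOW DEVICES/FULL DSA10:
 
Periodically alternating the primary path of the dual-ported disks, as
recommended in Section 8.8.1, exercises the redundant hardware so that
path failures are not discovered only during an emergency.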
 
8.9 Availability in a MEMORY CHANNEL OpenVMS Cluster
Figure 8-6 shows a highly available MEMORY CHANNEL (MC) cluster
configuration. Figure 8-6 is followed by an analysis of the
configuration that includes: 
  Analysis of its components
  Advantages and disadvantages
  Key availability strategies implemented
 
Figure 8-6 MEMORY CHANNEL Cluster
 
 8.9.1 Components
The MEMORY CHANNEL configuration shown in Figure 8-6 has the
following components:
 
  
    | Part | Description |  
    | 1 | Two MEMORY CHANNEL hubs. 
      Rationale: Having two hubs and multiple connections to
      the nodes prevents having a single point of failure.
     |  
    | 2 | Three to eight MEMORY CHANNEL nodes. 
      Rationale: Three nodes are recommended to maintain
      quorum. A MEMORY CHANNEL interconnect can support a maximum of eight
      OpenVMS Alpha nodes.
       
      Alternative: Two-node configurations require a quorum
      disk to maintain quorum if a node fails.
     |  
    | 3 | Fast-wide differential (FWD) SCSI bus. 
      Rationale: Use a FWD SCSI bus to enhance data transfer
      rates (20 million transfers per second) and because it supports up to
      two HSZ controllers.
     |  
    | 4 | Two HSZ controllers. 
      Rationale: Two HSZ controllers ensure redundancy in
      case one of the controllers fails. With two controllers, you can
      connect two single-ended SCSI buses and more storage.
     |  
    | 5 | Essential system disks and data disks. 
      Rationale: Shadow essential disks and place shadow set
      members on different SCSI buses to eliminate a single point of failure.
     |  
 8.9.2 Advantages
This configuration offers the following advantages:
 
  All nodes have direct access to all storage.
  SCSI storage provides low-cost, commodity hardware with good
  performance.
  The MEMORY CHANNEL interconnect provides high-performance,
  node-to-node communication at a low price. The SCSI interconnect
  complements MEMORY CHANNEL by providing low-cost, commodity storage
  communication.
 8.9.3 Disadvantages
This configuration has the following disadvantage:
 
  The fast-wide differential SCSI bus is a single point of failure.
  One solution is to add a second fast-wide differential SCSI bus so
  that if one fails, the nodes can fail over to the other. To use this
  functionality, the systems must be running OpenVMS Version 7.2 or
  higher and have multipath support enabled (see the sketch after this
  list).
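 
As a hedged sketch only: on OpenVMS Version 7.2 and later, multipath
failover is governed by system parameters such as MPDEV_ENABLE, set in
MODPARAMS.DAT and applied with AUTOGEN. The parameter names and values
below are assumptions to be checked against the documentation for the
version in use.
 
   ! Assumed MODPARAMS.DAT entries for enabling SCSI multipath failover:
   MPDEV_ENABLE = 1        ! form multipath sets for devices with multiple paths
   MPDEV_POLLER = 1        ! poll alternate paths so failover targets stay usable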
 8.9.4 Key Availability Strategies
The configuration in Figure 8-6 incorporates the following
strategies, which are critical to its success:
 
  Redundant MEMORY CHANNEL hubs and HSZ controllers prevent a single
  point of hub or controller failure.
  Volume shadowing provides multiple copies of essential disks across
  separate HSZ controllers.
  All nodes have shared, direct access to all storage.
  At least three nodes are used for quorum, so the OpenVMS Cluster
  continues if any one node fails.
 8.10 Availability in an OpenVMS Cluster with Satellites
Satellites are systems that do not have direct access to a system disk
and other OpenVMS Cluster storage. Satellites are usually workstations,
but they can be any OpenVMS Cluster node that is served storage by
other nodes in the cluster.
 
Because satellite nodes are highly dependent on server nodes for
availability, the sample configurations presented earlier in this
chapter do not include satellite nodes. However, because
satellite/server configurations provide important advantages, you may
decide to trade off some availability to include satellite nodes in
your configuration.
 
Figure 8-7 shows an optimal configuration for an OpenVMS Cluster
system with satellites. Figure 8-7 is followed by an analysis of the
configuration that includes:
 
  Analysis of its components
  Advantages and disadvantages
  Key availability strategies implemented
 
The base configurations in Figure 8-4 and Figure 8-5 could
replace the base configuration shown in Figure 8-7. In other words,
the FDDI and satellite segments shown in Figure 8-7 could just as
easily be attached to the configurations shown in Figure 8-4 and
Figure 8-5.
 
Figure 8-7 OpenVMS Cluster with Satellites
 
 8.10.1 Components
The satellite/server configuration in Figure 8-7 has the following
components:
 
  
    | Part | Description |  
    | 1 | Base configuration.  The base configuration performs server functions for satellites.
     |  
    | 2 | Three to 16 OpenVMS server nodes. 
      Rationale: At least three nodes are recommended to
      maintain quorum. More than 16 nodes introduces excessive complexity.
     |  
    | 3 | FDDI ring between base server nodes and satellites. 
      Rationale: The FDDI ring provides greater network
      capacity than Ethernet.
       
      Alternative: Use two Ethernet segments instead of the
      FDDI ring.
     |  
    | 4 | Two Ethernet segments attached to the FDDI ring connect each critical
      satellite through two Ethernet adapters. Each of these critical satellites
      has its own system disk. 
      Rationale: Local boot disks increase the
      availability of the critical satellites.
     |  
    | 5 | For noncritical satellites, place a boot server on the Ethernet segment. 
      Rationale: Noncritical satellites do not need their
      own boot disks.
     |  
    | 6 | Limit the satellites to 15 per segment. 
      Rationale: More than 15 satellites on a segment may
      cause I/O congestion.
     |  
 8.10.2 Advantages
This configuration provides the following advantages:
 
  A large number of nodes can be served in one OpenVMS Cluster.
  You can spread a large number of nodes over a greater distance.
 8.10.3 Disadvantages
This configuration has the following disadvantages:
 
  Satellites with single LAN adapters have a single point of failure
  that causes cluster transitions if the adapter fails.
  High cost of LAN connectivity for highly available satellites.
 8.10.4 Key Availability Strategies
The configuration in Figure 8-7 incorporates the following
strategies, which are critical to its success:
 
  This configuration has no single point of failure.
  The FDDI interconnect has sufficient bandwidth to serve satellite
  nodes from the base server configuration.
  All shared storage is MSCP served from the base configuration,
  which is appropriately configured to serve a large number of nodes (see
  the sketch after this list).
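 
Purely as an illustrative sketch, MSCP and TMSCP serving on the base
server nodes is controlled by system parameters along the following
lines, set in MODPARAMS.DAT and applied with AUTOGEN; the exact values
depend on the storage configuration and the OpenVMS version in use.
 
   ! Assumed MODPARAMS.DAT entries on each base server node so that its
   ! locally accessible disks and tapes are served to the satellites:
   MSCP_LOAD = 1            ! load the MSCP disk server at boot
   MSCP_SERVE_ALL = 1       ! serve all available disks
   TMSCP_LOAD = 1           ! load the TMSCP tape server at boot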
 8.11 Multiple-Site OpenVMS Cluster System
Multiple-site OpenVMS Cluster configurations contain nodes that are
located at geographically separated sites. Depending on the technology
used, the distances between sites can be as great as 150 miles. FDDI,
asynchronous transfer mode (ATM), and DS3 are used to connect these
separated sites to form one large cluster. Available from most common
telephone service carriers, DS3 and ATM services provide long-distance,
point-to-point communications for multiple-site clusters.
 
Figure 8-8 shows a typical configuration for a multiple-site OpenVMS
Cluster system. Figure 8-8 is followed by an analysis of the
configuration that includes:
 
  Analysis of components
  Advantages
 
Figure 8-8 Multiple-Site OpenVMS Cluster Configuration Connected by WAN Link
 
 8.11.1 Components
Although Figure 8-8 does not show all possible configuration
combinations, a multiple-site OpenVMS Cluster can include:
 
  Two data centers with an intersite link (FDDI, ATM, or DS3)
  connected to a DECconcentrator or GIGAswitch crossbar switch.
  Intersite link performance that is compatible with the applications
  that are shared by the two sites.
  Up to 96 Alpha and VAX (combined total) nodes. In general, the
  rules that apply to OpenVMS LAN and extended LAN (ELAN) clusters also
  apply to multiple-site clusters. Reference: For
  LAN configuration guidelines, see Section 4.12.6. For ELAN configuration
  guidelines, see Section 10.7.7.
 8.11.2 Advantages
The benefits of a multiple-site OpenVMS Cluster system include the
following:
 
  A few systems can be remotely located at a secondary site and can
  benefit from centralized system management and other resources at the
  primary site. For example, a main office data center could be linked to
  a warehouse or a small manufacturing site that could have a few local
  nodes with directly attached, site-specific devices. Alternatively,
  some engineering workstations could be installed in an office park
  across the city from the primary business site.
  Multiple sites can readily share devices such as high-capacity
  computers, tape libraries, disk archives, or phototypesetters.
  Backups can be made to archival media at any site in the cluster. A
  common example would be to use disk or tape at a single site to back up
  the data for all sites in the multiple-site OpenVMS Cluster. Backups of
  data from remote sites can be made transparently (that is, without any
  intervention required at the remote site).
  In general, a multiple-site OpenVMS Cluster provides all of the
  availability advantages of a LAN OpenVMS Cluster. Additionally, by
  connecting multiple, geographically separate sites, multiple-site
  OpenVMS Cluster configurations can increase the availability of a
  system or elements of a system in a variety of ways:
  
    Logical volume/data availability---Volume shadowing or redundant
    arrays of independent disks (RAID) can be used to create logical
    volumes with members at both sites. If one of the sites becomes
    unavailable, data can remain available at the other site.
    Site failover---By adjusting the VOTES system parameter, you can
    select a preferred site to continue automatically if the other site
    fails or if communications with the other site are lost (see the
    sketch after this list).
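 
As an assumption-labeled sketch of the vote adjustment mentioned above:
giving the preferred site more aggregate votes than the other site lets
the preferred site retain quorum by itself if the intersite link is
lost. The node counts and values below are illustrative and are not
drawn from Figure 8-8.
 
   ! On each of the three primary-site nodes:
   VOTES = 2                ! 3 nodes x 2 votes = 6
   EXPECTED_VOTES = 8
   ! On each of the two secondary-site nodes:
   VOTES = 1                ! 2 nodes x 1 vote = 2
   EXPECTED_VOTES = 8
   ! Quorum is then (8 + 2)/2 = 5, so the primary site (6 votes) continues
   ! on its own, while the isolated secondary site (2 votes) loses quorum
   ! and suspends activity rather than forming a partitioned cluster.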
   
Reference: For additional information about
multiple-site clusters, see OpenVMS Cluster Systems.
8.12 Disaster-Tolerant OpenVMS Cluster Configurations 
Disaster-tolerant OpenVMS Cluster configurations make use of Volume
Shadowing for OpenVMS, high-speed networks, and specialized management
software.
 
Disaster-tolerant OpenVMS Cluster configurations enable systems at two
different geographic sites to be combined into a single, manageable
OpenVMS Cluster system. Like the multiple-site cluster discussed in the
previous section, these physically separate data centers are connected
by FDDI or by a combination of FDDI and ATM, T3, or E3.
 
The OpenVMS disaster-tolerant product was formerly named the Business
Recovery Server (BRS). BRS has been subsumed by a services offering
named Disaster Tolerant Cluster Services, which is a system management
and software service package. For more information about Disaster
Tolerant Cluster Services, contact your HP Services representative.
 
 