Implementing vSphere Metro Storage Cluster (vMSC) using HP 3PAR Peer Persistence (2055904)
What is vMSC?
vSphere Metro Storage Cluster (vMSC) is a certified configuration for stretched storage cluster architectures. A vMSC configuration is designed to maintain data availability beyond a single physical or logical site. A storage device configured in the vMSC configuration is supported after successful vMSC certification. All supported storage devices are listed in the VMware Storage Compatibility Guide.
What is HP 3PAR Peer Persistence?
HP 3PAR Peer Persistence is an extension of HP 3PAR Remote Copy software and HP 3PAR OS that enables a pair of HP 3PAR StoreServ Storage systems, located at metropolitan distances, to act as peers to each other and present a nearly continuous storage system to hosts connected to them. Volumes presented to hosts are replicated across the pair of arrays and kept in sync. Each pair of replicated and synchronized volumes across each array share the same WWN and appear as the same volume to the hosts. Taking advantage of Asymmetric Logical Unit Access (ALUA) capabilities that allow paths to a SCSI device to be marked as having different characteristics, hosts connect to volumes in one array using active paths, and connect to replicated volumes in the other array through standby paths. ALUA Path status and host availability to the volumes are controlled by Peer Persistence software. This capability allows customers to configure a high-availability solution between two sites or data centers where switchover and switchback of access to the volumes across arrays remains transparent to the hosts and applications running on those hosts.
HP 3PAR Quorum Witness
The HP 3PAR Quorum Witness is a component provisioned as a virtual machine that is typically deployed at a third site. The HP 3PAR Quorum Witness, along with the two HP 3PAR StoreServ Storage systems, forms a three part quorum system. This quorum system allows monitoring of the status of both the HP 3PAR StoreServ Storage systems and the storage site inter-links. A number of site and inter-link failure scenarios can be recognized by this three part quorum system, and appropriate failover actions implemented. In the event of a disaster that may bring either one of the storage systems or sites down and in conjunction with Peer Persistence software, a failover to the surviving StoreServ system is automatically initiated. During this failover operation, replicated volumes on the remaining storage system are made active. The host paths to those volumes are also made active, thereby ensuring that hosts can continue to access their volumes without any disruption or outage. Communication between the three sites for quorum is via the Quorum Witness IP and the service management IP’s of the two HP 3PAR StoreServ Storage systems. HP 3PAR Quorum Witness does not actively participate in data storage and a failure or removal of the HP 3PAR Quorum Witness from an otherwise functioning environment will have no impact. The HP 3PAR Quorum Witness only comes into play when one site or the ISL has failed or if two quorum members have failed simultaneously.
These requirements must be satisfied to support a vMSC configuration with HP 3PAR.
- VMware ESXi 5.0 or higher, Metro Storage Cluster configured for uniform host access per VMware requirements and best practices.
- HP 3PAR StoreServ Storage arrays configured for Peer Persistence with Automated Transparent Failover per HP 3PAR Remote Copy Software User's Guide and Implementing vSphere Metro Storage Cluster using HP 3PAR Peer Persistence technical white paper.
- HP 3PAR Quorum Witness software must be installed on a virtual machine at a third site.
- vSphere vCenter Server connected to ESXi hosts in both data center.
- Maximum round trip latency on the storage network between sites should not exceed 2.6ms RTT.
- FC-switched, FCoE end-to-end, and software/hardware iSCSI SAN for host-array connectivity, and array inter-link connectivity setup as remote-copy-over-FC (RCFC) or remote-copy-over-IP (RCIP).
- Any IP subnet used by the virtual machine must be accessible by all ESXi hosts in all data centers within the Metro Storage Cluster.
A tested and certified VMware Metro Storage Cluster using HP 3PAR StoreServ Storage is supported in accordance with the VMware description of a uniform access vMSC configuration. Particular to the uniform access configuration, host data path connections at each site are cross-connected to the peer data center site storage array. ESXi hosts access volumes in the data center local array via active paths. Connectivity to standby peer volumes on the distance array is maintained in standby mode until such time as a failover or switchover. In concert with HP 3PAR Peer Persistence, HP 3PAR Quorum Witness and with Automated Transparent Failover (ATF) enabled, a minimally disruptive switchover or failover of volume access across sites can be achieved.
For example, in case of an array failure on one site (site 1):
- Loss of quorum is detected by the Quorum Witness and the surviving storage array at site 2.
- Peer Persistence software on array at site 2 makes the peer volumes and the cross-connected paths from the hosts on site 1 to the array on site 2, active.
- Site 1 hosts, virtual machines, and applications access peer volumes on site 2 and continue normal operation.
Note: The failover is automated with Peer Persistence and ATF, but transparent failback after fault correction is a manual process.
Sample tested scenarios
|Scenario||HP 3PAR StoreServ Storage System Behavior||VMware HA Behavior|
|Single Array-Host Path Failure||Hosts use alternate paths to maintain volume access.||No effect observed|
|Single Array Node Failure||Hosts use alternate paths to the surviving array node(s) at the site and maintain volume access.||No effect observed|
|Single Storage Inter-Site Link Failure||No effect. Inter-site connectivity is maintained by alternate link||No effect observed|
|Storage Inter-Site Links fail||Peer volume synchronization is disabled and Automated Transparent Failover is disabled.||No effect observed|
|Quorum Witness Failure||Automated Transparent Failover is disabled||No effect observed|
|Simultaneous Quorum Witness and Storage Inter-site Links Fail||Peer volume synchronization is disabled and Automated Transparent Failover is disabled.||No effect observed|
|Single Site Storage Array Failure||Automated Failover occurs and Peer volumes and paths are made active on the surviving site.||No effect observed|
|Complete Site Failure||Automated Failover occurs and Peer volumes and paths are made active on the surviving site.||Virtual machines are restarted on ESXi hosts on the surviving site|