Adjusting the VMware High Availability failover timeout value
search cancel

Adjusting the VMware High Availability failover timeout value

book

Article ID: 344319

calendar_today

Updated On:

Products

VMware vCenter Server VMware vSphere ESXi

Issue/Introduction

Network interruptions may cause a loss of virtual machine functionality if the VMware High Availability (HA) timeout threshold is exceeded on affected ESX hosts, shutting down virtual machines. Depending on your environment, it may be necessary to adjust the VMware HA timeout value.

This article provides steps to change the VMware HA timeout value on your ESX hosts.


Environment

VMware ESX Server 3.0.x
VMware VirtualCenter 2.5.x
VMware ESX Server 3.5.x
VMware vCenter Server 4.1.x
VMware VirtualCenter 2.0.x
VMware vCenter Server 4.0.x

Resolution

Caution: Increasing the VMware HA timeout value also increases the amount of time before real failures are detected. A longer failure detection time may cause a longer outage within your environment. Changing the HA timeout also affects the isolation detection time and the small gap during which the host regains network connectivity before the isolation detection time is activated. Perform these steps at your own risk.
The method for adjusting the timeout depends on which version of VirtualCenter you are using to manage your ESX host. It is important to use the correct instructions for your version of VirtualCenter. Express the new timeout value in milliseconds. To convert seconds to milliseconds, multiply by 1000. For example, the default timeout value 15 seconds = 15000 milliseconds.

VirtualCenter 2.0.x


To increase the VMware HA timeout value for VirtualCenter 2.0.x:
  1. Log in to the ESX host console command-line as root.
  2. Run the following command to see the timeout value:

    /opt/LGTOaam512/bin/ftcli -cmd getFailureDetectionInfo

  3. Run the following commands to increase the timeout value:

    Note: FT_DOMAIN is the VMware HA cluster domain. It has nothing to do with your network domain. Enter the value as shown.

    FT_DOMAIN=vmware

    FT_DIR=/opt/LGTOaam512

    export FT_DOMAIN

    export FT_DIR

    /opt/LGTOaam512/bin/ftcli -cmd "setFailureDetectionTime XXXXX"

    Note: Replace XXXXX with the new timeout value in milliseconds.

VirtualCenter 2.5.x and vSphere 4.0.x

To increase the VMware HA timeout value for VirtualCenter 2.5.x and vSphere 4.0.x:

  1. In VMware Infrastructure Client (VI Client), click on the HA cluster.
  2. In the Summary tab, click Edit Settings.
  3. In the Settings dialog box, deselect Enable VMware HA and click OK.

    Note: You must wait for all reconfigure and unconfigure tasks to finish.
  4. Click on the HA Cluster.
  5. In the Summary tab, click Edit Settings.
  6. Select Enable VMware HA.
  7. In the left pane, click VMware HA.
  8. Click Advanced Options.
  9. In the Advanced Options (HA) dialog box:
    1. In the option name field, enter das.failuredetectiontime .
    2. For the value, enter the new timeout value in milliseconds.

      Note: The default value for das.failuredetectiontime is 15000 milliseconds (15 seconds).

  10. Click OK.
  11. Click OK.

Note: You must wait for all reconfigure and configuring tasks to complete. The change propagates to the ESX hosts in the cluster during HA configuration.