Knowledge Base

The VMware Knowledge Base provides support solutions, error messages and troubleshooting guides
 
Search the VMware Knowledge Base (KB)   View by Article ID
 

VMware Fault Tolerance FAQ (1013428)

Purpose

This article provides information on Frequently Asked Questions that can help to resolve Fault Tolerance related issues.

Resolution

What is VMware Fault Tolerance?
VMware Fault Tolerance is a feature that allows a new level of guest redundancy. Information regarding this feature can be found in the vSphere Availability Guide for your version of ESXi/ESX.
 
How do I turn it on?
The feature is enabled on a per virtual machine basis. Instructions for enabling Fault Tolerance can be found in the Turning on Fault Tolerance for Virtual Machines section of the vSphere Availability Guide for your version of ESXi/ESX.
 
What happens when I turn on Fault Tolerance?
In very general terms, a second virtual machine is created to work in tandem with the virtual machine you have enabled Fault Tolerance on. This virtual machine resides on a different host in the cluster, and runs in virtual lockstep with the primary virtual machine. When a failure is detected, the second virtual machine takes the place of the first one with the least possible interruption of service. More specific information about how this is achieved can be found in the Protecting Mission-Critical Workloads with VMware Fault Tolerance whitepaper.
 
Why can't I turn Fault Tolerance on?
VMware Fault Tolerance can be enabled on any virtual machine that resides in a cluster that meets the necessary requirements. If you have difficulty enabling Fault Tolerance for a specific virtual machine, see The Turn on Fault Tolerance option is disabled (1010631).
 
How do I turn Fault Tolerance off?
For Instructions on disabling Fault Tolerance, see Disabling or Turning Off VMware FT (1008026).
 
How do I tell if my environment is ready for Fault Tolerance?
The VMware SiteSurvey Tool is used to check your environment for compliance with VMware Fault Tolerance. It can be downloaded from the VMware Shared Utilities page.
 
Where do I find the product's website?
VMware has a website for Fault Tolerance on the VMware vSphere page.
 
What happens during a failure?
When a host running the Primary virtual machine fails, a transparent failover occurs to the corresponding Secondary virtual machine. During this failover, there is no data loss or noticeable service interruption. In addition, VMware HA automatically restores redundancy by restarting a new Secondary virtual machine on another host. Similarly, if the host running the Secondary virtual machine fails, VMware HA starts a new Secondary virtual machine on a different host. In either case there is no noticeable outage.
 
What is the logging time delay between the Primary and Secondary Fault Tolerance virtual machines?
The actual delay is based on the network latency between the Primary and Secondary. vLockstep executes the same instructions on the Primary and Secondary, but because this happens on different hosts, there could be a small latency, but no loss of state. This is typically less than 1 millsecond (ms). Fault Tolerance includes synchronization to ensure that the Primary and Secondary are synchronized.
 
In a cluster with more than 3 hosts, can you tell Fault Tolerance where to put the Fault Tolerance virtual machine or does it chose on its own?
You can place the original (or Primary virtual machine). You have full control with DRS or vMotion to assign it to any node. The placement of the Secondary, when created, is automatic based on the available hosts. But when the Secondary is created and placed, you can vMotion it to the preferred host.
 
What happens if the host containing the Primary virtual machine comes back online (after a node failure)?
This node is put back in the pool of available hosts. There is no attempt to start or migrate the Primary to that host.
 
Is the failover from the Primary virtual machine to the Secondary virtual machine dynamic or does Fault Tolerance restart a virtual machine?
The failover from the Primary to Secondary virtual machine is dynamic with the Secondary continuing execution from the exact point where the Primary left off. It happens automatically with no data loss, no downtime, and little delay. Clients see no interruption. After the dynamic failover to the Secondary virtual machine, it becomes the new Primary virtual machine. A new Secondary virtual machine is spawned automatically.
 
Where are Fault Tolerance failover events logged?
All failover events are logged by vCenter Server.
 
I encountered an error message that I can't find in the Knowledge Base.  Where else should I check?
The vSphere Availability Guide contains a list of known errors in the Fault Tolerance Error Messages.
 
Does Fault Tolerance support Intel Hyper-Threading Technology?
Yes, Fault Tolerance does support Intel Hyper-Threading Technology on systems that have it enabled. Enabling or disabling Hyper-Threading has no impact on Fault Tolerance.
 
What happens if vCenter Server is offline when a failover event occurs?
When Fault Tolerance is configured for a virtual machine, vCenter Server need not be online for FT to work. Even if vCenter Server is offline, failover still occurs from the Primary to the Secondary virtual machine. Additionally, the spawning of a new Secondary virtual machine also occurs without vCenter Server.

How many virtual CPUs can I use on a Fault Tolerant virtual machine ?
vCenter Server 4.x and vCenter Server 5.x support 1 virtual CPU per protected virtual machine.

Additional Information

Tags

enable-fault-tolerance  fault-tolerance-faq

See Also

Request a Product Feature

To request a new product feature or to provide feedback on a VMware product, please visit the Request a Product Feature page.

Feedback

  • 41 Ratings

Did this article help you?
This article resolved my issue.
This article did not resolve my issue.
This article helped but additional information was required to resolve my issue.
What can we do to improve this information? (4000 or fewer characters)
  • 41 Ratings
Actions
KB: