Knowledge Base

The VMware Knowledge Base provides support solutions, error messages and troubleshooting guides
 
Search the VMware Knowledge Base (KB)   View by Article ID
 

ESXi host fails with a purple diagnostic screen mentioning "SP_TryLockIRQWithRA" (2033780)

Symptoms

  • An ESXi host stops responding and displays a purple diagnostic screen indicating that a Page Fault (#PF) exception has occurred.

  • The purple diagnostic screen contains information similar to:

    #PF Exception 14 in world wwww:WorldName IP 0xnnnnnn addr 0xnnn
    SP_TryLockIRQWithRA@vmkernel#nover+0xnn
    CpuSchedRebalancePcpuMigrateIdleInt
    @vmkernel#nover+0xnn
    CpuSchedRebalance_PcpuMigrateIdle@vmkernel#nover+0xnn
    CpuSchedChooseAndSwitch@vmkernel#nover+0xnn
    CpuSchedDispatch@vmkernel#nover+0xnn

  • The ESXi host is using Intel Xeon CPUs which support Flex Priority.

Purpose

This article describes a specific issue. If you experience all of the above symptoms, consult the sections below. If you experience some but not all of these symptoms, your issue is not related to this article. Search the Knowledge Base for your symptoms, seeĀ Interpreting an ESX/ESXi host purple diagnostic screen (1004250) for similar issues, or Open a Support Request.

Resolution

This is a known issue affecting ESX/ESXi 4.1 and ESXi 5.0/5.1.

Workaround

This issue will not occur in environments which have an altered system configuration. Implement the workaround applicable to your version of ESX/ESXi.

  • For systems running ESXi 5.0/5.1

    VMware has released a patch which prevents the failure from occurring. No failures have been observed with Flex Priority disabled. For more information, see the release notes:


    If the system had Hyperthreading previously disabled as a workaround, re-enable it.

  • For systems running ESX/ESXi 4.1

    This issue has not been observed in environments where Hyperthreading is disabled. This reduces the capacity of the host and indirectly avoids conditions that lead to the outage.

    Disabling Hyperthreading may cause some workloads to experience performance impact, but many workloads will be unaffected. VMware recommends that any production workload be monitored post-change to determine whether there is a performance impact, and whether that impact is preferable to the risk of additional outages.

    To disable Hyperthreading in the server BIOS, consult the hardware vendor's documentation for instructions.

Additional Information

To be alerted when this article is updated, click Subscribe to Document in the Actions box.

See Also

Request a Product Feature

To request a new product feature or to provide feedback on a VMware product, please visit the Request a Product Feature page.

Feedback

  • 73 Ratings

Did this article help you?
This article resolved my issue.
This article did not resolve my issue.
This article helped but additional information was required to resolve my issue.
What can we do to improve this information? (4000 or fewer characters)
  • 73 Ratings
Actions
KB: