Storage path does not fail over when TUR command repeatedly returns retry requests
search cancel

Storage path does not fail over when TUR command repeatedly returns retry requests

book

Article ID: 345231

calendar_today

Updated On:

Products

VMware vCenter Server VMware vSphere ESXi

Issue/Introduction

Typically, when a storage path experiences problems, an ESXi host sends the Test Unit Ready (TUR) command to confirm that the path is down before initiating a path failover. However, if the TUR command is unsuccessful and repeatedly returns a retry operation request (VMK_STORAGE_RETRY_OPERATION), the host continues to retry the command without triggering the failover. Usually, the following errors cause the host to retry the TUR command:
  • SCSI_HOST_BUS_BUSY 0x02
  • SCSI_HOST_SOFT_ERROR 0x0b
  • SCSI_HOST_RETRY 0x0c


Environment

VMware vSphere ESXi 5.5
VMware vCenter Server Appliance 5.5.x
VMware vSphere ESXi 6.5
VMware vCenter Server Appliance 6.5.x
VMware vCenter Server 5.5.x
VMware vCenter Server Appliance 6.0.x
VMware vCenter Server 6.0.x
VMware vCenter Server 6.5.x
VMware vSphere ESXi 6.0

Resolution

To resolve this issue, use the enable_action_OnRetryErrors option. This option allows the ESXi host to mark a problematic path as dead. After marking the path as dead, the host can trigger a failover and use an alternative working path.
To configure the option, perform these steps:
  1. Enable or disable the option by running an appropriate command:

     

    • Enable: #esxcli storage nmp satp generic deviceconfig set -c enable_action_OnRetryErrors -d naa.XXX
    • Disable: #esxcli storage nmp satp generic deviceconfig set -c disable_action_OnRetryErrors -d naa.XXX

     

  2. Check the status of the enable_action_OnRetryErrors option by running the following command:

    esxcli storage nmp device list

    The following example output indicates that the option has been enabled:

    naa.XXX
    Device Display Name: DGC Fibre Channel Disk (naa.XXX)
    Storage Array Type: VMW_SATP_CX Storage Array Type Device
    Config: {navireg ipfilter action_OnRetryErrors}

Note: The enable_action_OnRetryErrors option is persistent across reboots.