VMware
 

Knowledge Base

Search the Knowledge Base:

Products:
Search In:
 

SCSI Reservation Failures on HDS USP or NSC

Details

VMware ESX 3.0.x, 3.5, ESX 4.0 VMkernel Logs show SCSI Reservation errors similar to what is listed in Troubleshooting SCSI Reservation failures on Virtual Infrastructure 3.x and vSphere 4.0 (1005009). 
 
The HDS USP or NSC (Tagmastore family) array exhibits high storage Port Utilization with low I/Os going through port.
 
Consult your HDS engineer on how to monitor these events on the array.

Solution

Troubleshooting SCSI Reservation Failures for Physical and Virtual LUNs on HDS USP or NSC

 

Storage PortFan-in Ratio

 

This is defined as the number of initiators sharing the same Storage Processor Port.

Consult with HDS support about how to identify and the best practice for the Fan-in Ratio.

 

VMware recommends reducing the Fan-in Ratio if you encounter SCSI Reservation Failures.

 

LUN Type

Queue depth, LUN-to-ESX ratio, LUN-to-VM ratio 

Based on log and Fibre Channel trace analysis, one cause of this issue is that the Command Queue is exhausted.

 

If these symptoms are exhibited, the following are the best practices to workaround the problem.
 
Note: This section does not apply to HDS USP  V or USP VM).
 
Perform one of the following:
  • Reduce the Queue Depth on each Server's HBA connected to that array.
  • Reduce the number of ESX Server hosts sharing a given LUN or set of LUNs on that array.
  • Reduce the number of VMs per LUN so that the possible number of hosts accessing a given LUN would be reduced. For example, if you limit the number of virtual machines per LUN to 4, the highest number of ESX hosts running these virtual machines does not exceed 4 in a DRS/HA cluster. 

    Note: This option may result in using more LUNs of smaller sizes. The maximum number of LUNs accessible by a ESX 3.x and ESX 4.x is 256.

Recommended the Queue Depth setting on the HBA

 

 Number of hosts sharing a LUN         Queue Depth Value

 8                                                         2

 4                                                         4

 2                                                         8

 

As fewer hosts share a given LUN, the queue depth setting is higher.

 

Note:VMware has not received similar reports for USP-V or USP-VM and the above does not apply to these models. The HBA driver’s default queue depth is sufficient.

 

Changing the Queue Depth

 
Run the following commands at the ESX Server Console
 
Note: These procedures require rebooting the server.

 

QLogic HBAs:  

  1. Run the following command to identify the HBAs driver name:

    # vmkload_mod -l | grep qla

    The output appears similar to:

    qla2300_707_vmw

  2. Substitute the <driver_name> parameter below with the name from the above output. Substitute the nn parameter with the Queue depth value calculated above, and run the following commands:

    # esxcfg-module -s "ql2xmaxqdepth=nn" <driver_name>
    # esxcfg-boot -b
    # reboot

Emulex HBAs:

  1. Run the following command to identify the HBAs driver name:

    # vmkload_mod -l | grep lpfcdd

    The output appears similar to:

    lpfcdd_7xx

  2. Substitute the <driver_name> parameter below with the name from the above output. Substitute the nn parameter with the Queue depth value calculated above.

    # esxcfg-module -s “lpfc0_lun_queue_depth=nn” <driver_name>

    If you have 2 Emulex HBAs in the server, the command is:

    # esxcfg-module -s "lpfc0_lun_queue_depth=nn lpfc1_lun_queue_depth=nn" <driver_name>
    # esxcfg-boot -b
    # reboot
Items specific to Virtual LUNs
 
HDS USP and NSC provide access to physical LUNs (internal to the array) as well as Virtual LUNs whose physical LUNs are actually hosted on other arrays behind them.
  • Virtual LUNs must match the physical LUNs RAID Level. For example, if the physical LUN is RAID5, the virtual LUN must be setup using RAID5 as well.
  • A physical LUN that is represented by a Virtual LUN must be on Tier 1 type physical disks (Fibre SCSI Disks or Fibre SAS Disks) and with minimum 10K RPM rating, to provide the best I/O performance.
  • LUSE LUNs should not be used with Virtual LUNs. For more information, see http://www.vmware.com/pdf/hds_svd_technote.pdf.

Keywords

HDS, USP, TagmaStore, SCSI Reservation, Conflict Retries

Feedback

Rating: 1 - Lowest 2 3 4 5 - Highest (0 Ratings)   

Did this article help you?
This article resolved my issue.
This article did not resolve my issue.
This article helped but additional information was required to resolve my issue.
What can we do to improve this information? (2000 or fewer characters)
Submit
Rating: 1 - Lowest 2 3 4 5 - Highest (0 Ratings)   
Actions