Knowledge Base

The VMware Knowledge Base provides support solutions, error messages and troubleshooting guides
 
Search the VMware Knowledge Base (KB)   View by Article ID
 

VMware vCenter Server Heartbeat Out of Disk Behavior

Purpose

This article provides information about VMware vCenter Server Heartbeat Out of Disk behavior.

Resolution

Active Server CommsMgr Logs

The queue on the active server is used to store intercepted data prior to sending across the channel to the passive server. Queue build-ups here indicate communication problems with the Secondary server, or insufficient bandwidth for the data being replicated. The queue statistics are displayed in the VMware vCenter Server Heartbeat Console on the System > Status & Control tab.

These updates are stored in memory or to disk in the default location, C:\VMware\ VMware vCenter Server Heartbeat\r2\log. The maximum size on disk is configurable, MaxDiskUsage, is by default 1GB. Both these settings can be configured via the Configure Server wizard when VMware vCenter Server Heartbeat has been stopped.

Channel Disconnect

The queue is written out to disk if the active server is replicating and:

  1. The passive server was never connected.
  2. The VMware Channel suddenly disconnects and the configured number of heartbeats is very large.

In either case, the following may happen:

  • The MaxDiskUsage is reached; an alert NFChannelExceededMaxDiskUsageException is logged.

    Exception in CommsMgr [L9] Exceeded the maximum disk usage(NFChannelExceeded MaxDiskUsageException)

  • If available space on the drive is less than the MaxDiskUsage then NFChannelIOException is logged.

    Exception in CommsMgr [M4] Cannot open log file 2004-09-07-203.log(NFChannelCannotOpenIOException) because there is not enough space on the disk (IOException)

In both these situations, VMware vCenter Server Heartbeat:

  1. Ceases to log updates to the data.
  2. Discards all existing logs.
  3. Upon channel reconnection, a NFChannelLostMessageEvent is generated.
  4. A Full System Check is initiated to get the system back in sync.

Passive Server CommsMgr Logs

When data is received on the passive server, it is stored in the passive server (safe) queue until Apply is ready to handle updating the protected file. Depending upon system load, this is either in memory or to disk. Under normal operating conditions, this queue remains small. The passive server (safe) queue stats are displayed in the VMware vCenter Server Heartbeat Console on the System > Status & Control tab.

A build up in the queue may indicate a problem applying updates to the protected files.
 
Common causes are:
  • Hardware / software problems with the disk subsystem.
  • Under spec'd equipment, for example disk drives on the passive server are far slower than the active server disks.
  • Applications running on the passive server blocking updates.

When this queue gets very large, the protected application begins to slow down to avoid overloading the passive server with updates. If the configured limit is reached, VMware vCenter Server Heartbeat raises a NFChannelExceededMaxDiskUsageException on the passive server. The VMware vCenter Server Heartbeat server must not be shutdown, opting to leave the application running on the active server. The application's performance returns to normal.

Remedial Action

The passive server's hardware must be investigated for problems.

  1. Start by checking the Windows Application and System logs for Errors and Warnings regarding impending hardware failure, or other problems.
  2. Device Manager may show problems with drivers or RAID controllers malfunctioning.
  3. Alternatively, run system diagnostic checks that are supplied with the hardware.

Passive Server Lacks Protected Disk Space

This occurs when the active server has more disk space than the passive server. Protected data cannot be written to the passive server, so updates fail. Apply raises a Disk Full or Quota Exceeded exception in the log saying that it cannot create files. The system attempts to stop.

Error, Disk Full Or Quota Exceeded, [N27]Failed to write information for the file: D:\protected\some file.txt to the disk. Either the disk is full or the quota (for the SYSTEM account) has been exceeded.

Safeguards

Employ the following to prevent Out of Disk Behavior:

  1. Ensure that the available disk space on a drive exceeds MaxDiskUsage. The default configuration requires 1GB of disk space on the VMware vCenter Server Heartbeat installation drive.
  2. The passive server disk space must equal that of the active server.
  3. Consider hosting the VMware vCenter Server Heartbeat log directory on its own disk, or one that does not host:
    1. Protected Application data.
    2. The Windows System folder.

Additional Information

vCSHBRef248

Request a Product Feature

To request a new product feature or to provide feedback on a VMware product, please visit the Request a Product Feature page.

Feedback


Did this article help you?
This article resolved my issue.
This article did not resolve my issue.
This article helped but additional information was required to resolve my issue.
What can we do to improve this information? (4000 or fewer characters)
Actions