Using esxtop to identify storage performance issues
Details
Solution
Configuring monitoring using esxtop
-
Start esxtop by typing esxtop at the command line.
-
Press d to switch to disk view (HBA mode).
-
Press f to modify the fields that are displayed.
-
To view the entire Device name, press SHIFT + L and enter 38 in Change the name field size.
-
Press b, c, d, e, h, and j to toggle the fields and press Enter.
-
Press s, then 2 to alter the update time to every 2 seconds and press Enter.
-
See Analyzing esxtop columns for a description of relevant columns.
Note: The following options are only available in VMware ESX 3.5 and later.
To monitor storage performance on a per-LUN basis:
-
Start esxtop by typing esxtop from the command line.
-
Press u to switch to disk view (LUN mode).
-
Press f to modify the fields that are displayed.
-
Press b, c, f, and h to toggle the fields and press Enter.
-
Press s, then 2 to alter the update time to every 2 seconds and press Enter.
-
See Analyzing esxtop columns for a description of relevant columns.
For more information, see How to increase the width of the device field in esxtop to show the complete naa id (1035989).
-
Start esxtop by typing esxtop at the command line.
-
Type v to switch to disk view (virtual machine mode).
-
Press f to modify the fields that are displayed.
-
Press b, d, e, h, and j to toggle the fields and press Enter.
-
Press s, then 2 to alter the update time to every 2 seconds and press Enter.
-
See Analyzing esxtop columns for a description of relevant columns.
Column | Description |
CMDS/s | This is the number of IOPS (Input/Output Operations Per Second) being sent to or coming from the device or virtual machine being monitored |
DAVG/cmd | This is the average response time in milliseconds per command being sent to the device |
KAVG/cmd | This is the amount of time the command spends in the VMkernel |
GAVG/cmd | This is the response time as it is perceived by the guest operating system. This number is calculated with the formula: DAVG + KAVG = GAVG |
All arrays perform differently, howeverDAVG/cmd, KAVG/cmd, and GAVG/cmd should not exceed more than 10 milliseconds (ms) for sustained periods of time.
Note: VMware ESX 3.0.x does not include direct functionality to monitor individual LUNs or virtual machines usingesxtop. Inactive LUNs lower the average for DAVG/cmd, KAVG/cmd, and GAVG/cmd.
If the response time increases to over 5000 ms (or 5 seconds), VMware ESX will time out the command and abort the operation. These events are logged; abort messages and other SCSI errors can be reviewed inthe following logs:
- ESX – /var/log/vmkernel
- ESXi – /var/log/messages
Leave a Reply