After upgrade from #esxi 6.7 to 7.0.3 unable to live migrate vm:s with nvidia v:gpu enabled. @NVIDIA @VMware @vExpert @vmwarehorizon

The goal of this action was to upgrade a cluster with nvidia graphics card from 6.7 to 7.0.3. Before the upgrade everything worked fine. You where able to live vmotion vm:s with vgpu to another host.

Problem:

Everything worked from vm side, but was not able to vmotion live vm:s between hosts.

Steps:

  1. Upgrade esxi
  2. Upgrade nvidia driver
  3. Start vm:s
  4. Vmotion failed between hosts

Solution:

This change must be applied on every host in the cluster that you want to live vmotion machines in between.

How to see if ECC is enabled or disabled.

nvidia-smi -q | grep "Ecc Mode" -A2

To disable

If your Current config is enabled you need to disable ECC, and you do this with this command:

nvidia-smi -e 0

And reboot is required!

And do not forget 1 host like I did! 🙂

Thats it!

//Roger

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out /  Change )

Twitter picture

You are commenting using your Twitter account. Log Out /  Change )

Facebook photo

You are commenting using your Facebook account. Log Out /  Change )

Connecting to %s

This site uses Akismet to reduce spam. Learn how your comment data is processed.