2
0
mirror of https://github.com/xcat2/xcat-core.git synced 2025-06-13 09:50:19 +00:00

Add update_nvidia_driver section to CUDA installaltion documentation

This commit is contained in:
Casandra Qiu
2016-03-30 14:29:38 -04:00
parent b4d258c628
commit 21388caa4e
2 changed files with 42 additions and 0 deletions

View File

@ -17,3 +17,4 @@ Within the NVIDIA CUDA Toolkit, installing the ``cuda`` package will install bot
deploy_cuda_node.rst
verify_cuda_install.rst
management.rst
update_nvidia_driver.rst

View File

@ -0,0 +1,41 @@
Upgrade NVIDIA Driver
=====================
If the user wants to update the newer NVIDIA driver on the system, need to :doc:`create New CUDA software reposity </advanced/gpu/nvidia/repo/index>` . Assume the newer driver is in the ``/install/cuda-7.5/ppc64le/nvidia_new`` for the following processes.
Diskful
-------
#. Change pkgdir for the cuda image: ::
chdef -t osimage -o rhels7.2-ppc64le-install-cudafull \
pkgdir=/install/cuda-7.5/ppc64le/nvidia_new,/install/cuda-7.5/ppc64le/cuda-deps
#. Use xdsh command to remove all the nvidia rpms: ::
xdsh <noderange> "yum remove *nvidia* -y"
#. Run updatenode command to upgrade NVIDIA driver on the compute node: ::
updatenode <noderange> -S
#. Reboot compute node: ::
rpower <noderange> off
rpower <noderange> on
#. Verify the newer driver level on the compute node: ::
nvidia-smi | grep Driver
Diskless
--------
For update new NVIDIA driver on the diskless compute node, the easy and simple way is re-generate the osimage with New NIVIDIA driver reposity and re-provision the node with this osimage because node needs to be reboot in order for NIVIDIA driver to load. Please follow :doc:`this doc </advanced/gpu/nvidia/osimage/index>` to create osimage definitions and deploy CUDA nodes.