Comments on: nvidia-smi: Control Your GPUs https://www.microway.com/hpc-tech-tips/nvidia-smi_control-your-gpus/ We Speak HPC & AI Tue, 28 May 2024 16:50:42 +0000 hourly 1 https://wordpress.org/?v=6.7.1 By: Primary use of nvidia-smi instructions – The Linux Cluster - Slightly Sarcastic https://www.microway.com/hpc-tech-tips/nvidia-smi_control-your-gpus/#comment-12 Tue, 17 Oct 2023 19:23:35 +0000 http://https://www.microway.com/hpc-tech-tips/?p=58#comment-12 […] There’s a excellent article written by Microway on this utility. Check out nvidia-smi: Management Your GPUs […]

]]>
By: Eliot Eshelman https://www.microway.com/hpc-tech-tips/nvidia-smi_control-your-gpus/#comment-5 Sun, 17 May 2020 20:03:46 +0000 http://https://www.microway.com/hpc-tech-tips/?p=58#comment-5 In reply to Bo-Yuan Ning.

Bo-Yuan: you may need to read through NVIDIA documentation and confirm what capabilities your GPUs have. NVIDIA’s datacenter GPUs (e.g., Quadro, Tesla) have more capabilities than the consumer-focused GPUs. I understand some GPUs start in P2 mode by default. But for any GPU, you cannot manually force it to a higher Performance State. Instead, you must ensure it is properly powered and cooled. Then, you can manually set the Graphics and Memory Clocks to max speed and the GPU will maintain those speeds as much as possible.

To start, you’ll want to query the status of your GPUs to determine if they might be throttled for power and/or thermal reasons. Start by running:
nvidia-smi --query --display=TEMPERATURE,POWER,PERFORMANCE

You might see that a Thermal Slowdown is Active, which means the GPUs are not being fully cooled. Several other slowdown reasons could also be shown.

If no Clock Throttle Reasons are Active, you can double-check that the GPU clocks are set to max. Query possible clock speeds with:
nvidia-smi --query-gpu=clocks.max.memory,clocks.max.graphics --format=csv,nounits

Take the two max clock speed values from the above command and manually set to these speeds (and note that these settings reset at reboot) with:
nvidia-smi --applications-clocks=[mem clock],[graphics clock]

]]>
By: Bo-Yuan Ning https://www.microway.com/hpc-tech-tips/nvidia-smi_control-your-gpus/#comment-4 Tue, 28 Apr 2020 11:11:31 +0000 http://https://www.microway.com/hpc-tech-tips/?p=58#comment-4 Feels great to find this intro accidentally. I have a question that how to make my GPUs be in the P0 performance state when using them to do some computations? So far, through nvidia-smi info, the maximum performance is still limited to P2 even though the Volatile GPU-Utility of the GPUs is 100%. Thanks a lot!

]]>
By: auro https://www.microway.com/hpc-tech-tips/nvidia-smi_control-your-gpus/#comment-3 Fri, 11 Oct 2019 21:51:08 +0000 http://https://www.microway.com/hpc-tech-tips/?p=58#comment-3 One of the best places to visit when you need nvidia-smi support

]]>
By: Alexandre https://www.microway.com/hpc-tech-tips/nvidia-smi_control-your-gpus/#comment-2 Fri, 08 Mar 2019 12:27:19 +0000 http://https://www.microway.com/hpc-tech-tips/?p=58#comment-2 I was not aware of all these options for nvidia-smi before reading your post: thank you very much!

]]>