Force Ollama to use only a single Nvidia GPU

I run a multi-GPU Nvidia setup. The first card is a weak T400 that drives the display output; the second is my main GPU. I don’t like models being split across the two GPUs, since one of them is much weaker than the other. I found a way to select which GPUs are visible to specific programs in Windows.

Configure GPU Allocation for Windows Programs with Nvidia CUDA

  1. Open Nvidia Control Panel on your Windows machine.

  2. Navigate to ‘Manage 3D Settings’ from the main menu.

  3. Switch to the ‘Program Settings’ tab at the top of the window.

  4. Select the desired program from the dropdown menu (e.g., Ollama.exe).

  5. Locate the ‘CUDA - GPUs’ section in the program settings.

  6. Choose the desired GPU or list of GPUs to allocate to that program.

  7. Click ‘Apply’ to save your changes.

  8. Restart the program (such as Ollama.exe) for the changes to take effect. If that does not work, restart the computer.
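
To check that the setting took effect, you can compare the memory in use on each GPU while a model is loaded. The following is a minimal sketch, not part of the original procedure: it assumes `nvidia-smi` (installed with the Nvidia driver) is on your `PATH` and that it reports `memory.used` as a plain number in MiB for every card. The function names are made up for illustration.

```python
import subprocess

# Ask nvidia-smi for one CSV row per GPU: index, name, used memory in MiB.
QUERY = [
    "nvidia-smi",
    "--query-gpu=index,name,memory.used",
    "--format=csv,noheader,nounits",
]


def parse_gpu_memory(csv_text):
    """Turn nvidia-smi CSV rows into (index, name, used_mib) tuples."""
    gpus = []
    for line in csv_text.strip().splitlines():
        if not line.strip():
            continue
        # Split off the first and last fields so a comma in the name is harmless.
        index, rest = line.split(",", 1)
        name, used = rest.rsplit(",", 1)
        gpus.append((int(index), name.strip(), int(used)))
    return gpus


def read_gpu_memory():
    """Run nvidia-smi and return the parsed per-GPU memory usage."""
    result = subprocess.run(QUERY, capture_output=True, text=True, check=True)
    return parse_gpu_memory(result.stdout)


if __name__ == "__main__":
    for index, name, used_mib in read_gpu_memory():
        print(f"GPU {index} ({name}): {used_mib} MiB in use")
```

Run it once before and once after loading a model in Ollama. The card that drives the display will always show some memory in use, so look at the change: if the setting works, only the GPU you selected should grow by roughly the size of the model.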