NVMLError(ret) pynvml.nvml.NVMLError_InvalidArgument: Invalid Argument #168

GabrielZZZ · 2022-10-13T14:05:06Z

Hi,

I am using Ubuntu 20.04 with Nvidia RTX3090. When I followed the instructions to train the model, it always gives me this error:
File "/opt/conda/lib/python3.8/site-packages/pynvml/nvml.py", line 366, in check_return raise NVMLError(ret) pynvml.nvml.NVMLError_InvalidArgument: Invalid Argument
Does anyone know any possible solutions? That would be very helpful.

The text was updated successfully, but these errors were encountered:

Gienapp · 2022-11-07T11:43:59Z

Hi, I'm having the same problem. Did you come up with a solution?

GabrielZZZ · 2022-11-07T23:36:18Z

I am afraid not. It appears the forum is not very active.

Gienapp · 2022-11-08T09:02:18Z

For me the solution was to change the number nproc_per_node in the training command from 8 to the number of GPUs my server has.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

NVMLError(ret) pynvml.nvml.NVMLError_InvalidArgument: Invalid Argument #168

NVMLError(ret) pynvml.nvml.NVMLError_InvalidArgument: Invalid Argument #168

GabrielZZZ commented Oct 13, 2022

Gienapp commented Nov 7, 2022

GabrielZZZ commented Nov 7, 2022

Gienapp commented Nov 8, 2022

NVMLError(ret) pynvml.nvml.NVMLError_InvalidArgument: Invalid Argument #168

NVMLError(ret) pynvml.nvml.NVMLError_InvalidArgument: Invalid Argument #168

Comments

GabrielZZZ commented Oct 13, 2022

Gienapp commented Nov 7, 2022

GabrielZZZ commented Nov 7, 2022

Gienapp commented Nov 8, 2022