`cuda.cudart.getLocalRuntimeVersion()` raises `RuntimeError: Failed to dlopen libcudart.so.12` #89

Matt711 · 2024-09-10T15:11:32Z

Is this a bug? getLocalRuntimeVersion() fails for me in cuda 11.8 environment. I'm asking because I see that the API call is in the cuda-python 11.8 release notes.

In the source code, we're hard coding libcudart.so.12. Is that right?

Repro

In [1]: from cuda import cudart

In [2]: cudart.getLocalRuntimeVersion()
---------------------------------------------------------------------------
RuntimeError                              Traceback (most recent call last)
Cell In[2], line 1
----> 1 cudart.getLocalRuntimeVersion()

File ~/.conda/envs/rapids/lib/python3.11/site-packages/cuda/cudart.pyx:24961, in cuda.cudart.getLocalRuntimeVersion()

File ~/.conda/envs/rapids/lib/python3.11/site-packages/cuda/ccudart.pyx:2365, in cuda.ccudart.getLocalRuntimeVersion()

File ~/.conda/envs/rapids/lib/python3.11/site-packages/cuda/_lib/ccudart/ccudart.pyx:2121, in cuda._lib.ccudart.ccudart._getLocalRuntimeVersion()

RuntimeError: Failed to dlopen libcudart.so.12

The text was updated successfully, but these errors were encountered:

Matt711 · 2024-09-10T15:21:14Z

xref rmm/1675

leofang · 2024-09-10T15:44:38Z

It seems to be a backport mistake that we should fix:

cuda-python/cuda/_lib/ccudart/ccudart.pyx.in

Lines 2451 to 2457 in 64cc9ae

    
           # Load 
        
           handle = dlfcn.dlopen('libcudart.so.12', dlfcn.RTLD_NOW) 
        
           if handle == NULL: 
        
               with gil: 
        
                   raise RuntimeError(f'Failed to dlopen libcudart.so.12') 
        
           __cudaRuntimeGetVersion = dlfcn.dlsym(handle, 'cudaRuntimeGetVersion')

@Matt711 how urgent is this?

Matt711 · 2024-09-10T16:16:44Z

@Matt711 how urgent is this?

Not urgent. We already have a workaround using numba.cuda. I also don't mind working on this @leofang, if you could point me in the right direction.

leofang · 2024-09-10T16:23:10Z

Thanks, @Matt711. The offending code that I linked to above is from the 11.8.x branch, so ideally we can just fix the lines referencing libcudart.so.12 to .11. But we're transitioning to a new development/release process so let me check with @vzhurba01 later today first, and get back to you later.

leofang · 2024-09-10T21:36:32Z

@Matt711 we discussed and will try to get a new 11.8.x release out next week, with this bug fixed and perhaps also #75 backported.

Matt711 · 2024-09-11T04:49:52Z

Thanks @leofang

wence- · 2024-09-30T09:14:29Z

@leofang, @vzhurba01 did this backport/release occur?

vzhurba01 · 2024-09-30T20:27:47Z

Not yet. The wheels and conda packages are currently going through pre-release validation. I'll update this issue once posting is complete.

vzhurba01 · 2024-10-07T19:24:18Z

FYI I've updated the repo with the fix under the patch release 11.8.4 (tag v11.8.4).

I created new issue #139 to track the wheels/conda uploads for this patch release. I'm thinking of keeping this current issue open though until they are uploaded, and then give a notice before closing.

vzhurba01 · 2024-11-04T22:12:59Z

https://pypi.org/project/cuda-python/11.8.4/
https://anaconda.org/nvidia/cuda-python/files?version=11.8.4

Issue #139 is now complete as both PYPI and Conda (nvidia channel) are now updated with the 11.8.4 patch. Thank you all for your patience.

github-actions bot added the triage Needs the team's attention label Sep 10, 2024

leofang added bug Something isn't working and removed triage Needs the team's attention labels Sep 10, 2024

leofang added this to the cuda-12-RC1, cuda-11-RC1 milestone Sep 10, 2024

leofang added the P0 High priority - Must do! label Sep 10, 2024

Matt711 mentioned this issue Sep 11, 2024

[Improvement] Fetch runtime version with cuda-python instead of numba rapidsai/rmm#1675

Open

3 tasks

leofang assigned vzhurba01 Sep 19, 2024

leofang mentioned this issue Oct 8, 2024

Patch 11.8.4 #138

Merged

leofang added the cuda.bindings Everything related to the cuda.bindings module label Oct 10, 2024

vzhurba01 closed this as completed Nov 4, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

`cuda.cudart.getLocalRuntimeVersion()` raises `RuntimeError: Failed to dlopen libcudart.so.12` #89

`cuda.cudart.getLocalRuntimeVersion()` raises `RuntimeError: Failed to dlopen libcudart.so.12` #89

Matt711 commented Sep 10, 2024

Matt711 commented Sep 10, 2024

leofang commented Sep 10, 2024

Matt711 commented Sep 10, 2024

leofang commented Sep 10, 2024

leofang commented Sep 10, 2024

Matt711 commented Sep 11, 2024

wence- commented Sep 30, 2024 •

edited

Loading

vzhurba01 commented Sep 30, 2024

vzhurba01 commented Oct 7, 2024

vzhurba01 commented Nov 4, 2024

cuda.cudart.getLocalRuntimeVersion() raises RuntimeError: Failed to dlopen libcudart.so.12 #89

cuda.cudart.getLocalRuntimeVersion() raises RuntimeError: Failed to dlopen libcudart.so.12 #89

Comments

Matt711 commented Sep 10, 2024

Matt711 commented Sep 10, 2024

leofang commented Sep 10, 2024

Matt711 commented Sep 10, 2024

leofang commented Sep 10, 2024

leofang commented Sep 10, 2024

Matt711 commented Sep 11, 2024

wence- commented Sep 30, 2024 • edited Loading

vzhurba01 commented Sep 30, 2024

vzhurba01 commented Oct 7, 2024

vzhurba01 commented Nov 4, 2024

`cuda.cudart.getLocalRuntimeVersion()` raises `RuntimeError: Failed to dlopen libcudart.so.12` #89

`cuda.cudart.getLocalRuntimeVersion()` raises `RuntimeError: Failed to dlopen libcudart.so.12` #89

wence- commented Sep 30, 2024 •

edited

Loading