-
Notifications
You must be signed in to change notification settings - Fork 8
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
CUDA Illegal memory access when starting training #18
Comments
Hm, I haven't seen that myself. Is the trace above with |
I tried it again this morning both with CUDA_LAUNCH_BLOCKING=1 and without, and neither triggered the crash. I think it might have had something to do with the other notebooks I still had open with CUDA contexts and some interaction there. I'll close this for now and reopen if I run into it again. |
I triggered it again, here's the stack trace where I put
|
This is using:
It runs fine with default torch sgd, so I assume it's not a problem with my model.
Let me know if you need more information.
The text was updated successfully, but these errors were encountered: