Using a smaller block size #18

Open
pclucas14 opened this issue Jun 22, 2022 · 0 comments

Hi,

First of all, thanks for setting up this package :) It's super helpful.

I'm wondering, is there a way to use a smaller block size? I tried modifying the Python code so that no errors are thrown, but I'm hitting the following error when calling the CUDA kernel:

RuntimeError: CUDA error: an illegal memory access was encountered

I looked into the kernel code a bit, and it seems that the block_size argument is not used, so I'm curious how the kernel knows to expect a minimum size of 32.
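
A minimal sketch of one way this symptom could arise, assuming (not confirmed from the package source) that the kernel bakes in a 32x32 tile as a compile-time constant and ignores the runtime block_size. Every name below is hypothetical and only illustrates the index arithmetic; it does not reproduce the package's actual kernel.

```python
# Hypothetical illustration; none of these names come from the package.
KERNEL_TILE = 32  # assumed compile-time constant baked into the kernel


def elements_allocated(n_blocks: int, block_size: int) -> int:
    """Elements the Python side allocates for n_blocks square blocks."""
    return n_blocks * block_size * block_size


def max_offset_touched(n_blocks: int, tile: int = KERNEL_TILE) -> int:
    """Highest flat offset a kernel reads if it assumes tile x tile blocks."""
    return n_blocks * tile * tile - 1


def is_out_of_bounds(n_blocks: int, block_size: int) -> bool:
    """True when the assumed-tile kernel would read past the allocation."""
    return max_offset_touched(n_blocks) >= elements_allocated(n_blocks, block_size)


if __name__ == "__main__":
    for bs in (32, 16, 8):
        print(bs, is_out_of_bounds(n_blocks=4, block_size=bs))
```

Under that assumption, any block size below 32 makes the kernel read past the end of the buffer, which would surface as an illegal memory access. Running with the environment variable CUDA_LAUNCH_BLOCKING=1 makes kernel launches synchronous, so the error is reported at the call that caused it.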

Any clarifications would be super helpful!

Thanks
