Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Build autotuned kernels in parallel #4806

Open
saagarjha opened this issue Sep 25, 2024 · 0 comments
Open

Build autotuned kernels in parallel #4806

saagarjha opened this issue Sep 25, 2024 · 0 comments

Comments

@saagarjha
Copy link
Contributor

Autotuning takes a while and for us most of that time is actually spent compiling the JIT kernel for each configuration rather than running the code. Since this process happens on the host CPU and should not affect timings it would be nice if it could be run in parallel and then once that is done all the configurations could be tested on the GPU linearly. Is this something that might be worth supporting?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants