Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[CUDA] add multi-node multi-GPU support for new CUDA version and improve efficiency #5993

Closed
Xuweijia-buaa opened this issue Jul 20, 2023 · 1 comment

Comments

@Xuweijia-buaa
Copy link
Contributor

Xuweijia-buaa commented Jul 20, 2023

Summary

Current multi-GPU version just support one node scenario, we can add support for multi-node Multi-GPU scenario for new CUDA version, to deal with big data that can not fit in one node.

And we can also improve efficiency for such scenario. In the current Multi-GPU version, each device loads a complete histogram and performs follow-up work based on it. When dealing with datasets that have too many features, the full histogram may be too large to fit in one device. Even the histogram can fit in, each device will do repetitive work afterwards.

We can use a mixed data and feature parallel strategy in such scenario, like in the distributed data parallel scenario, alleviate loading and computing pressure on each device.

@Xuweijia-buaa Xuweijia-buaa changed the title [CUDA] support multi-node multi-GPU scenario for new CUDA version [CUDA] add multi-node multi-GPU support for new CUDA version and imporve effiency Jul 20, 2023
@Xuweijia-buaa Xuweijia-buaa changed the title [CUDA] add multi-node multi-GPU support for new CUDA version and imporve effiency [CUDA] add multi-node multi-GPU support for new CUDA version and imporve efficiency Jul 20, 2023
@Xuweijia-buaa Xuweijia-buaa changed the title [CUDA] add multi-node multi-GPU support for new CUDA version and imporve efficiency [CUDA] add multi-node multi-GPU support for new CUDA version and improve efficiency Jul 20, 2023
@jameslamb
Copy link
Collaborator

Added this to #2302 with other feature requests. Thanks for documenting it!

Per this repo's policy described there, I'm closing this discussion for now. Anyone wishing to implement this, leave a comment saying that you're working on it and we can re-open this discussion.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants