Success with OPT-175B #1

Open
taesiri opened this issue Jul 26, 2022 · 2 comments

Comments

@taesiri

taesiri commented Jul 26, 2022

Hello,

Thank you for sharing this great implementation with the community.

I just wanted to open this issue to share my success running the OPT-175B model on a DGX station.

[Image: screenshot, 2022-07-25 at 8:39:03 PM]

The model takes ~3 minutes to load and uses ~58% of the memory on each of the first 7 GPUs and 28% on the last one.

Please feel free to close this issue.

@BenfengXu

Congratulations! May I ask about the specific configuration of your DGX station? Is it 8x A100 (40GB) or 8x A100 (80GB)?

@taesiri
Author

taesiri commented Aug 30, 2022

@BenfengXu You need the 8x80GB variant, as the model does not fit in 8x40GB (unless you do some tricks like this).
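
For anyone who wants to sanity-check the 8x40GB vs. 8x80GB point, here is a rough back-of-the-envelope sketch. It assumes the weights are stored in fp16 (2 bytes per parameter), takes 175e9 as the nominal parameter count, treats 1 GB as 1e9 bytes, and assumes the usage percentages reported above refer to 80 GB cards. None of these assumptions are stated in this thread, and the helper names are made up for illustration.

```python
# Rough memory estimate; every input here is an assumption, not a measurement.
PARAMS = 175e9        # nominal parameter count of OPT-175B
BYTES_PER_PARAM = 2   # assumes fp16 weights
GB = 1e9              # decimal gigabytes


def weights_gb(params=PARAMS, bytes_per_param=BYTES_PER_PARAM):
    """Memory needed for the weights alone, in GB."""
    return params * bytes_per_param / GB


def fits(num_gpus, gb_per_gpu, needed_gb):
    """True if the total GPU memory is at least needed_gb."""
    return num_gpus * gb_per_gpu >= needed_gb


needed = weights_gb()

# Usage implied by the report above: ~58% on 7 GPUs, 28% on the last one,
# assuming 80 GB per GPU.
reported = (7 * 0.58 + 0.28) * 80

print(f"weights alone: ~{needed:.0f} GB")
print(f"fits in 8x40GB: {fits(8, 40, needed)}")
print(f"fits in 8x80GB: {fits(8, 80, needed)}")
print(f"usage implied by the report: ~{reported:.0f} GB")
```

Under these assumptions the weights alone need roughly 350 GB, which is more than the 320 GB available on 8x40GB and close to the usage implied by the percentages reported above. Activations and framework overhead come on top of that and are not modeled here.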
