-
Notifications
You must be signed in to change notification settings - Fork 175
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add zloss #133
Add zloss #133
Conversation
Anything we can do to get this merged? We want to polish the OLMoE release, and it would be nice if we could just take a dependency on your version instead of @Muennighoff's. |
@Muennighoff can you resolve merge conflicts and make sure tests are passing? CC: @josejg for review |
Here's our implementation for reference - main...josejg/zloss . The main difference is that the modeling code is responsible for doing @mvpatel2000 Wdyt? @Muennighoff could you refactor the PR to address these comments? We can also merge our implementation giving you credit in the PR if that's easier. |
oh just merge yours 👍 |
Sg, tracking it here - #151 |
Closing in favor of #151 |
No description provided.