Pull in upstream changes from argonne-lcf
@ main
#12
Reviewer's Guide by Sourcery
This pull request integrates upstream changes from the
Class diagram for updated optimizer hierarchy
classDiagram
class Optimizer {
<<interface>>
+step()
}
class Adam
class AdamW
class AdamW8bit
class GaLoreAdamW
class GaLoreAdamW8bit
class Adafactor
class GaLoreAdafactor
class SGD
class Lamb
class Shampoo
Optimizer <|-- Adam
Optimizer <|-- AdamW
Optimizer <|-- AdamW8bit
Optimizer <|-- GaLoreAdamW
Optimizer <|-- GaLoreAdamW8bit
Optimizer <|-- Adafactor
Optimizer <|-- GaLoreAdafactor
Optimizer <|-- SGD
Optimizer <|-- Lamb
Optimizer <|-- Shampoo
note for Adam "Base Adam optimizer"
note for AdamW "Adam with weight decay"
note for AdamW8bit "8-bit AdamW for memory efficiency"
note for GaLoreAdamW "Low-rank AdamW variant"
note for Lamb "Layer-wise Adaptive Moments"
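To make the hierarchy in the diagram concrete, here is a minimal sketch of an `Optimizer` interface with a single `step()` method and a name-based lookup. It is an illustration only: `OPTIMIZER_REGISTRY`, `register`, `build_optimizer`, and the toy `SGD` update are hypothetical names and logic, not the project's actual code, and the real `get_megatron_optimizer` may select optimizers quite differently.

```python
from abc import ABC, abstractmethod


class Optimizer(ABC):
    """Interface from the diagram: every optimizer exposes step()."""

    @abstractmethod
    def step(self):
        """Apply one parameter update."""


# Hypothetical registry mapping a lowercase name to an optimizer class.
OPTIMIZER_REGISTRY = {}


def register(name):
    """Class decorator that records an optimizer under `name`."""
    def decorator(cls):
        OPTIMIZER_REGISTRY[name] = cls
        return cls
    return decorator


@register("sgd")
class SGD(Optimizer):
    """Toy SGD over a list of {"value": float, "grad": float} dicts."""

    def __init__(self, params, lr=0.1):
        self.params = params
        self.lr = lr

    def step(self):
        # Plain gradient descent: value <- value - lr * grad
        for p in self.params:
            p["value"] -= self.lr * p["grad"]


def build_optimizer(name, params, **kwargs):
    """Look up an optimizer class by name (case-insensitive) and build it."""
    try:
        cls = OPTIMIZER_REGISTRY[name.lower()]
    except KeyError:
        known = ", ".join(sorted(OPTIMIZER_REGISTRY))
        raise ValueError(f"unknown optimizer {name!r}; known: {known}") from None
    return cls(params, **kwargs)
```

In this sketch, each further variant from the diagram (AdamW, Lamb, and so on) would be one more registered subclass, so the selection code would not need a growing chain of conditionals or commented-out branches.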
Hey @saforem2 - I've reviewed your changes - here's some feedback:
Overall Comments:
- The optimizer section contains commented-out code (e.g. the duplicate GaLoreAdamW section) that should be removed rather than left in as comments.
Here's what I looked at during the review
- 🟢 General issues: all looks good
- 🟢 Security: all looks good
- 🟢 Testing: all looks good
- 🟢 Complexity: all looks good
- 🟢 Documentation: all looks good
Summary by Sourcery
Update the optimizer code and configuration scripts.
Enhancements:
get_megatron_optimizer function.
Tests: