infV2 fix for OPT size variants #4694

mrwyattii · 2023-11-16T19:29:37Z

The OPT model has inconsistent checkpoint layer names:
125m, 1.3b, 2.7b, 66b: model.decoder.* and lm_head.weights
6.7b, 13b, 30b: decoder.* and no lm_head.weights
350m: decoder.* and project_*

This PR extends support to all OPT models except 350m. We will have a future PR to handle the unique features of this model.

As part of this PR, wildcards can now be used in the model container PARAM_MAPPINGS, such as *decoder.embed_tokens.weights

Co-authored-by: Jeff Rasley <[email protected]>

fixes for different OPT model size variants

6c6c575

mrwyattii requested review from RezaYazdaniAminabadi, jeffra, awan-10, cmikeh2 and arashb as code owners November 16, 2023 19:29

mrwyattii added 2 commits November 16, 2023 11:30

Merge branch 'master' into mrwyattii/infv2-fix-OPT

102a088

Merge branch 'master' into mrwyattii/infv2-fix-OPT

3a67924

mrwyattii mentioned this pull request Nov 16, 2023

Unable to load relatively large opt models (opt-6.7b opt-30b) microsoft/DeepSpeed-MII#305

Open

mrwyattii added 2 commits November 16, 2023 14:01

fix for unit test failure

81c1dd7

reduce time for cloning large repos

4833818

mrwyattii requested a review from loadams as a code owner November 16, 2023 22:14

jeffra approved these changes Nov 17, 2023

View reviewed changes

Merge branch 'master' into mrwyattii/infv2-fix-OPT

9ab1e65

mrwyattii merged commit a3926bb into master Nov 17, 2023
14 of 16 checks passed

mrwyattii deleted the mrwyattii/infv2-fix-OPT branch November 17, 2023 00:17

mauryaavinash95 pushed a commit to mauryaavinash95/DeepSpeed that referenced this pull request Feb 17, 2024

infV2 fix for OPT size variants (microsoft#4694)

ae4f027

Co-authored-by: Jeff Rasley <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

infV2 fix for OPT size variants #4694

infV2 fix for OPT size variants #4694

mrwyattii commented Nov 16, 2023

infV2 fix for OPT size variants #4694

infV2 fix for OPT size variants #4694

Conversation

mrwyattii commented Nov 16, 2023