Megatron-LM

ENV

Unit tests (UT)

  • script: bash test/test.sh

  • ut_list: (megatron/mpu/tests)

    • test_cross_entropy.py
    • test_data.py
    • test_initialize.py
    • test_layers.py
    • test_random.py
  • note
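
The contents of test/test.sh are not shown here. As a rough sketch only, a runner covering the ut_list above could look like the following. The function name run_ut_list, the TEST_DIR and RUNNER variables, and the choice to launch each file as a plain Python script are all assumptions made for illustration; the real script may use a distributed launcher or a different layout.

```shell
#!/usr/bin/env bash
# Hypothetical sketch of a runner similar to test/test.sh; not the actual script.

# Directory taken from the ut_list above; override via the environment if needed.
TEST_DIR="${TEST_DIR:-megatron/mpu/tests}"
# Assumption: each test file can be launched directly with the Python interpreter.
RUNNER="${RUNNER:-python}"

# Test files named in the ut_list above.
UT_LIST=(
  test_cross_entropy.py
  test_data.py
  test_initialize.py
  test_layers.py
  test_random.py
)

# Run every test in UT_LIST, print PASS/FAIL per file,
# and return the number of failed tests as the exit status.
run_ut_list() {
  local failed=0
  local t
  for t in "${UT_LIST[@]}"; do
    if $RUNNER "${TEST_DIR}/${t}"; then
      echo "PASS ${t}"
    else
      echo "FAIL ${t}"
      failed=$((failed + 1))
    fi
  done
  return "$failed"
}
```

Calling run_ut_list at the end of such a script from the repository root would run all five files in order and exit non-zero if any of them fails.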

model support plan

T5

func

  1. [] pretrain_t5.sh
    • bash test/pretrain_t5.sh
  2. pretrain_t5_distributed_with_mp.sh
    • bash test/pretrain_t5_distributed_with_mp.sh
  3. pretrain_t5_xxB.sh
  4. pretrain_t5_xxxB.sh

GPT-3

func

  1. pretrain_gpt.sh
  2. pretrain_gpt_distributed.sh
  3. pretrain_gpt_distributed_with_mp.sh
  4. pretrain_gpt3_175B.sh