Add new model: simple_gpt_tp_manual #1969

xmfan · 2023-10-07T00:37:13Z

Similar to simple_gpt, but instead of applying Tensor Parallelism (TP) through the DTensor API, this model shards the weights manually and calls functional collectives directly. There are two main reasons to add it:

DTensor + compile is not ready yet.
DTensor has CPU overhead, so adding this lower-overhead model will help us track improvements and regressions.
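
For readers unfamiliar with manual TP, the idea can be sketched roughly as follows. This is not the code from this PR: it is a single-process illustration in which plain Python lists stand in for tensors, a loop over ranks stands in for separate processes, and a local element-wise sum stands in for the all-reduce functional collective. All function names here (`shard_columns`, `shard_rows`, `all_reduce_sum`, `tp_mlp`) are hypothetical.

```python
# Single-process illustration of manual tensor-parallel weight sharding.
# Assumptions: lists stand in for tensors, a loop stands in for ranks,
# and a local sum stands in for the all-reduce collective.

def matmul(a, b):
    """Multiply an (n x k) matrix by a (k x m) matrix."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def relu(m):
    """Element-wise ReLU, so it can be applied to a local shard independently."""
    return [[max(v, 0) for v in row] for row in m]

def shard_columns(w, world_size, rank):
    """Column-parallel shard: this rank keeps a contiguous slice of output columns."""
    step = len(w[0]) // world_size
    return [row[rank * step:(rank + 1) * step] for row in w]

def shard_rows(w, world_size, rank):
    """Row-parallel shard: this rank keeps a contiguous slice of input rows."""
    step = len(w) // world_size
    return w[rank * step:(rank + 1) * step]

def all_reduce_sum(partials):
    """Element-wise sum of the per-rank partial outputs."""
    return [[sum(vals) for vals in zip(*rows)] for rows in zip(*partials)]

def tp_mlp(x, w1, w2, world_size):
    """Two-layer MLP with w1 split by columns and w2 split by rows."""
    partials = []
    for rank in range(world_size):
        hidden = relu(matmul(x, shard_columns(w1, world_size, rank)))
        partials.append(matmul(hidden, shard_rows(w2, world_size, rank)))
    # One reduction at the end recovers the unsharded result.
    return all_reduce_sum(partials)
```

Splitting the first weight by columns and the second by rows means the activation stays local to each rank and only one reduction is needed per block. In a real multi-process run, each rank would hold only its own shards and the final sum would be a collective call rather than a local loop.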

Tests:

in benchmark/
python test.py -k "test_simple_gpt_tp_manual_"

in pytorch/
PYTHONPATH=benchmark/ python pytorch/benchmarks/dynamo/torchbench.py --float16 -dcuda --inference --backend=inductor --multiprocess --performance --only simple_gpt_tp_manual

facebook-github-bot · 2023-10-10T17:51:20Z

@xmfan has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

facebook-github-bot · 2023-10-10T19:55:32Z

@xmfan merged this pull request in cdd87f0.

facebook-github-bot added the cla signed label Oct 7, 2023

xmfan had a problem deploying to docker-s3-upload October 7, 2023 00:37 — with GitHub Actions Error

Add new model: simple_gpt_tp_manual

320111e

xmfan force-pushed the xmfan/simple_gpt_manual_tp branch from 5b068d7 to 320111e Compare October 7, 2023 01:01

xmfan temporarily deployed to docker-s3-upload October 7, 2023 01:02 — with GitHub Actions Inactive

xuzhao9 approved these changes Oct 9, 2023

fix distributed init logic for torchbench dynamo runner

567e5d2

xmfan temporarily deployed to docker-s3-upload October 9, 2023 22:15 — with GitHub Actions Inactive

xmfan marked this pull request as ready for review October 9, 2023 22:41

facebook-github-bot closed this in cdd87f0 Oct 10, 2023

facebook-github-bot added the Merged label Oct 10, 2023
