Skip to content

Actions: microsoft/DeepSpeed

nv-accelerate-v100

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
570 workflow run results
570 workflow run results

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

nv-accelerate-v100
nv-accelerate-v100 #7733: Scheduled
January 3, 2024 00:05 10m 48s master
January 3, 2024 00:05 10m 48s
nv-accelerate-v100
nv-accelerate-v100 #7732: Merge group checks requested
January 2, 2024 23:44 11m 26s
January 2, 2024 23:44 11m 26s
Addressing ipg Buffer Data Race Condition in Zero Stage2
nv-accelerate-v100 #7731: Pull request #3727 synchronize by loadams
January 2, 2024 23:41 10m 44s xxr3376:master
January 2, 2024 23:41 10m 44s
Pipeline: Add support to eval micro bs configuration
nv-accelerate-v100 #7730: Pull request #4859 synchronize by loadams
January 2, 2024 23:36 12m 46s nelyahu:pp_eval_micro_bs
January 2, 2024 23:36 12m 46s
Update transformers workflow to use latest torch
nv-accelerate-v100 #7729: Pull request #4798 synchronize by loadams
January 2, 2024 23:36 11m 16s loadams/transformers-torch-update
January 2, 2024 23:36 11m 16s
Update transformers workflow to use latest torch
nv-accelerate-v100 #7728: Pull request #4798 synchronize by loadams
January 2, 2024 21:21 8m 27s loadams/transformers-torch-update
January 2, 2024 21:21 8m 27s
nv-accelerate-v100
nv-accelerate-v100 #7727: Merge group checks requested
January 2, 2024 20:58 8m 35s
January 2, 2024 20:58 8m 35s
Unit tests for MiCS
nv-accelerate-v100 #7725: Pull request #4792 synchronize by loadams
January 2, 2024 19:21 5m 3s zarzen:mics-unittests
January 2, 2024 19:21 5m 3s
[NPU] Fix npu offload bug
nv-accelerate-v100 #7724: Pull request #4883 synchronize by loadams
January 2, 2024 18:18 17m 22s CurryRice233:offload
January 2, 2024 18:18 17m 22s
nv-accelerate-v100
nv-accelerate-v100 #7723: Merge group checks requested
January 2, 2024 18:18 5m 9s
January 2, 2024 18:18 5m 9s
Retrieve CUDA available memory via torch.cuda.mem_get_info()
nv-accelerate-v100 #7722: Pull request #4847 synchronize by loadams
January 2, 2024 17:50 9m 27s XuehaiPan:cuda-available-memory
January 2, 2024 17:50 9m 27s
Update transformers workflow to use latest torch
nv-accelerate-v100 #7721: Pull request #4798 synchronize by loadams
January 2, 2024 17:44 9m 29s loadams/transformers-torch-update
January 2, 2024 17:44 9m 29s
Nvme offload checkpoint
nv-accelerate-v100 #7720: Pull request #4707 synchronize by loadams
January 2, 2024 17:28 9m 0s nvme_offload_checkpoint
January 2, 2024 17:28 9m 0s
add sharded loading for safetensors in AutoTP
nv-accelerate-v100 #7719: Pull request #4854 synchronize by loadams
January 2, 2024 16:41 9m 32s sywangyi:safetensor_autoTP
January 2, 2024 16:41 9m 32s
Fix f-string messages
nv-accelerate-v100 #7718: Pull request #4865 synchronize by loadams
January 2, 2024 16:38 9m 37s li-plus:fix-fstr
January 2, 2024 16:38 9m 37s
[Fix] Fix cpu inference UT failure
nv-accelerate-v100 #7717: Pull request #4430 synchronize by loadams
January 2, 2024 16:16 9m 13s delock:gma/fix_cpu_inference
January 2, 2024 16:16 9m 13s
nv-accelerate-v100
nv-accelerate-v100 #7715: Merge group checks requested
January 2, 2024 13:19 10m 21s
January 2, 2024 13:19 10m 21s
[NPU] Fix npu offload bug
nv-accelerate-v100 #7713: Pull request #4883 synchronize by tjruwase
January 2, 2024 05:54 10m 39s CurryRice233:offload
January 2, 2024 05:54 10m 39s
nv-accelerate-v100
nv-accelerate-v100 #7712: Scheduled
January 2, 2024 00:05 9m 14s master
January 2, 2024 00:05 9m 14s
optimize grad_norm calculation in stage3.py
nv-accelerate-v100 #7710: Pull request #4436 synchronize by ShadenSmith
January 1, 2024 04:50 9m 8s mmhab:optimize_grad_norm_calc
January 1, 2024 04:50 9m 8s
nv-accelerate-v100
nv-accelerate-v100 #7709: Scheduled
January 1, 2024 00:06 8m 4s master
January 1, 2024 00:06 8m 4s
nv-accelerate-v100
nv-accelerate-v100 #7708: Scheduled
December 31, 2023 00:06 7m 7s master
December 31, 2023 00:06 7m 7s
add sharded loading for safetensors in AutoTP
nv-accelerate-v100 #7707: Pull request #4854 synchronize by tjruwase
December 30, 2023 13:58 10m 38s sywangyi:safetensor_autoTP
December 30, 2023 13:58 10m 38s