Skip to content

Actions: microsoft/DeepSpeed

nv-lightning-v100

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
1,754 workflow run results
1,754 workflow run results

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Support Triton 2.2+
nv-lightning-v100 #9890: Pull request #4937 synchronize by loadams
February 20, 2024 23:10 19m 26s loadams/triton-22-update
February 20, 2024 23:10 19m 26s
Rename and update to torch 1.11
nv-lightning-v100 #9889: Pull request #5141 synchronize by loadams
February 20, 2024 23:10 15m 59s loadams/update-torch-113
February 20, 2024 23:10 15m 59s
Revert "Fix UserWarning: The torch.cuda.*DtypeTensor constructors are…
nv-lightning-v100 #9888: Pull request #5165 opened by loadams
February 20, 2024 22:48 12m 47s loadams/revert-userwarning
February 20, 2024 22:48 12m 47s
nv-lightning-v100
nv-lightning-v100 #9887: Merge group checks requested
February 20, 2024 22:42 15m 39s
February 20, 2024 22:42 15m 39s
nv-lightning-v100
nv-lightning-v100 #9886: Merge group checks requested
February 20, 2024 22:18 12m 16s
February 20, 2024 22:18 12m 16s
Update pytest and transformers with fixes for pytest>= 8.0.0
nv-lightning-v100 #9885: Pull request #5164 synchronize by loadams
February 20, 2024 21:09 8m 12s loadams/update-pytest
February 20, 2024 21:09 8m 12s
get_grad_norm_direct: fix a case of empty norm group
nv-lightning-v100 #9884: Pull request #5148 synchronize by lekurile
February 20, 2024 20:50 3m 35s nelyahu:fix_get_grad_norm_direct
February 20, 2024 20:50 3m 35s
Update pytest and transformers with fixes for pytest>= 8.0.0
nv-lightning-v100 #9883: Pull request #5164 synchronize by loadams
February 20, 2024 20:40 9m 7s loadams/update-pytest
February 20, 2024 20:40 9m 7s
get_grad_norm_direct: fix a case of empty norm group
nv-lightning-v100 #9882: Pull request #5148 synchronize by lekurile
February 20, 2024 19:43 10m 18s nelyahu:fix_get_grad_norm_direct
February 20, 2024 19:43 10m 18s
Fix gradient clipping
nv-lightning-v100 #9881: Pull request #5150 synchronize by loadams
February 20, 2024 19:42 8m 10s tohtana/fix_fp32_clipping
February 20, 2024 19:42 8m 10s
MOE: Fix save checkpoint when TP > 1
nv-lightning-v100 #9880: Pull request #5157 synchronize by mosheisland
February 20, 2024 18:51 3m 36s mosheisland:moe/ckp
February 20, 2024 18:51 3m 36s
Use ninja to speed up build
nv-lightning-v100 #9878: Pull request #5088 synchronize by mrwyattii
February 20, 2024 17:51 5m 32s jinzhen-lin:master
February 20, 2024 17:51 5m 32s
DeepSpeedZeroOptimizer_Stage3: remove cuda specific optimizer
nv-lightning-v100 #9877: Pull request #5138 synchronize by mrwyattii
February 20, 2024 17:50 3m 37s nelyahu:stage3_fused_adam_removal
February 20, 2024 17:50 3m 37s
Update pytest and transformers with fixes for pytest>= 8.0.0
nv-lightning-v100 #9876: Pull request #5164 opened by loadams
February 20, 2024 17:43 10m 16s loadams/update-pytest
February 20, 2024 17:43 10m 16s
Distributed in-memory map-reduce for data analyzer
nv-lightning-v100 #9875: Pull request #5129 synchronize by loadams
February 20, 2024 17:43 7m 9s bm-synth:distributed_data_analyzer
February 20, 2024 17:43 7m 9s
Update flops profiler to handle attn and __matmul__
nv-lightning-v100 #9873: Pull request #4724 synchronize by loadams
February 20, 2024 17:38 8m 17s KimmiShi:flops_profiler_attn
February 20, 2024 17:38 8m 17s
Pin to PyTest 8.0.0
nv-lightning-v100 #9871: Pull request #5163 opened by loadams
February 20, 2024 16:35 8m 55s loadams/pytest8
February 20, 2024 16:35 8m 55s
Use ninja to speed up build
nv-lightning-v100 #9870: Pull request #5088 synchronize by jinzhen-lin
February 20, 2024 13:43 4m 1s jinzhen-lin:master
February 20, 2024 13:43 4m 1s
Use ninja to speed up build
nv-lightning-v100 #9869: Pull request #5088 synchronize by tjruwase
February 20, 2024 13:28 9m 15s jinzhen-lin:master
February 20, 2024 13:28 9m 15s
ZeRO0 does not handle BF16 gradients properly
nv-lightning-v100 #9867: Pull request #5154 synchronize by tohtana
February 20, 2024 08:26 9m 13s tohtana/fix_bf16_opt_update_hp
February 20, 2024 08:26 9m 13s
nv-lightning-v100
nv-lightning-v100 #9865: Scheduled
February 20, 2024 00:15 11m 28s master
February 20, 2024 00:15 11m 28s
Distributed in-memory map-reduce for data analyzer
nv-lightning-v100 #9864: Pull request #5129 synchronize by bm-synth
February 19, 2024 14:42 8m 14s bm-synth:distributed_data_analyzer
February 19, 2024 14:42 8m 14s