Actions: microsoft/DeepSpeed

nv-accelerate-v100

570 workflow run results

Remove hooks on gradient accumulation on engine/optimizer destroy
nv-accelerate-v100 #7669: Pull request #4858 synchronize by chiragjn
December 22, 2023 03:14 12m 41s chiragjn:cj_remove_grad_acc_hooks
add sharded loading for safetensors in AutoTP
nv-accelerate-v100 #7667: Pull request #4854 synchronize by sywangyi
December 22, 2023 00:39 9m 53s sywangyi:safetensor_autoTP
Fix exception handling in get_all_ranks_from_group() function
nv-accelerate-v100 #7666: Pull request #4862 synchronize by HeyangQin
December 22, 2023 00:27 9m 21s HeyangQin/fix_issue_4853
Fix exception handling in get_all_ranks_from_group() function
nv-accelerate-v100 #7665: Pull request #4862 opened by HeyangQin
December 22, 2023 00:26 18s HeyangQin/fix_issue_4853
nv-accelerate-v100
nv-accelerate-v100 #7664: Scheduled
December 22, 2023 00:05 8m 41s master
nv-accelerate-v100
nv-accelerate-v100 #7663: Merge group checks requested
December 21, 2023 19:13 9m 27s
add sharded loading for safetensors in AutoTP
nv-accelerate-v100 #7662: Pull request #4854 synchronize by mrwyattii
December 21, 2023 18:27 6m 35s sywangyi:safetensor_autoTP
Pipeline: Add support to eval micro bs configuration
nv-accelerate-v100 #7661: Pull request #4859 opened by nelyahu
December 21, 2023 10:40 10m 16s nelyahu:pp_eval_micro_bs
Remove hooks on gradient accumulation on engine/optimizer destroy
nv-accelerate-v100 #7660: Pull request #4858 synchronize by chiragjn
December 21, 2023 10:02 10m 29s chiragjn:cj_remove_grad_acc_hooks
add sharded loading for safetensors in AutoTP
nv-accelerate-v100 #7656: Pull request #4854 opened by sywangyi
December 21, 2023 03:56 14s sywangyi:safetensor_autoTP
Update version.txt after 0.12.6 release
nv-accelerate-v100 #7654: Pull request #4850 opened by mrwyattii
December 21, 2023 00:47 9m 57s AutoPR/0.12.6
nv-accelerate-v100
nv-accelerate-v100 #7653: Scheduled
December 21, 2023 00:05 11m 23s master
Mixtral FastGen Support
nv-accelerate-v100 #7652: Pull request #4828 synchronize by mrwyattii
December 21, 2023 00:05 5m 12s cholmes/mixtral-fastgen-support
Mixtral FastGen Support
nv-accelerate-v100 #7651: Pull request #4828 synchronize by cmikeh2
December 20, 2023 23:18 5m 11s cholmes/mixtral-fastgen-support
Add Cache to Comm Group
nv-accelerate-v100 #7650: Pull request #4849 opened by cmikeh2
December 20, 2023 22:18 1h 4m 41s cholmes/comm-group-cache
Mixtral FastGen Support
nv-accelerate-v100 #7649: Pull request #4828 synchronize by cmikeh2
December 20, 2023 21:58 58m 58s cholmes/mixtral-fastgen-support
Support cpu tensors without direct device invocation
nv-accelerate-v100 #7648: Pull request #3842 synchronize by tjruwase
December 20, 2023 21:55 42m 22s abhilash1910:abhilash1910_cpu_fix
optimize grad_norm calculation in stage3.py
nv-accelerate-v100 #7647: Pull request #4436 synchronize by tjruwase
December 20, 2023 21:53 17m 34s mmhab:optimize_grad_norm_calc
nv-accelerate-v100
nv-accelerate-v100 #7646: Merge group checks requested
December 20, 2023 21:52 13m 33s
[NPU]Add ZeRO-Infinity feature for NPU
nv-accelerate-v100 #7644: Pull request #4809 synchronize by tjruwase
December 20, 2023 21:47 7m 44s misstek:npu_nvme_infinity
nv-accelerate-v100
nv-accelerate-v100 #7643: Merge group checks requested
December 20, 2023 20:51 10m 5s
Update flops profiler to handle attn and __matmul__
nv-accelerate-v100 #7642: Pull request #4724 synchronize by mrwyattii
December 20, 2023 18:43 43m 1s KimmiShi:flops_profiler_attn
engine.py: remove unused _curr_save_path
nv-accelerate-v100 #7641: Pull request #4844 synchronize by mrwyattii
December 20, 2023 18:38 8m 22s nelyahu:remove_curr_save_path
Retrieve CUDA available memory via torch.cuda.mem_get_info()
nv-accelerate-v100 #7640: Pull request #4847 synchronize by mrwyattii
December 20, 2023 18:37 47m 28s XuehaiPan:cuda-available-memory