Popular repositories
- lp-sparsemap (Public)
  LP-SparseMAP: Differentiable sparse structured prediction in coarse factor graphs
Repositories
Showing 10 of 65 repositories
- Megatron-LM-pretrain (Public, forked from NVIDIA/Megatron-LM)
  Ongoing research training transformer models at scale
- reranking-laws (Public)
- mt-pref-alignment (Public)
- tower-eval (Public)
- nanotron (Public, forked from huggingface/nanotron)
  Minimalistic large language model 3D-parallelism training
- infinite-former (Public)