deepspeed

Personal Project: MPP-Qwen14B & MPP-Qwen-Next(Multimodal Pipeline Parallel based on Qwen-LM). Support [video/image/multi-image] {sft/conversations}. Don't let the poverty limit your imagination! Train your own 8B/14B LLaVA-training-like MLLM on RTX3090/4090 24GB.

fine-tuning pipeline-parallelism pretraining model-parallel deepspeed mllm multimodal-large-language-models qwen video-large-language-models video-language-model

Updated Sep 24, 2024
Jupyter Notebook

intelligent-machine-learning / glake

Star

GLake: optimizing GPU memory management and IO transmission.

memory gpu pytorch onnx deepspeed llm

Updated Aug 3, 2024
Python

sunzeyeah / RLHF

Star

Implementation of Chinese ChatGPT

nlp deep-learning pytorch glm pangu deepspeed chatgpt

Updated Nov 20, 2023
Python

LambdaLabsML / distributed-training-guide

Star

Best practices & guides on how to write distributed pytorch training code

gpu cluster mpi cuda slurm pytorch sharding kuberentes distributed-training nccl gpu-cluster deepspeed fsdp lambdalabs

Updated Oct 30, 2024
Python

stanleylsx / llms_tool

Star

一个基于HuggingFace开发的大语言模型训练、测试工具。支持各模型的webui、终端预测，低参数量及全参数模型训练(预训练、SFT、RM、PPO、DPO)和融合、量化。

bloom pytorch falcon llama moss mistral aquila baichuan deepspeed chatglm chatglm2 internlm llama2 qwen xverse baichuan2 aquila2 chatglm3

Updated Dec 8, 2023
Python

git-cloner / llama2-lora-fine-tuning

Star

llama2 finetuning with deepspeed and lora

lora finetuning deepspeed llama2

Updated Jul 28, 2023
Python

jackaduma / ChatGLM-LoRA-RLHF-PyTorch

Star

A full pipeline to finetune ChatGLM LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the ChatGLM architecture. Basically ChatGPT but with ChatGLM

pytorch llama gpt lora finetune ppo peft deepspeed llm chatgpt rlhf reward-models chatglm chatglm-6b

Updated Apr 28, 2023
Python

HomebrewNLP / revlib

Star

Simple and efficient RevNet-Library for PyTorch with XLA and DeepSpeed support and parameter offload

deep-learning pytorch tpu revnet xla deepspeed momentumnet

Updated Aug 6, 2022
Python

bobo0810 / LearnDeepSpeed

Star

DeepSpeed教程 & 示例注释 & 学习笔记（大模型高效训练）

examples deepspeed large-language-models

Updated Sep 7, 2023
Python

openpsi-project / ReaLHF

Star

Super-Efficient RLHF Training of LLMs with Parameter Reallocation

distributed-systems reinforcement-learning distributed-computing transformers large-scale-machine-learning deepspeed megatron-lm large-language-models llm reinforcement-learning-from-human-feedback llm-training llm-framework

Updated Sep 20, 2024
Python

CoinCheung / gdGPT

Star

Train llm (bloom, llama, baichuan2-7b, chatglm3-6b) with deepspeed pipeline mode. Faster than zero/zero++/fsdp.

nlp bloom pipeline pytorch deepspeed llm full-finetune model-parallization flash-attention llama2 baichuan2-7b chatglm3-6b mixtral-8x7b

Updated Feb 5, 2024
Python

OpenCSGs / llm-inference

Star

llm-inference is a platform for publishing and managing llm inference, providing a wide range of out-of-the-box features for model deployment, such as UI, RESTful API, auto-scaling, computing resource management, monitoring, and more.

transformer ray deepspeed llama-cpp vllm llm-inference

Updated May 17, 2024
Python

Improve this page

Add a description, image, and links to the deepspeed topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the deepspeed topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

deepspeed

Here are 71 public repositories matching this topic...

InternLM / lmdeploy

OpenRLHF / OpenRLHF

PKU-Alignment / safe-rlhf

zjunlp / KnowLM

alibaba / Megatron-LLaMA

Xirider / finetune-gpt2xl

shm007g / LLaMA-Cult-and-More

OpenMOSS / CoLLiE

Coobiw / MPP-LLaVA

intelligent-machine-learning / glake

sunzeyeah / RLHF

LambdaLabsML / distributed-training-guide

stanleylsx / llms_tool

git-cloner / llama2-lora-fine-tuning

jackaduma / ChatGLM-LoRA-RLHF-PyTorch

HomebrewNLP / revlib

bobo0810 / LearnDeepSpeed

openpsi-project / ReaLHF

CoinCheung / gdGPT

OpenCSGs / llm-inference

Improve this page

Add this topic to your repo