Stars: 57.0k
| Created at: 2023-10-06
| Last updated: 2025-01-17
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
https://github.com/lllyasviel/Fooocus
Stars: 42.7k
| Created at: 2023-08-09
| Last updated: 2025-01-17
Focus on prompting and generating
https://github.com/GitHubDaily/GitHubDaily
Stars: 33.7k
| Created at: 2018-12-30
| Last updated: 2025-01-17
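Regularly shares high-quality, interesting, and practical open-source technical tutorials, developer tools, programming websites, and tech news from GitHub. A list of cool, interesting GitHub projects.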
https://github.com/Zeyi-Lin/HivisionIDPhotos
Stars: 14.3k
| Created at: 2023-06-18
| Last updated: 2025-01-17
⚡️HivisionIDPhotos: a lightweight and efficient AI tool and algorithm for making ID photos.
https://github.com/KwaiVGI/LivePortrait
Stars: 13.7k
| Created at: 2024-07-03
| Last updated: 2025-01-17
Bring portraits to life!
https://github.com/PKU-YuanGroup/Open-Sora-Plan
Stars: 11.8k
| Created at: 2024-02-20
| Last updated: 2025-01-17
This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to it.
https://github.com/hua1995116/awesome-ai-painting
Stars: 11.4k
| Created at: 2022-10-08
| Last updated: 2025-01-17
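A collection of AI painting resources (platforms available in China and abroad, usage tutorials, parameter tutorials, deployment tutorials, industry news, and more): Stable Diffusion, AnimateDiff, Stable Cascade, Stable SDXL Turbo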
https://github.com/instantX-research/InstantID
Stars: 11.3k
| Created at: 2023-12-11
| Last updated: 2025-01-17
InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥
https://github.com/guoyww/AnimateDiff
Stars: 10.9k
| Created at: 2023-06-17
| Last updated: 2025-01-17
Official implementation of AnimateDiff.
https://github.com/THUDM/CogVideo
Stars: 10.3k
| Created at: 2022-05-29
| Last updated: 2025-01-17
Text- and image-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
https://github.com/TencentARC/PhotoMaker
Stars: 9.7k
| Created at: 2023-12-06
| Last updated: 2025-01-17
PhotoMaker [CVPR 2024]
https://github.com/SillyTavern/SillyTavern
Stars: 9.4k
| Created at: 2023-02-09
| Last updated: 2025-01-17
LLM Frontend for Power Users.
https://github.com/NVIDIA/TensorRT-LLM
Stars: 9.2k
| Created at: 2023-08-16
| Last updated: 2025-01-17
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
https://github.com/fudan-generative-vision/hallo
Stars: 8.1k
| Created at: 2024-06-12
| Last updated: 2025-01-17
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
https://github.com/digoal/blog
Stars: 8.1k
| Created at: 2015-08-02
| Last updated: 2025-01-17
Open source, Database, AI, Business, Minds. git clone --depth 1 https://github.com/digoal/blog
https://github.com/Acly/krita-ai-diffusion
Stars: 7.7k
| Created at: 2023-09-01
| Last updated: 2025-01-17
Streamlined interface for generating images with AI in Krita. Inpaint and outpaint with optional text prompt, no tweaking required.
https://github.com/Tencent/HunyuanVideo
Stars: 7.5k
| Created at: 2024-11-28
| Last updated: 2025-01-17
HunyuanVideo: A Systematic Framework For Large Video Generation Model
https://github.com/LiheYoung/Depth-Anything
Stars: 7.2k
| Created at: 2024-01-22
| Last updated: 2025-01-17
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
https://github.com/jagenjo/litegraph.js
Stars: 7.1k
| Created at: 2013-09-26
| Last updated: 2025-01-17
A graph node engine and editor written in JavaScript, similar to PD or UDK Blueprints; it comes with its own editor in HTML5 Canvas2D. The engine can run client-side or server-side using Node. It allows exporting graphs as JSON so they can be included in applications independently.
https://github.com/AbdBarho/stable-diffusion-webui-docker
Stars: 6.9k
| Created at: 2022-08-27
| Last updated: 2025-01-17
Easy Docker setup for Stable Diffusion with user-friendly UI
https://github.com/modelscope/DiffSynth-Studio
Stars: 6.7k
| Created at: 2023-12-07
| Last updated: 2025-01-16
Enjoy the magic of Diffusion models!
https://github.com/sczhou/ProPainter
Stars: 5.8k
| Created at: 2023-09-01
| Last updated: 2025-01-17
[ICCV 2023] ProPainter: Improving Propagation and Transformer for Video Inpainting
https://github.com/tencent-ailab/IP-Adapter
Stars: 5.6k
| Created at: 2023-08-16
| Last updated: 2025-01-17
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with an image prompt.
https://github.com/Doubiiu/ToonCrafter
Stars: 5.5k
| Created at: 2024-05-28
| Last updated: 2025-01-17
[SIGGRAPH Asia 2024, Journal Track] ToonCrafter: Generative Cartoon Interpolation
https://github.com/aigc-apps/sd-webui-EasyPhoto
Stars: 5.0k
| Created at: 2023-08-28
| Last updated: 2025-01-17
📷 EasyPhoto | Your Smart AI Photo Generator.
https://github.com/AILab-CVC/YOLO-World
Stars: 4.9k
| Created at: 2024-01-29
| Last updated: 2025-01-17
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
https://github.com/Stability-AI/StableSwarmUI
Stars: 4.7k
| Created at: 2023-05-12
| Last updated: 2025-01-17
StableSwarmUI, A Modular Stable Diffusion Web-User-Interface, with an emphasis on making powertools easily accessible, high performance, and extensibility.
https://github.com/luosiallen/latent-consistency-model
Stars: 4.4k
| Created at: 2023-10-06
| Last updated: 2025-01-16
Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference
https://github.com/DepthAnything/Depth-Anything-V2
Stars: 4.4k
| Created at: 2024-06-13
| Last updated: 2025-01-17
[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation
https://github.com/fudan-generative-vision/champ
Stars: 4.1k
| Created at: 2024-03-17
| Last updated: 2025-01-17
Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance
https://github.com/Kwai-Kolors/Kolors
Stars: 4.1k
| Created at: 2024-07-05
| Last updated: 2025-01-17
Kolors Team
https://github.com/philz1337x/clarity-upscaler
Stars: 4.0k
| Created at: 2024-03-15
| Last updated: 2025-01-18
Clarity AI | AI Image Upscaler & Enhancer - free and open-source Magnific Alternative
https://github.com/Tencent/HunyuanDiT
Stars: 3.8k
| Created at: 2024-05-10
| Last updated: 2025-01-17
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
https://github.com/leejet/stable-diffusion.cpp
Stars: 3.7k
| Created at: 2023-08-13
| Last updated: 2025-01-17
Stable Diffusion and Flux in pure C/C++
https://github.com/TencentARC/InstantMesh
Stars: 3.6k
| Created at: 2024-04-10
| Last updated: 2025-01-17
InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models
https://github.com/antgroup/echomimic
Stars: 3.4k
| Created at: 2024-07-03
| Last updated: 2025-01-17
EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
https://github.com/TMElyralab/MuseTalk
Stars: 3.3k
| Created at: 2024-03-26
| Last updated: 2025-01-17
MuseTalk: Real-Time High Quality Lip Synchronization with Latent Space Inpainting
https://github.com/AiuniAI/Unique3D
Stars: 3.2k
| Created at: 2024-05-30
| Last updated: 2025-01-17
[NeurIPS 2024] Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image
https://github.com/VinsonLaro/stable-diffusion-webui-chinese
Stars: 3.1k
| Created at: 2022-10-10
| Last updated: 2025-01-17
Chinese localization extension for stable-diffusion-webui
https://github.com/ToTheBeginning/PuLID
Stars: 3.0k
| Created at: 2024-04-17
| Last updated: 2025-01-17
[NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment
https://github.com/PixArt-alpha/PixArt-alpha
Stars: 2.9k
| Created at: 2023-10-12
| Last updated: 2025-01-17
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
https://github.com/PeterH0323/Streamer-Sales
Stars: 2.8k
| Created at: 2024-04-05
| Last updated: 2025-01-17
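Streamer-Sales (销冠, "top seller"): a sales-streamer LLM 🛒🎁 that presents products based on their given features, with the aim of stimulating viewers' desire to buy. 🚀⭐ Includes a detailed data generation pipeline❗ 📦 Also integrates LMDeploy accelerated inference 🚀, RAG (retrieval-augmented generation) 📚, TTS (text-to-speech) 🔊, digital human generation 🦸, an agent that queries the web for real-time information 🌐, ASR (speech-to-text) 🎙️, a Vue-based frontend 🍍, a FastAPI backend 🗝️, and Docker Compose packaging and deployment 🐋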
https://github.com/NVlabs/Sana
Stars: 2.7k
| Created at: 2024-10-11
| Last updated: 2025-01-17
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
https://github.com/genmoai/mochi
Stars: 2.7k
| Created at: 2024-09-11
| Last updated: 2025-01-17
The best OSS video generation models
https://github.com/Doubiiu/DynamiCrafter
Stars: 2.7k
| Created at: 2023-11-27
| Last updated: 2025-01-17
[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
https://github.com/xyflow/awesome-node-based-uis
Stars: 2.6k
| Created at: 2022-11-14
| Last updated: 2025-01-17
A curated list with resources about node-based UIs
https://github.com/ant-research/MagicQuill
Stars: 2.6k
| Created at: 2024-11-12
| Last updated: 2025-01-17
Official Implementations for Paper - MagicQuill: An Intelligent Interactive Image Editing System
https://github.com/dusty-nv/jetson-containers
Stars: 2.6k
| Created at: 2020-04-29
| Last updated: 2025-01-17
Machine Learning Containers for NVIDIA Jetson and JetPack-L4T
https://github.com/Lightricks/LTX-Video
Stars: 2.6k
| Created at: 2024-11-20
| Last updated: 2025-01-17
Official repository for LTX-Video
https://github.com/TMElyralab/MuseV
Stars: 2.6k
| Created at: 2024-03-25
| Last updated: 2025-01-17
MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising
https://github.com/Tencent/Hunyuan3D-1
Stars: 2.6k
| Created at: 2024-10-31
| Last updated: 2025-01-17
Tencent Hunyuan3D-1.0: A Unified Framework for Text-to-3D and Image-to-3D Generation
https://github.com/butaixianran/Stable-Diffusion-Webui-Civitai-Helper
Stars: 2.5k
| Created at: 2023-03-07
| Last updated: 2025-01-15
Stable Diffusion Webui Extension for Civitai, to manage your models much more easily.
https://github.com/TMElyralab/MusePose
Stars: 2.4k
| Created at: 2024-05-24
| Last updated: 2025-01-17
MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation
https://github.com/antgroup/echomimic_v2
Stars: 2.3k
| Created at: 2024-11-20
| Last updated: 2025-01-17
EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation
https://github.com/IceClear/StableSR
Stars: 2.3k
| Created at: 2023-04-02
| Last updated: 2025-01-16
[IJCV2024] Exploiting Diffusion Prior for Real-World Image Super-Resolution
https://github.com/tencent-ailab/V-Express
Stars: 2.3k
| Created at: 2024-05-21
| Last updated: 2025-01-17
V-Express aims to generate a talking head video under the control of a reference image, an audio clip, and a sequence of V-Kps images.
https://github.com/taishi-i/awesome-ChatGPT-repositories
Stars: 2.3k
| Created at: 2023-04-02
| Last updated: 2025-01-17
A curated list of resources dedicated to open source GitHub repositories related to ChatGPT
https://github.com/KohakuBlueleaf/LyCORIS
Stars: 2.2k
| Created at: 2023-02-27
| Last updated: 2025-01-17
Lora beYond Conventional methods, Other Rank adaptation Implementations for Stable diffusion.
https://github.com/Alpha-VLLM/Lumina-T2X
Stars: 2.1k
| Created at: 2024-03-28
| Last updated: 2025-01-17
Lumina-T2X is a unified framework for Text to Any Modality Generation
https://github.com/jbilcke-hf/clapper
Stars: 2.1k
| Created at: 2024-05-31
| Last updated: 2025-01-17
Clapper.app, a video synthesizer and sequencer designed for the age of AI cinema
https://github.com/adieyal/sd-dynamic-prompts
Stars: 2.1k
| Created at: 2022-10-08
| Last updated: 2025-01-14
A custom script for AUTOMATIC1111/stable-diffusion-webui to implement a tiny template language for random prompt generation
https://github.com/lllyasviel/LayerDiffuse
Stars: 2.1k
| Created at: 2024-02-27
| Last updated: 2025-01-17
Transparent Image Layer Diffusion using Latent Transparency
https://github.com/PRIS-CV/DemoFusion
Stars: 2.0k
| Created at: 2023-10-29
| Last updated: 2025-01-12
Let us democratise high-resolution generation! (CVPR 2024)
https://github.com/uhub/awesome-c
Stars: 2.0k
| Created at: 2015-08-12
| Last updated: 2025-01-17
A curated list of awesome C frameworks, libraries and software.
https://github.com/sergree/matchering
Stars: 1.9k
| Created at: 2018-09-28
| Last updated: 2025-01-17
🎚️ Open Source Audio Matching and Mastering
https://github.com/thisjam/sd-webui-oldsix-prompt
Stars: 1.8k
| Created at: 2023-07-27
| Last updated: 2025-01-17
Chinese prompt plugin for sd-webui; a must-have for veterans and beginners alike.
https://github.com/xinsir6/ControlNetPlus
Stars: 1.8k
| Created at: 2024-07-02
| Last updated: 2025-01-17
ControlNet++: All-in-one ControlNet for image generation and editing!
https://github.com/XLabs-AI/x-flux
Stars: 1.8k
| Created at: 2024-08-05
| Last updated: 2025-01-17
(No description provided)
https://github.com/ChenyangSi/FreeU
Stars: 1.8k
| Created at: 2023-09-14
| Last updated: 2025-01-16
FreeU: Free Lunch in Diffusion U-Net (CVPR2024 Oral)
https://github.com/kijai/ComfyUI-LivePortraitKJ
Stars: 1.8k
| Created at: 2024-07-04
| Last updated: 2025-01-17
ComfyUI nodes for LivePortrait
Included Nodes (10)
- DownloadAndLoadLivePortraitModels
- KeypointScaler, KeypointsToImage
- LivePortraitComposite, LivePortraitCropper, LivePortraitLoadCropper, LivePortraitLoadFaceAlignmentCropper, LivePortraitLoadMediaPipeCropper, LivePortraitProcess, LivePortraitRetargeting
Stars: 1.7k
| Created at: 2024-02-29
| Last updated: 2025-01-17
PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
https://github.com/instantX-research/InstantStyle
Stars: 1.7k
| Created at: 2023-12-22
| Last updated: 2025-01-15
InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥
https://github.com/Coyote-A/ultimate-upscale-for-automatic1111
Stars: 1.7k
| Created at: 2023-01-02
| Last updated: 2025-01-15
(No description provided)
https://github.com/aigc-apps/EasyAnimate
Stars: 1.7k
| Created at: 2024-04-11
| Last updated: 2025-01-17
📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion
Included Nodes (10)
- EasyAnimate_TextBox, EasyAnimateI2VSampler, EasyAnimateT2VSampler, EasyAnimateV2VSampler, EasyAnimateV5_I2VSampler, EasyAnimateV5_T2VSampler, EasyAnimateV5_V2VSampler
- LoadEasyAnimateLora, LoadEasyAnimateModel
- TextBox
Stars: 1.6k
| Created at: 2022-08-17
| Last updated: 2025-01-17
[CAAI AIR'24] Bilateral Reference for High-Resolution Dichotomous Image Segmentation