Starred repositories
Deep learning face detection and recognition, implemented in PyTorch.
Detect and recognize faces from a camera feed; supports recognizing multiple faces at once.
Official PyTorch implementation of SegFormer
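Latest 2021 roundup of recommended reading for engineers: computer science, software technology, entrepreneurship, ideas, mathematics, and biographies.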
OCR, layout analysis, reading order, table recognition in 90+ languages
An inference library for sequence-based table recognition, integrating table recognition algorithms from PP-Structure, modelscope, and others.
Detect and extract table regions from images in various scenarios, and correct perspective and rotation issues.
Build multi-modal Agents with memory, knowledge, tools and reasoning. Chat with them using a beautiful Agent UI.
A collection of awesome papers on RAG for AIGC. We propose a taxonomy of RAG foundations, enhancements, and applications in the paper "Retrieval-Augmented Generation for AI-Generated Content: A Survey".
Awesome-RAG: Collect typical RAG papers and systems.
Resources on LLM-based applications that use the RAG pattern.
Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini and open-source models.
An implementation of "M3DOCRAG: Multi-modal Retrieval is What You Need for Multi-page Multi-document Understanding" by Jaemin Cho, Debanjan Mahata, Ozan Irsoy, Yujie He, and Mohit Bansal (UNC Chape…
The code used to train and run inference with the ColPali architecture.
Get up and running with Llama 3.3, Phi 4, Gemma 2, and other large language models.
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Stable Diffusion web UI
Curated tutorials and resources for Large Language Models, AI Painting, and more.
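"A Guide to Consuming Open-Source Large Models": a tutorial tailored for Chinese beginners on quickly fine-tuning (full-parameter/LoRA) and deploying Chinese and international open-source large language models (LLMs) and multimodal large models (MLLMs) in a Linux environment.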
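[LLMs Nine-Story Demon Tower] Hands-on practice and experience with LLMs in natural language processing (ChatGLM, Chinese-LLaMA-Alpaca, Vicuna, LLaMA, GPT4ALL, etc.), information retrieval (langchain), speech synthesis, speech recognition, and multimodal domains (Stable Diffusion, MiniGPT-4, VisualGLM-6B, Ziya-Visual, etc.).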
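This project shares the technical principles behind large models along with hands-on experience (large-model engineering and deploying large-model applications).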
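🔥 Officially recommended 🔥 The all-new Pro version of RuoYi-Vue, with every feature optimized and refactored. A back-office management system plus WeChat mini program built on Spring Boot + MyBatis Plus + Vue & Element, supporting RBAC dynamic permissions, data permissions, SaaS multi-tenancy, Flowable workflows, third-party login, payments, SMS, e-commerce, CRM, ERP, AI large models, and more. Your ⭐️ Star ⭐️ is what keeps the author's hair growing!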
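GLM-4 series: Open Multilingual Multimodal Chat LMs
AIFoundation covers what happens when AI systems meet large models: the full-stack core technologies for supporting large-model training and inference at the system level, from the lowest layers to the top.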
A machine learning image inpainting task that automatically removes watermarks from images, producing results indistinguishable from the ground-truth image.