✨✨Latest Advances on Multimodal Large Language Models
Updated Dec 13, 2024
[EMNLP 2024 🔥] Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions
Mixture-of-Experts for Large Vision-Language Models
[NeurIPS 2024] This repo contains evaluation code for the paper "Are We on the Right Way for Evaluating Large Vision-Language Models?"
[NeurIPS'24 & ICMLW'24] CARES: A Comprehensive Benchmark of Trustworthiness in Medical Vision Language Models
[ECCV 2024] API: Attention Prompting on Image for Large Vision-Language Models
[NeurIPS 2024] Hawk: Learning to Understand Open-World Video Anomalies
Awesome Large Vision-Language Model: A Curated List of Large Vision-Language Models
This is the official repository of LLAVIDAL
Official code for our paper: "VL-Uncertainty: Detecting Hallucination in Large Vision-Language Model via Uncertainty Estimation".
Leverages a multimodal large vision-language model for quantitative analysis