Awesome-Modality-Priors-in-MLLMs

This is a repository for organizing papres related to modality priors of Multimodal Large Language Models (MLLM).

Modality priors in multimodal large language models (MLLMs) include visual priors, language priors, etc., which refer to inherent biases or preconceptions embedded in components such as the visual encoder and language model of MLLMs. These priors come from the text data on which visual pre-training and language model training are based, and affect how the model combines other modalities to interpret and generate language. They affect the model's predictions, leading to potential biases or expectations about the relationship between different types of data in a multimodal context.

⭐ If you find this list useful, welcome to star it!

Paper List (Updating...)

Previous Study in VQA

(CVPR 2017) Making the V in VQA Matter: Elevating the Role of Image Understanding in Visual Question Answering

(CVPR 2018) Don’t Just Assume; Look and Answer: Overcoming Priors for Visual Question Answering

(NIPS 2018) Overcoming Language Priors in Visual Question Answering with Adversarial Regularization

(SIGIR 2019) Quantifying and Alleviating the Language Prior Problem in Visual Question Answering

(CVPR 2021) Counterfactual VQA: A Cause-Effect Look at Language Bias

(TIP 2021) Loss Re-Scaling VQA: Revisiting the Language Prior Problem From a Class-Imbalance View

(EMNLP 2022) Language Prior Is Not the Only Shortcut: A Benchmark for Shortcut Learning in VQA

(COLING 2022) Overcoming Language Priors in Visual Question Answering via Distinguishing Superficially Similar Instances

(JMLR 2023) Overcoming Language Priors for Visual Question Answering via Loss Rebalancing Label and Global Context

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Awesome-Modality-Priors-in-MLLMs

⭐ If you find this list useful, welcome to star it!

Paper List (Updating...)

Benchmark & Dataset

Evaluation

Mitigation

Related Works

First Paper of Language Priors in Multimodality

Previous Study in VQA

Files

README.md

Latest commit

History

README.md

File metadata and controls

Awesome-Modality-Priors-in-MLLMs

⭐ If you find this list useful, welcome to star it!

Paper List (Updating...)

Benchmark & Dataset

Evaluation

Mitigation

Related Works

First Paper of Language Priors in Multimodality

Previous Study in VQA