Skip to content

Latest commit

 

History

History
32 lines (25 loc) · 2.49 KB

method.md

File metadata and controls

32 lines (25 loc) · 2.49 KB

Method

This section provides a summary of representative state-of-the-art (SOTA) algorithms for several Visual-Linguistic Reasoning task, such as visual question answering (VQA) and medical report generation tasks. All algorithms have been implemented using PyTorch. The CausalVLR library will be continuously updated in the coming years. In this section, we will provide a concise introduction to the selected algorithms.

Update News

🔥 2023.8.19.

  • v0.0.2 was released in 8/19/2023
  • Support CaCo-CoT for Faithful Reasoning task in LLMs

🔥 2023.6.29.

Model Zoo

Task Model Benchmark
Medical Report Generation VLCI IU-Xray, MIMIC-CXR
VQA CMCIR SUTD-TrafficQA, TGIF-QA, MSVD-QA, MSRVTT-QA
Visual Causal Scene Discovery VCSR NExT-QA, Causal-VidQA, and MSRVTT-QA
Model Generalization and Robustness Robust Fine-tuning ImageNet-V2, ImageNet-R, ImageNet-Sketch, ObjectNet, ImageNet-A
Causality-Aware Medical Diagnosis CAMDA MuZhi, DingXiang
Faithful Reasoning in LLMs CaCo-CoT ScienceQA, Com2Sense, BoolQ

Ongoing Projects