Unveiling Typographic Deceptions: Insights of the Typographic Vulnerability in Large Vision-Language Model, ECCV2024

Hao Cheng*, Erjia Xiao*, Jindong Gu, Le Yang, Jinhao Duan, Jize Zhang, Jiahang Cao, Kaidi Xu, Renjing Xu

HKUST & University of Oxford & Drexel University & Xi’an Jiaotong University

Installation

  1. Follow the installation instructions in the LLaVA, InstructBLIP, and MiniGPT-4 repositories to set up the codebase, model weights, and conda environment for each model you plan to experiment with.

  2. Download the Typographic Dataset.

  3. Clone this repository into the codebase set up in step 1. For example, after installing LLaVA:

cd LLaVA
git clone https://github.com/ChaduCheng/TypoDeceptions.git
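After cloning, it can help to confirm that the repository landed inside the codebase directory rather than next to it. The following is a minimal sketch, not part of this repository: the helper name `check_layout` is hypothetical, and it assumes only that `git clone` created its default `TypoDeceptions` directory inside the codebase folder (`LLaVA` in the example above).

```python
from pathlib import Path


def check_layout(codebase_root, repo_name="TypoDeceptions"):
    """Return a list of layout problems; an empty list means the layout looks right.

    Hypothetical helper: it only checks that directories exist, not their contents.
    """
    root = Path(codebase_root)
    problems = []
    if not root.is_dir():
        # The codebase itself (e.g. LLaVA) has not been set up at this path.
        problems.append(f"codebase directory not found: {root}")
    elif not (root / repo_name).is_dir():
        # The codebase exists, but this repository was not cloned inside it.
        problems.append(f"{repo_name} is not cloned inside {root}")
    return problems


if __name__ == "__main__":
    # "LLaVA" matches the example above; substitute the codebase you installed.
    for problem in check_layout("LLaVA"):
        print(problem)
```

Run the script from the directory that contains the codebase folder. It prints nothing when the layout matches the one described in step 3.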

Acknowledgement

  • LLaVA: Large Language and Vision Assistant
  • MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models
  • InstructBLIP: Towards General-purpose Vision-Language Models with Instruction Tuning
  • CLIP: Learning Transferable Visual Models From Natural Language Supervision

If you find our work useful for your research and applications, please cite using this BibTeX:

@article{cheng2024unveiling,
  title={Unveiling Typographic Deceptions: Insights of the Typographic Vulnerability in Large Vision-Language Model},
  author={Cheng, Hao and Xiao, Erjia and Gu, Jindong and Yang, Le and Duan, Jinhao and Zhang, Jize and Cao, Jiahang and Xu, Kaidi and Xu, Renjing},
  journal={ECCV},
  year={2024}
}
