Unveiling Typographic Deceptions: Insights of the Typographic Vulnerability in Large Vision-Language Model
Hao Cheng*, Erjia Xiao*, Jindong Gu, Le Yang, Jinhao Duan, Jize Zhang, Jiahang Cao, Kaidi Xu, Renjing Xu†
HKUST & University of Oxford & Drexel University & Xi'an Jiaotong University
(*Equal contribution; †Corresponding author)
- Please follow the instructions in LLaVA, InstructBLIP, and MiniGPT-4 to set up the codebase, model weights, and conda environment for further experiments.
- Download the Typographic Dataset (a sketch of what a typographic sample looks like follows these steps).
- Clone this repository into the codebase mentioned above. For instance, after installing LLaVA:

```bash
cd LLaVA
git clone https://github.com/ChaduCheng/TypoDeceptions.git
```
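
To make the dataset format concrete, the sketch below shows how a typographic attack sample can be constructed: a misleading class label is simply rendered on top of an otherwise ordinary image. This is a minimal illustration under stated assumptions, not the repository's generation code; the file names, label text, font, and placement are placeholders.

```python
from PIL import Image, ImageDraw, ImageFont

def add_typographic_text(image_path: str, text: str,
                         position=(10, 10), font_size=40) -> Image.Image:
    """Render a deceptive text label on top of an image."""
    image = Image.open(image_path).convert("RGB")
    draw = ImageDraw.Draw(image)
    try:
        # Any available TrueType font works; fall back to PIL's default.
        font = ImageFont.truetype("DejaVuSans-Bold.ttf", font_size)
    except OSError:
        font = ImageFont.load_default()
    draw.text(position, text, fill="white", font=font)
    return image

if __name__ == "__main__":
    # Hypothetical example: a dog photo deceptively labeled "cat".
    add_typographic_text("dog.jpg", "cat").save("dog_typo_cat.jpg")
```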
This repository builds on the following projects:

- LLaVA: Large Language and Vision Assistant
- MiniGPT-4: Enhancing Vision-Language Understanding with Advanced Large Language Models
- InstructBLIP: Towards General-purpose Vision-Language Models with Instruction Tuning
- CLIP: Learning Transferable Visual Models From Natural Language Supervision
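
Since CLIP serves as the vision encoder behind many of these models, its zero-shot predictions offer a quick view of the typographic vulnerability itself. Below is a minimal sketch, assuming the Hugging Face transformers package and the hypothetical image files from the snippet above; it is not the repository's evaluation pipeline. On vulnerable encoders, the overlaid word is expected to shift probability mass toward the deceptive class.

```python
import torch
from PIL import Image
from transformers import CLIPModel, CLIPProcessor

model_id = "openai/clip-vit-base-patch32"
model = CLIPModel.from_pretrained(model_id)
processor = CLIPProcessor.from_pretrained(model_id)

labels = ["a photo of a dog", "a photo of a cat"]

# Hypothetical files: a clean image and its typographic counterpart
# produced by the earlier snippet.
for name in ["dog.jpg", "dog_typo_cat.jpg"]:
    image = Image.open(name)
    inputs = processor(text=labels, images=image,
                       return_tensors="pt", padding=True)
    with torch.no_grad():
        probs = model(**inputs).logits_per_image.softmax(dim=-1)[0]
    print(name, {l: round(p.item(), 3) for l, p in zip(labels, probs)})
```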
If you find our work useful for your research and applications, please cite using this BibTeX:
```bibtex
@article{cheng2024unveiling,
  title={Unveiling Typographic Deceptions: Insights of the Typographic Vulnerability in Large Vision-Language Model},
  author={Cheng, Hao and Xiao, Erjia and Gu, Jindong and Yang, Le and Duan, Jinhao and Zhang, Jize and Cao, Jiahang and Xu, Kaidi and Xu, Renjing},
  journal={ECCV},
  year={2024}
}
```