This repo includes official implementations of ChatCAD and ChatCAD+
[Nature COMMSENG] ChatCAD: Interactive Computer-Aided Diagnosis on Medical Image using Large Language Models by Sheng Wang, Zihao Zhao, Xi Ouyang, Tianming Liu, Qian Wang, Dinggang Shen
(a) Overview of our proposed strategy. The image is processed by various networks to generate diverse outputs, which are then transformed into text descriptions. The descriptions, served as a link between visual and linguistic information, are combined as inputs to a large language model (LLM). With its ability to reason and its knowledge of the medical field, the LLM can provide a condensed report. (b) Interactive explanations and medical advice from ChatCAD.
[IEEE TMI] ChatCAD+: Towards a Reliable and Universal Interactive CAD using LLMs by Zihao Zhao*, Sheng Wang*, Jinchen Gu*, Yitao Zhu*, Lanzhuju Mei, Zixu Zhuang, Zhiming Cui, Qian Wang, Dinggang Shen
Overview of our proposed ChatCAD+ system. (a) For patients seeking a diagnosis, ChatCAD+ generates reliable medical reports based on the input medical image(s) by referring to local report database. (b) Additionally, for any inquiry from patients, ChatCAD+ retrieves related knowledge from online database and lets large language model generate reliable response.
This repository provides the official implementation of some components of ChatCAD+:
- Modality identification
- Chinese version Interactive CAD of Chest X-rays
- LLM-based knowledge retrieval
- An easy-deploy local web ui based on Gradio
- We would like to thank Merck Manual Professional who make all these medical knowledge public, we sorted their website for easier usage: here
- A BART-based model that has the capability to translate chest X-ray reports into Chinese well [link]
- R2GenCMN: r2gcmn_mimic-cxr.pth and annotation.json
- PCAM weights: JFchexpert.pth
- Place annotation.json under ./r2g/ and pre-trained weights under ./weights/
- For template retrieval system, please download MIMIC-CXR reports from official website and organize them into a dictionary, save as report_en_dict.json under the ./
You can either find them from original repository or download from Google Drive
- pip install -r requirements.txt
- implement web.py and load your openai api-key
If you find our work useful, please consider giving a star ⭐ and citation.
@article{wang2024interactive,
title={Interactive computer-aided diagnosis on medical image using large language models},
author={Wang, Sheng and Zhao, Zihao and Ouyang, Xi and Liu, Tianming and Wang, Qian and Shen, Dinggang},
journal={Communications Engineering},
volume={3},
number={1},
pages={133},
year={2024},
publisher={Nature Publishing Group UK London}
}
@article{zhao2024chatcad+,
title={Chatcad+: Towards a universal and reliable interactive cad using llms},
author={Zhao, Zihao and Wang, Sheng and Gu, Jinchen and Zhu, Yitao and Mei, Lanzhuju and Zhuang, Zixu and Cui, Zhiming and Wang, Qian and Shen, Dinggang},
journal={IEEE Transactions on Medical Imaging},
year={2024},
publisher={IEEE}
}
Our implementation (including coming version) is based on the following codebases. We gratefully thank the authors for their wonderful works.