- Beijing, CHINA
-
03:52
(UTC +08:00) - in/damingw0216
Pinned Loading
-
Image-Retrieval-From-Text
Image-Retrieval-From-Text PublicThis research project has constructed a two-stage clustering-based retrieval framework, as well as a deep learning-based retrieval algorithm using the CLIP model, which demonstrates zero-shot abiliβ¦
Python 3
-
UI-Img-Txt-Contrastive-Learning
UI-Img-Txt-Contrastive-Learning PublicAI4SE UI Multimodal Framework This framework introduces multimodal pretraining for software UIs in AI4SE, enabling tasks like UI captioning, retrieval, and localization. It uniquely combines Grad-β¦
Python 1
-
Data_Scoring_Model
Data_Scoring_Model PublicA demo implement of Quality Comparison Model. This Model aims to evalute image-text precomputed embeddings and sample better data for feeding LLM/MLLM training.
Python
-
LLaVA_Factory
LLaVA_Factory PublicThis work contributes on training/finetuning LLaVA-type MLLM. And tis work mainly borrows from TinyLLaVA_Factory.
Python
-
If the problem persists, check the GitHub status page or contact support.