Next-Token Prediction is All You Need
Python 989 27
Emu Series: Generative Multimodal Models from BAAI
Python 1.6k 85
EVA Series: Visual Representation Fantasies from BAAI
Python 2.3k 165
Painter & SegGPT Series: Vision Foundation Models from BAAI
Python 2.5k 171
[ICLR'24 Spotlight] Uni3D: 3D Visual Representation from BAAI
Python 473 29
[ECCV 2024] Tokenize Anything via Prompting
Jupyter Notebook 511 19
[NeurIPS'24 Spotlight] EVE: Encoder-Free Vision-Language Models
DenseFusion-1M: Merging Vision Experts for Comprehensive Multimodal Perception
Diffusion Feedback Helps CLIP See Better
[CVPR 2024] CapsFusion: Rethinking Image-Text Data at Scale
GeoDream: Disentangling 2D and Geometric Priors for High-Fidelity and Consistent 3D Generation