Zhangzhe's Blog
The projection of my life.
MultiModal Tag
2024
10-11 LLaVA: Visual Instruction Tuning
10-11 Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection
10-11 CLIP: Learning Transferable Visual Models From Natural Language Supervision