Awesome Visual Embedding - a RhapsodyAI Collection

RhapsodyAI 's Collections

Awesome Visual Embedding

Awesome Visual Embedding

updated Jul 23, 2024

RhapsodyAI/MiniCPM-V-Embedding-preview

Feature Extraction • Updated Aug 20, 2024 • 64 • 52
vidore/colidefics

Updated Jul 11, 2024 • 3
vidore/colpali

Visual Document Retrieval • Updated Feb 5 • 81.6k • 440
Unifying Multimodal Retrieval via Document Screenshot Embedding

Paper • 2406.11251 • Published Jun 17, 2024 • 10
ColPali: Efficient Document Retrieval with Vision Language Models

Paper • 2407.01449 • Published Jun 27, 2024 • 48
Jina CLIP: Your CLIP Model Is Also Your Text Retriever

Paper • 2405.20204 • Published May 30, 2024 • 37
Understanding Retrieval Robustness for Retrieval-Augmented Image Captioning

Paper • 2406.02265 • Published Jun 4, 2024 • 7
Synthetic Multimodal Question Generation

Paper • 2407.02233 • Published Jul 2, 2024 • 1
RankCLIP: Ranking-Consistent Language-Image Pretraining

Paper • 2404.09387 • Published Apr 15, 2024