Transformer - an ashishtanwer Collection

Transformer

updated 7 days ago

sentence-transformers/all-mpnet-base-v2

Sentence Similarity • Updated Nov 5 • 22.5M • 934
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

Paper • 1910.10683 • Published Oct 23, 2019 • 10
google-t5/t5-base

Translation • Updated Feb 14 • 2.27M • 646
Attention Is All You Need

Paper • 1706.03762 • Published Jun 12, 2017 • 49
nvidia/canary-1b

Automatic Speech Recognition • Updated May 8 • 77.4k • 334
google/paligemma-3b-mix-224

Image-Text-to-Text • Updated Jul 19 • 362k • 64
openai/clip-vit-large-patch14

Zero-Shot Image Classification • Updated Sep 15, 2023 • 28.3M • 1.53k
Learning Transferable Visual Models From Natural Language Supervision

Paper • 2103.00020 • Published Feb 26, 2021 • 11
openai/clip-vit-base-patch32

Zero-Shot Image Classification • Updated Feb 29 • 20.5M • 567
google/paligemma-3b-pt-224

Image-Text-to-Text • Updated Sep 21 • 25.6k • 274
Salesforce/blip2-opt-2.7b

Image-Text-to-Text • Updated about 1 month ago • 325k • 322
BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models

Paper • 2301.12597 • Published Jan 30, 2023 • 1
Salesforce/blip-image-captioning-large

Image-to-Text • Updated Dec 7, 2023 • 1.78M • 1.23k
BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

Paper • 2201.12086 • Published Jan 28, 2022 • 3
xGen-MM (BLIP-3): A Family of Open Large Multimodal Models

Paper • 2408.08872 • Published Aug 16 • 98
Salesforce/blip-vqa-base

Visual Question Answering • Updated Dec 7, 2023 • 236k • 136
openai/whisper-large-v3

Automatic Speech Recognition • Updated Aug 12 • 3.83M • 3.86k