Gemini Embedding: Generalizable Embeddings from Gemini Paper • 2503.07891 • Published 4 days ago • 25
view article Article Train 400x faster Static Embedding Models with Sentence Transformers Jan 15 • 159
view article Article Model2Vec: Distill a Small Fast Model from any Sentence Transformer By Pringled and 1 other • Oct 14, 2024 • 77
Multilingual Machine Translation with Open Large Language Models at Practical Scale: An Empirical Study Paper • 2502.02481 • Published Feb 4 • 10
view article Article Training and Finetuning Embedding Models with Sentence Transformers v3 May 28, 2024 • 193
view article Article Explore, Curate and Vector Search Any Hugging Face Dataset with Nomic Atlas By MaxNomic and 4 others • Jan 23 • 30
view article Article Fine-tune ModernBERT for RAG with Synthetic Data By sdiazlor and 2 others • Jan 20 • 37
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published Feb 4 • 203
view article Article Agentic RAG Stack (3/5) - Generate responses using a SmolLM By davidberenstein1957 • Feb 6 • 6
Adaptive Two-Phase Finetuning LLMs for Japanese Legal Text Retrieval Paper • 2412.13205 • Published Dec 3, 2024 • 1
view article Article KV Caching Explained: Optimizing Transformer Inference Efficiency By not-lain • Jan 30 • 36
view article Article Exploring Hard Negative Mining with NV-Retriever in Korean Financial Text By Albertmade and 1 other • Jan 12 • 12