Llama 4 Collection Meta's new Llama 4 multimodal models, Scout & Maverick. Includes 16-bit, 8-bit and Dynamic 4-bit uploads. Fine-tune them with Unsloth! • 13 items • Updated about 2 hours ago • 21
ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning Paper • 2503.19470 • Published 13 days ago • 15
view article Article You could have designed state of the art positional encoding Nov 25, 2024 • 211
Gemma 3 QAT Collection Quantization Aware Trained (QAT) Gemma 3 checkpoints. The model preserves similar quality as half precision while using 3x less memory • 8 items • Updated 4 days ago • 94
view article Article Training and Finetuning Reranker Models with Sentence Transformers v4 12 days ago • 99
MoCha: Towards Movie-Grade Talking Character Synthesis Paper • 2503.23307 • Published 8 days ago • 92
CoRNStack Collection State-of-the-art code retrieval and re-ranking models and datasets • 9 items • Updated 11 days ago • 15
Qwen2.5-Omni Collection End-to-End Omni (text, audio, image, video, and natural speech interaction) model based Qwen2.5 • 3 items • Updated 11 days ago • 79
TxGemma Release Collection Collection of open models to accelerate the development of therapeutics. • 5 items • Updated 4 days ago • 43
💫StarVector Models Collection StarVector is a multimodal LLM for Scalable Vector Graphics (SVG) generation, producing structured SVG code directly from images and text. • 2 items • Updated 17 days ago • 90