From Chunks to Blocks: Accelerating Uploads and Downloads on the Hub • 4 days ago • 42
Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference • Jan 16 • 67
Train 400x faster Static Embedding Models with Sentence Transformers • Jan 15 • 144
Releasing Outlines-core 0.1.0: structured generation in Rust and Python • Oct 22, 2024 • 44