Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference Paper • 2412.13663 • Published 8 days ago • 103
view article Article Zero to Hero with the TRL learning link bomb 💣 By burtenshaw • about 1 month ago • 4
The Big Benchmarks Collection Collection Gathering benchmark spaces on the hub (beyond the Open LLM Leaderboard) • 13 items • Updated Nov 18 • 176
LLM Reasoning Papers Collection Papers to improve reasoning capabilities of LLMs • 17 items • Updated 3 days ago • 91
Only-IF:Revealing the Decisive Effect of Instruction Diversity on Generalization Paper • 2410.04717 • Published Oct 7 • 17
Persian Models Collection This is the largest collection of Persian models available on Huggingface • 650 items • Updated 5 days ago • 4
xGen-MM (BLIP-3): A Family of Open Large Multimodal Models Paper • 2408.08872 • Published Aug 16 • 98
Improving Text Embeddings for Smaller Language Models Using Contrastive Fine-tuning Paper • 2408.00690 • Published Aug 1 • 23
view article Article Deploy hundreds of open source models on one GPU using LoRAX By macadeliccc • Jul 18 • 3
Llama 3.1 Collection This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated 20 days ago • 636
Product Catalog Generator Collection Product Catalog Generator for Persian products which is hosted by Basalam • 7 items • Updated Sep 7 • 8
PersianMind: A Cross-Lingual Persian-English Large Language Model Paper • 2401.06466 • Published Jan 12 • 3
LLaVa-NeXT Collection LLaVa-NeXT (also known as LLaVa-1.6) improves upon the 1.5 series by incorporating higher image resolutions and more reasoning/OCR datasets. • 8 items • Updated Jul 19 • 27