Searching for Better ViT Baselines Collection Exploring ViT hparams and model shapes for the GPU poor (between tiny and base). ā¢ 17 items ā¢ Updated about 9 hours ago ā¢ 8
view article Article StarCoder2-Instruct: Fully Transparent and Permissive Self-Alignment for Code Generation 22 days ago ā¢ 69
OpenELM: An Efficient Language Model Family with Open-source Training and Inference Framework Paper ā¢ 2404.14619 ā¢ Published 28 days ago ā¢ 120
T2I-Adapter-SDXL Collection The smallest and most efficient control models for SDXL! ā¢ 8 items ā¢ Updated Sep 8, 2023 ā¢ 23
Llama 2: Open Foundation and Fine-Tuned Chat Models Paper ā¢ 2307.09288 ā¢ Published Jul 18, 2023 ā¢ 235
DIY AI For Journalists Collection Compiling resources useful for journalists building prototypes with AI ā¢ 8 items ā¢ Updated Sep 18, 2023 ā¢ 10
OpenChat Collection OpenChat: Advancing Open-source Language Models with Mixed-Quality Data ā¢ 7 items ā¢ Updated Jan 10 ā¢ 31
FinGPT: Large Generative Models for a Small Language Paper ā¢ 2311.05640 ā¢ Published Nov 3, 2023 ā¢ 26
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model Paper ā¢ 2211.05100 ā¢ Published Nov 9, 2022 ā¢ 23