Llama3-ChatQA-1.5 Collection Llama3-ChatQA-1.5 models excel at conversational question answering (QA) and retrieval-augmented generation (RAG). • 6 items • Updated Jul 17 • 39
💻 Local SmolLMs Collection SmolLM models in MLC, ONNX and GGUF format for local applications + in-browser demos • 14 items • Updated 29 days ago • 40
Papers about model merging Collection referenced in the mergekit repo: https://github.com/cg123/mergekit • 4 items • Updated Feb 13 • 14
Article DuckDB: run SQL queries on 50,000+ datasets on the Hugging Face Hub Jun 7, 2023 • 4
SteerLM Collection A collection of models and datasets relating to SteerLM and HelpSteer. • 7 items • Updated Jul 17 • 12
Indic Alpaca Datasets Collection This collection comprises Alpaca datasets that cover a wide range of Indian languages. • 18 items • Updated Mar 21 • 6
NuminaMath Collection Datasets and models for training SOTA math LLMs. See our GitHub for training & inference code: https://github.com/project-numina/aimo-progress-prize • 6 items • Updated Jul 21 • 53
Preference Datasets for DPO Collection This collection contains a list of curated preference datasets for DPO fine-tuning for intent alignment of LLMs • 7 items • Updated Jul 30 • 28
Exploring Design Choices for Building Language-Specific LLMs Paper • 2406.14670 • Published Jun 20 • 1
Article A Gentle Introduction to 8-bit Matrix Multiplication for transformers at scale using transformers, accelerate and bitsandbytes Aug 17, 2022 • 56
RomanSetu: Efficiently unlocking multilingual capabilities of Large Language Models via Romanization Paper • 2401.14280 • Published Jan 25 • 1
Zeroshot Classifiers Collection These are my current best zeroshot classifiers. Some of my older models are downloaded more often, but the models in this collection are newer/better. • 11 items • Updated Apr 3 • 103