Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models Paper • 2405.01535 • Published 5 days ago • 66
A Careful Examination of Large Language Model Performance on Grade School Arithmetic Paper • 2405.00332 • Published 6 days ago • 21
Better & Faster Large Language Models via Multi-token Prediction Paper • 2404.19737 • Published 7 days ago • 50
Replacing Judges with Juries: Evaluating LLM Generations with a Panel of Diverse Models Paper • 2404.18796 • Published 8 days ago • 58
AdvPrompter: Fast Adaptive Adversarial Prompting for LLMs Paper • 2404.16873 • Published 16 days ago • 22
OpenMath Collection A collection of models and datasets introduced in "OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset" • 15 items • Updated Feb 19 • 27
Textbooks Are All You Need II: phi-1.5 technical report Paper • 2309.05463 • Published Sep 11, 2023 • 84
view article Article Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models Mar 20 • 15
Insights into Alignment: Evaluating DPO and its Variants Across Multiple Tasks Paper • 2404.14723 • Published 14 days ago • 9
Layer Skip: Enabling Early Exit Inference and Self-Speculative Decoding Paper • 2404.16710 • Published 12 days ago • 53
OpenELM: An Efficient Language Model Family with Open-source Training and Inference Framework Paper • 2404.14619 • Published 15 days ago • 117
Top 10% instruction tuning datasets Collection Collects datasets with 'instruction' in the name and more than 1 download and in the top 10% for the number of likes • 13 items • Updated Sep 25, 2023 • 6
Judging LLM-as-a-judge with MT-Bench and Chatbot Arena Paper • 2306.05685 • Published Jun 9, 2023 • 20
Learning to Route Among Specialized Experts for Zero-Shot Generalization Paper • 2402.05859 • Published Feb 8 • 4
Instruction-Following Evaluation for Large Language Models Paper • 2311.07911 • Published Nov 14, 2023 • 17
view article Article Mergoo: Efficiently Build Your Own MoE LLM By alirezamsh • about 6 hours ago • 29
Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM Paper • 2403.07816 • Published Mar 12 • 37
Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences Paper • 2404.03715 • Published Apr 4 • 57
view article Article Orchestration of Experts: The First-Principle Multi-Model System By alirezamsh • 21 days ago • 8
Orca-Math: Unlocking the potential of SLMs in Grade School Math Paper • 2402.14830 • Published Feb 16 • 23
Leeroo Orchestrator: Elevating LLMs Performance Through Model Integration Paper • 2401.13979 • Published Jan 25 • 2
Awesome feedback datasets Collection A curated list of datasets with human or AI feedback. Useful for training reward models or applying techniques like DPO. • 19 items • Updated 25 days ago • 52
Awesome SFT datasets Collection A curated list of interesting datasets to fine-tune language models with. • 43 items • Updated 25 days ago • 88
SMaLL-100: Introducing Shallow Multilingual Machine Translation Model for Low-Resource Languages Paper • 2210.11621 • Published Oct 20, 2022 • 1
RQUGE: Reference-Free Metric for Evaluating Question Generation by Answering the Question Paper • 2211.01482 • Published Nov 2, 2022 • 1
Investigating Multi-Pivot Ensembling with Massively Multilingual Machine Translation Models Paper • 2311.07439 • Published Nov 13, 2023 • 1
What Do Compressed Multilingual Machine Translation Models Forget? Paper • 2205.10828 • Published May 22, 2022 • 1