MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training Paper ā¢ 2403.09611 ā¢ Published Mar 14 ā¢ 124
Towards Bidirectional Human-AI Alignment: A Systematic Review for Clarifications, Framework, and Future Directions Paper ā¢ 2406.09264 ā¢ Published Jun 13 ā¢ 1
Discovering Language Model Behaviors with Model-Written Evaluations Paper ā¢ 2212.09251 ā¢ Published Dec 19, 2022 ā¢ 1
Constitutional AI: Harmlessness from AI Feedback Paper ā¢ 2212.08073 ā¢ Published Dec 15, 2022 ā¢ 2
Training language models to follow instructions with human feedback Paper ā¢ 2203.02155 ā¢ Published Mar 4, 2022 ā¢ 16
Truthful AI: Developing and governing AI that does not lie Paper ā¢ 2110.06674 ā¢ Published Oct 13, 2021 ā¢ 1
Would I Lie To You? Inference Time Alignment of Language Models using Direct Preference Heads Paper ā¢ 2405.20053 ā¢ Published May 30 ā¢ 2
Granite Code Models Collection A series of code models trained by IBM licensed under Apache 2.0 license. We release both the base pretrained and instruct models. ā¢ 23 items ā¢ Updated 19 days ago ā¢ 178
Meta Llama 3 Collection This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases ā¢ 5 items ā¢ Updated Sep 25 ā¢ 683
Orca-Math: Unlocking the potential of SLMs in Grade School Math Paper ā¢ 2402.14830 ā¢ Published Feb 16 ā¢ 24
OpenMath Collection A collection of models and datasets introduced in "OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset" ā¢ 15 items ā¢ Updated Oct 1 ā¢ 37