Scaling Synthetic Data Creation with 1,000,000,000 Personas Paper • 2406.20094 • Published 4 days ago • 69
Instruction Pre-Training: Language Models are Supervised Multitask Learners Paper • 2406.14491 • Published 13 days ago • 75
Awesome SFT datasets Collection • A curated list of interesting datasets to fine-tune language models with. • 43 items • Updated Apr 12 • 99
The Instruction Hierarchy: Training LLMs to Prioritize Privileged Instructions Paper • 2404.13208 • Published Apr 19 • 38
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone Paper • 2404.14219 • Published Apr 22 • 240
Improving Factual Consistency of Text Summarization by Adversarially Decoupling Comprehension and Embellishment Abilities of LLMs Paper • 2310.19347 • Published Oct 30, 2023 • 1
Masked Thought: Simply Masking Partial Reasoning Steps Can Improve Mathematical Reasoning Learning of Language Models Paper • 2403.02178 • Published Mar 4 • 1
Fortify the Shortest Stave in Attention: Enhancing Context Awareness of Large Language Models for Effective Tool Use Paper • 2312.04455 • Published Dec 7, 2023 • 1
Best Practices and Lessons Learned on Synthetic Data for Language Models Paper • 2404.07503 • Published Apr 11 • 24
LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models Paper • 2309.12307 • Published Sep 21, 2023 • 84
OpenMoE: An Early Effort on Open Mixture-of-Experts Language Models Paper • 2402.01739 • Published Jan 29 • 26
Large Language Models are Superpositions of All Characters: Attaining Arbitrary Role-play via Self-Alignment Paper • 2401.12474 • Published Jan 23 • 33
Zephyr 7B Collection • Models, datasets, and demos associated with Zephyr 7B. For code to train the models, see: https://github.com/huggingface/alignment-handbook • 9 items • Updated Apr 12 • 141
LoRA Fine-tuning Efficiently Undoes Safety Training in Llama 2-Chat 70B Paper • 2310.20624 • Published Oct 31, 2023 • 12
UniSA: Unified Generative Framework for Sentiment Analysis Paper • 2309.01339 • Published Sep 4, 2023 • 1
SpokenWOZ: A Large-Scale Speech-Text Benchmark for Spoken Task-Oriented Dialogue Agents Paper • 2305.13040 • Published May 22, 2023 • 1
Self-Explanation Prompting Improves Dialogue Understanding in Large Language Models Paper • 2309.12940 • Published Sep 22, 2023 • 2
Constructive Large Language Models Alignment with Diverse Feedback Paper • 2310.06450 • Published Oct 10, 2023 • 1