Collections
Collections including paper arxiv:2309.01809
- What Makes Good Data for Alignment? A Comprehensive Study of Automatic Data Selection in Instruction Tuning
  Paper • 2312.15685 • Published • 16
- mistralai/Mixtral-8x7B-Instruct-v0.1
  Text Generation • Updated • 455k • 3.86k
- microsoft/phi-2
  Text Generation • Updated • 645k • 3.17k
- TinyLlama/TinyLlama-1.1B-Chat-v1.0
  Text Generation • Updated • 603k • 975

- Are Emergent Abilities in Large Language Models just In-Context Learning?
  Paper • 2309.01809 • Published • 3
- Commonsense Knowledge Transfer for Pre-trained Language Models
  Paper • 2306.02388 • Published • 1
- Finding Neurons in a Haystack: Case Studies with Sparse Probing
  Paper • 2305.01610 • Published • 2
- Schema-learning and rebinding as mechanisms of in-context learning and emergence
  Paper • 2307.01201 • Published • 2

- Dissecting In-Context Learning of Translations in GPTs
  Paper • 2310.15987 • Published • 5
- In-Context Learning Creates Task Vectors
  Paper • 2310.15916 • Published • 39
- ZeroGen: Efficient Zero-shot Learning via Dataset Generation
  Paper • 2202.07922 • Published • 1
- Promptor: A Conversational and Autonomous Prompt Generation Agent for Intelligent Text Entry Techniques
  Paper • 2310.08101 • Published • 1

- The Reversal Curse: LLMs trained on "A is B" fail to learn "B is A"
  Paper • 2309.12288 • Published • 3
- Are Emergent Abilities in Large Language Models just In-Context Learning?
  Paper • 2309.01809 • Published • 3
- When Less is More: Investigating Data Pruning for Pretraining LLMs at Scale
  Paper • 2309.04564 • Published • 14
- CulturaX: A Cleaned, Enormous, and Multilingual Dataset for Large Language Models in 167 Languages
  Paper • 2309.09400 • Published • 77