Collections

Collections including paper arxiv:2309.01809

about 9 hours ago

ibm/AttaQ

Viewer • Updated Jan 26 • 1.4k • 1.56k • 7
ibm/merlinite-7b

Text Generation • Updated Mar 5 • 10.9k • 101
microsoft/Orca-2-13b

Text Generation • Updated Nov 22, 2023 • 9.55k • 658
snorkelai/snorkel-curated-instruction-tuning

Preview • Updated Mar 11 • 7 • 9

AugmentedLearning

What Makes Good Data for Alignment? A Comprehensive Study of Automatic Data Selection in Instruction Tuning

Paper • 2312.15685 • Published Dec 25, 2023 • 17
mistralai/Mixtral-8x7B-Instruct-v0.1

Text Generation • Updated 19 days ago • 787k • 3.99k
microsoft/phi-2

Text Generation • Updated Apr 29 • 401k • 3.21k
TinyLlama/TinyLlama-1.1B-Chat-v1.0

Text Generation • Updated Mar 17 • 351k • 1.02k

Are Emergent Abilities in Large Language Models just In-Context Learning?

Paper • 2309.01809 • Published Sep 4, 2023 • 3
Commonsense Knowledge Transfer for Pre-trained Language Models

Paper • 2306.02388 • Published Jun 4, 2023 • 1
Finding Neurons in a Haystack: Case Studies with Sparse Probing

Paper • 2305.01610 • Published May 2, 2023 • 2
Schema-learning and rebinding as mechanisms of in-context learning and emergence

Paper • 2307.01201 • Published Jun 16, 2023 • 2

Dissecting In-Context Learning of Translations in GPTs

Paper • 2310.15987 • Published Oct 24, 2023 • 5
In-Context Learning Creates Task Vectors

Paper • 2310.15916 • Published Oct 24, 2023 • 39
ZeroGen: Efficient Zero-shot Learning via Dataset Generation

Paper • 2202.07922 • Published Feb 16, 2022 • 1
Promptor: A Conversational and Autonomous Prompt Generation Agent for Intelligent Text Entry Techniques

Paper • 2310.08101 • Published Oct 12, 2023 • 1
