Models
Datasets
Spaces
Posts
Docs
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2403.09629

Candidate papers to read in the H4 journal club

The Goldilocks of Pragmatic Understanding: Fine-Tuning Strategy Matters for Implicature Resolution by LLMs

Paper • 2210.14986 • Published Oct 26, 2022 • 4
Camels in a Changing Climate: Enhancing LM Adaptation with Tulu 2

Paper • 2311.10702 • Published Nov 17, 2023 • 18
Large Language Models as Optimizers

Paper • 2309.03409 • Published Sep 7, 2023 • 73
From Sparse to Dense: GPT-4 Summarization with Chain of Density Prompting

Paper • 2309.04269 • Published Sep 8, 2023 • 29

google/flan-t5-large

Text2Text Generation • Updated Jul 17, 2023 • 1.6M • • 505
deepseek-ai/deepseek-coder-6.7b-instruct

Text Generation • Updated Feb 2 • 145k • 328
Object Recognition as Next Token Prediction

Paper • 2312.02142 • Published Dec 4, 2023 • 11
colbert-ir/dspy-Oct11-T5-Large-MH-3k-v1

Text2Text Generation • Updated Oct 11, 2023 • 12 • 1

Research Papers

OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset

Paper • 2402.10176 • Published Feb 15 • 33
Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models

Paper • 2402.19427 • Published Feb 29 • 50
Beyond Language Models: Byte Models are Digital World Simulators

Paper • 2402.19155 • Published Feb 29 • 47
Matryoshka Representation Learning

Paper • 2205.13147 • Published May 26, 2022 • 7

Ada-Instruct: Adapting Instruction Generators for Complex Reasoning

Paper • 2310.04484 • Published Oct 6, 2023 • 4
Diversity of Thought Improves Reasoning Abilities of Large Language Models

Paper • 2310.07088 • Published Oct 11, 2023 • 4
Adapting Large Language Models via Reading Comprehension

Paper • 2309.09530 • Published Sep 18, 2023 • 74
Democratizing Reasoning Ability: Tailored Learning from Large Language Model

Paper • 2310.13332 • Published Oct 20, 2023 • 14

Large Language Models as Optimizers

Paper • 2309.03409 • Published Sep 7, 2023 • 73
Challenges and Applications of Large Language Models

Paper • 2307.10169 • Published Jul 19, 2023 • 47
Efficiently Modeling Long Sequences with Structured State Spaces

Paper • 2111.00396 • Published Oct 31, 2021 • 1
DreamCoder: Growing generalizable, interpretable knowledge with wake-sleep Bayesian program learning

Paper • 2006.08381 • Published Jun 15, 2020

Previous
1
2
3
Next

Company

© Hugging Face

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs