diwank's Collections
Viewer
•
Updated
•
1.23k
•
4
ibm/merlinite-7b
Text Generation
•
Updated
•
15.9k
•
99
microsoft/Orca-2-13b
Text Generation
•
Updated
•
22k
•
649
snorkelai/snorkel-curated-instruction-tuning
Preview
•
Updated
•
2
•
9
corbyrosset/researchy_questions
Viewer
•
Updated
•
63
•
20
argilla/ultrafeedback-binarized-preferences
Viewer
•
Updated
•
3.42k
•
58
Viewer
•
Updated
•
73
•
7
microsoft/orca-math-word-problems-200k
Viewer
•
Updated
•
6.8k
•
327
m-a-p/CodeFeedback-Filtered-Instruction
Viewer
•
Updated
•
2.33k
•
89
Viewer
•
Updated
•
524
•
21
Viewer
•
Updated
•
5.77k
•
374
sanjay920/gemma-function-calling
Viewer
•
Updated
•
29
•
6
HuggingFaceH4/deita-10k-v0-sft
Viewer
•
Updated
•
1.73k
•
28
philschmid/slimorca-dedup-chatml
Viewer
•
Updated
•
2
Viewer
•
Updated
•
97
•
45
Viewer
•
Updated
•
1.86k
•
125
Preview
•
Updated
•
4
•
19
Preview
•
Updated
•
4
•
19
harpreetsahota/diverse-token-sampler
Viewer
•
Updated
•
3
harpreetsahota/Instruction-Following-Evaluation-for-Large-Language-Models
Viewer
•
Updated
•
26
•
5
harpreetsahota/elicit-bias-prompts
Viewer
•
Updated
•
3
harpreetsahota/elicit-offensive-language-prompts
Viewer
•
Updated
•
3
glaiveai/glaive-code-assistant
Viewer
•
Updated
•
839
•
70
argilla/OpenHermesPreferences
Viewer
•
Updated
•
39k
•
168
HuggingFaceH4/grok-conversation-harmless2
Viewer
•
Updated
•
4
•
6
NousResearch/func-calling-eval-singleturn
Viewer
•
Updated
•
28
•
4
Viewer
•
Updated
•
1.02k
•
161
Updated
•
292
•
21
iohadrubin/gorilla_openfunctions_yaml_train
Viewer
•
Updated
•
1
Viewer
•
Updated
•
11
Viewer
•
Updated
•
2
Viewer
•
Updated
•
8
nvidia/OpenMathInstruct-1
Viewer
•
Updated
•
2.33k
•
171
Viewer
•
Updated
•
2.81k
•
177
reasoning-machines/gsm-hard
Viewer
•
Updated
•
801
•
28
grimulkan/physical-reasoning
Viewer
•
Updated
•
97
•
6
alexredna/slim_orca_hermes_reasoning_sft
Viewer
•
Updated
•
2
lighteval/synthetic_reasoning_natural
Viewer
•
Updated
•
53
•
7
AtlasUnified/atlas-storyteller
Viewer
•
Updated
•
81
•
5
AtlasUnified/atlas-converse
Preview
•
Updated
•
5
•
3
AtlasUnified/Code-Instruct-Sets
Viewer
•
Updated
•
2
•
6
AtlasUnified/atlas-math-sets
Viewer
•
Updated
•
204
•
3
ProlificAI/social-reasoning-rlhf
Viewer
•
Updated
•
10
•
15
mamachang/medical-reasoning
Viewer
•
Updated
•
428
•
8
Viewer
•
Updated
•
1.04k
•
52
argilla/prompt-collective
Viewer
•
Updated
•
4
•
6
Viewer
•
Updated
•
12
•
10
Updated
•
27
•
10
Dahoas/code-review-instruct-critique-revision-python
Viewer
•
Updated
•
1
•
9
Viewer
•
Updated
•
63
•
3
Amod/mental_health_counseling_conversations
Viewer
•
Updated
•
4k
•
145
vibhorag101/phr_mental_therapy_dataset
Viewer
•
Updated
•
533
•
15
to-be/annomi-motivational-interviewing-therapy-conversations
Viewer
•
Updated
•
144
•
4
leonweber/teaching_motivational_quotes
Viewer
•
Updated
•
5
Viewer
•
Updated
•
24.8k
•
77
manishiitg/camel-ai-physics
Viewer
•
Updated
•
2
•
1
StarfleetAI/function-calling
Viewer
•
Updated
•
1
•
8
khaimaitien/multi-hop-qa-function-calling-format-V1.0
Viewer
•
Updated
•
1
•
5
flozi00/german-function-calling
Viewer
•
Updated
•
3
starsnatched/MemGPT-Functions-DPO
Viewer
•
Updated
•
35
•
7
hypervariance/function-calling-sharegpt
Viewer
•
Updated
•
189
•
20
togethercomputer/glaive-function-calling-v2-formatted
Viewer
•
Updated
•
407
•
22
bigcode/starcoder2-15b
Text Generation
•
Updated
•
36.1k
•
499
m-a-p/OpenCodeInterpreter-DS-33B
Text Generation
•
Updated
•
873
•
96
deepseek-ai/deepseek-coder-33b-instruct
Text Generation
•
Updated
•
23.5k
•
399
Viewer
•
Updated
•
2
ChuckMcSneed/various_RP_system_prompts
Viewer
•
Updated
•
15
Viewer
•
Updated
•
98
•
46
codellama/CodeLlama-70b-Python-hf
Text Generation
•
Updated
•
3.6k
•
104
codellama/CodeLlama-34b-Python-hf
Text Generation
•
Updated
•
6.53k
•
93
codeparrot/github-jupyter-parsed
Viewer
•
Updated
•
4
•
6
codeparrot/github-jupyter
Viewer
•
Updated
•
6
•
5
codeparrot/github-jupyter-text-code-pairs
Viewer
•
Updated
•
213
•
7
codeparrot/github-jupyter-code-to-text
Viewer
•
Updated
•
33
•
20
bigcode/jupyter-code-text-pairs
Viewer
•
Updated
•
2
•
7
JetBrains-Research/jupyter-errors-dataset
Viewer
•
Updated
•
1
Viewer
•
Updated
•
8
•
20
Locutusque/function-calling-chatml
Viewer
•
Updated
•
341
•
42
teknium/dataforge-economics
Viewer
•
Updated
•
152
•
45
Viewer
•
Updated
•
1
Viewer
•
Updated
•
3
Viewer
•
Updated
•
6
•
12
Viewer
•
Updated
•
1
unalignment/toxic-dpo-v0.1
Viewer
•
Updated
•
428
•
128
jondurbin/truthy-dpo-v0.1
Viewer
•
Updated
•
3.66k
•
98
cognitivecomputations/ultrachat-uncensored
Viewer
•
Updated
•
228
•
35
prometheus-eval/Feedback-Collection
Viewer
•
Updated
•
56
•
91
Viewer
•
Updated
•
26
•
14
Updated
•
5.06k
•
15
Viewer
•
Updated
•
18k
•
81
Heralax/Augmental-Dataset
Viewer
•
Updated
•
71
•
15
isaacrehg/poetry-instructions
Viewer
•
Updated
•
10
Viewer
•
Updated
•
142
•
19
matthh/gutenberg-poetry-corpus
Viewer
•
Updated
•
6
argilla/ultrafeedback-curated
Viewer
•
Updated
•
13
•
18
argilla/distilabel-math-preference-dpo
Viewer
•
Updated
•
368
•
61
alvarobartt/HelpSteer-AIF
Viewer
•
Updated
•
5
jamescalam/agent-conversations-retrieval-tool
Viewer
•
Updated
•
26
•
14
HydraLM/GPTeacher_toolformer_list_dict
Viewer
•
Updated
•
1
Viewer
•
Updated
•
2.55k
•
74
Viewer
•
Updated
•
115
•
13
Updated
•
13.2k
•
24
Viewer
•
Updated
•
5.95k
•
32
Viewer
•
Updated
•
1.62k
•
198
freecs/ArtificialThinkerSet
Viewer
•
Updated
•
9
Updated
•
1.47k
•
55
andersonbcdefg/synthetic_retrieval_tasks
Viewer
•
Updated
•
75
TuringsSolutions/NYTWritingStyleGuide
Updated
•
2
•
49
intfloat/llm-retriever-tasks
Viewer
•
Updated
•
8
athirdpath/DPO_Pairs-Roleplay-Alpaca-NSFW
Viewer
•
Updated
•
85
•
52
NeuralNovel/Creative-Logic-v1
Viewer
•
Updated
•
2
•
8
NeuralNovel/Neural-Story-v1
Viewer
•
Updated
•
7
•
8
nlpie/Llama2-MedTuned-Instructions
Viewer
•
Updated
•
155
•
37
Updated
•
610
•
55
Viewer
•
Updated
•
2
•
77
Viewer
•
Updated
•
47
•
18
jondurbin/contextual-dpo-v0.1
Viewer
•
Updated
•
168
•
27
unalignment/toxic-dpo-v0.2
Viewer
•
Updated
•
1.38k
•
81
Viewer
•
Updated
•
8
•
1
llm-blender/Unified-Feedback
Viewer
•
Updated
•
1.65k
•
15
Viewer
•
Updated
•
1.13k
•
24
chats-bug/input_tools_plans
Viewer
•
Updated
•
4
jvhoffbauer/gsm8k-toolcalls
Viewer
•
Updated
•
1
sam-mosaic/wiki-concept-gen-chatml
Viewer
•
Updated
•
1
Viewer
•
Updated
•
1
sam-mosaic/orca-gpt4-chatml
Viewer
•
Updated
•
5
Viewer
•
Updated
•
9
mohit-raghavendra/self-instruct-wikipedia
Viewer
•
Updated
•
1
NumbersStation/NSText2SQL
Viewer
•
Updated
•
240
•
64
alpayariyak/opencoder-instruct
Viewer
•
Updated
•
1
mlabonne/know_medical_dialogue_v2
Viewer
•
Updated
•
15
•
4
Viewer
•
Updated
•
19
•
11
Muennighoff/natural-instructions
Updated
•
1.66k
•
41
Viewer
•
Updated
•
114
•
8
Viewer
•
Updated
•
19
•
21
AlekseyKorshuk/chain-of-thoughts-chatml-deduplicated
Viewer
•
Updated
•
2
Viewer
•
Updated
•
2
Viewer
•
Updated
•
2
kaist-ai/Multilingual-CoT-Collection
Updated
•
19
•
18
causal-lm/cot_alpaca_gpt4
Viewer
•
Updated
•
1
Viewer
•
Updated
•
1
argilla/distilabel-capybara-dpo-7k-binarized
Viewer
•
Updated
•
15k
•
141
Norquinal/claude_evol_instruct_210k
Viewer
•
Updated
•
13
•
16
Norquinal/claude_multiround_chat_30k
Viewer
•
Updated
•
92
•
41
umd-zhou-lab/claude2_alpaca
Viewer
•
Updated
•
5
Norquinal/WizardLM_alpaca_claude_evol_instruct_70k
Viewer
•
Updated
•
10
Norquinal/claude_evol_instruct_100k
Viewer
•
Updated
•
7
Norquinal/claude_multi_instruct_1k
Viewer
•
Updated
•
12
Norquinal/claude_multiround_chat_1k
Viewer
•
Updated
•
10
•
8
Norquinal/claude_multi_instruct_30k
Viewer
•
Updated
•
885
•
8
Viewer
•
Updated
•
9
Updated
•
2
•
1
Viewer
•
Updated
•
2.29k
•
13
Viewer
•
Updated
•
2
dream-textures/textures-color-1k
Viewer
•
Updated
•
58
•
9
Viewer
•
Updated
•
57
•
32
renumics/food101-enriched
Updated
•
214
•
6
james-burton/wine_reviews_all_text
Viewer
•
Updated
•
129
•
3
Viewer
•
Updated
•
852
•
14
wellecks/naturalproofs-gen
Viewer
•
Updated
•
16
•
2
Viewer
•
Updated
•
20
•
44
Viewer
•
Updated
•
207
•
11
LangChainDatasets/agent-vectordb-qa-sota-pg
Viewer
•
Updated
•
2
•
3
LangChainDatasets/two-player-dnd
Viewer
•
Updated
•
4
LangChainDatasets/agent-search-calculator
Viewer
•
Updated
•
3
•
15
dim/essayforum_writing_prompts_6k
Viewer
•
Updated
•
3
lionelchg/dolly_creative_writing
Viewer
•
Updated
•
3
•
4
euclaise/WritingPrompts_curated
winglian/long-alpaca-4k-ctx
Viewer
•
Updated
•
2
Viewer
•
Updated
•
3
allenai/scifact_entailment
Viewer
•
Updated
•
3
hackaprompt/hackaprompt-dataset
Viewer
•
Updated
•
475
•
28
Preview
•
Updated
•
2
•
25
Viewer
•
Updated
•
3
gagan3012/Numerical_understanding
Viewer
•
Updated
•
1
Viewer
•
Updated
•
9
•
7
Viewer
•
Updated
•
3
•
2
TuringsSolutions/PlannerTrainingSet
Viewer
•
Updated
•
2
Viewer
•
Updated
•
13
•
13
BatsResearch/bonito-v1
Text2Text Generation
•
Updated
•
1.05k
•
78
deepseek-ai/deepseek-coder-33b-base
Text Generation
•
Updated
•
9.41k
•
61
m-a-p/OpenCodeInterpreter-CL-70B
Text Generation
•
Updated
•
8
•
25
m-a-p/OpenCodeInterpreter-SC2-15B
Text Generation
•
Updated
•
103
•
3
billxbf/rewoo-instruction-finetuning
Viewer
•
Updated
•
2
•
1
Viewer
•
Updated
•
3
reshinthadith/synthetic_program_synthesis_python_1M
Viewer
•
Updated
•
2
•
4
theblackcat102/multiround-programming-convo
Viewer
•
Updated
•
41
•
5
Viewer
•
Updated
•
112
•
63
matlok/multimodal-python-copilot-training-overview
Viewer
•
Updated
•
17
euclaise/mathoverflow-accepted
Viewer
•
Updated
•
3
CohereForAI/c4ai-command-r-v01
Text Generation
•
Updated
•
41.5k
•
997
BEE-spoke-data/coedit-reworded-deduped
Viewer
•
Updated
•
82
•
2
Viewer
•
Updated
•
63
•
41
Viewer
•
Updated
•
154
•
5
Viewer
•
Updated
•
217
•
5
imodels/multitask-tabular-datasets
msakarvadia/handwritten_multihop_reasoning_data
Viewer
•
Updated
•
4
Viewer
•
Updated
•
1.08k
•
21
alex43219/prolog-dataset-small-balanced
Viewer
•
Updated
•
1
Viewer
•
Updated
•
199
•
21
pszemraj/riddlesense_plusplus
Viewer
•
Updated
•
6
•
3
Viewer
•
Updated
•
121
•
4
Viewer
•
Updated
•
235
•
13
Viewer
•
Updated
•
1
•
9
euclaise/reddit-instruct-curated
Viewer
•
Updated
•
61
•
14
Viewer
•
Updated
•
3
kenhktsui/open-toolformer-retrieval
Viewer
•
Updated
•
6
•
6
Viewer
•
Updated
•
4
Updated
•
334
•
4
microsoft/Promptist
Text Generation
•
Updated
•
1.18k
•
61
Jellywibble/dalio_handwritten-conversations
Viewer
•
Updated
•
1
Viewer
•
Updated
•
49
•
31
Viewer
•
Updated
•
44
glaiveai/glaive-function-calling-v2
Viewer
•
Updated
•
1.71k
•
271
Viewer
•
Updated
•
826
•
6
declare-lab/InstructEvalImpact
Viewer
•
Updated
•
7
Updated
•
2
•
18
rewoo/planner_instruction_tuning_2k
Viewer
•
Updated
•
43
•
28
Viewer
•
Updated
•
26.1k
•
98
Viewer
•
Updated
•
4.11k
•
86
Viewer
•
Updated
•
14.2k
•
213
Viewer
•
Updated
•
10k
•
390
Viewer
•
Updated
•
2
•
60
chats-bug/agent_action_plan
Viewer
•
Updated
•
4
•
9
Viewer
•
Updated
•
259
•
128
Viewer
•
Updated
•
5
Viewer
•
Updated
•
6
euclaise/naturalinstructions2_preferences
Viewer
•
Updated
•
1
euclaise/megacot_heuristic_filtered
Viewer
•
Updated
•
1
•
2
Viewer
•
Updated
•
1
euclaise/WritingPrompts_binarized
Viewer
•
Updated
•
1
euclaise/WritingPrompts_preferences
Viewer
•
Updated
•
6
Viewer
•
Updated
•
8
euclaise/gsm8k_self_correct
Viewer
•
Updated
•
2
euclaise/DirtyWritingPrompts
Viewer
•
Updated
•
1
Viewer
•
Updated
•
844
•
8
Viewer
•
Updated
•
4
•
2
Viewer
•
Updated
•
8
•
3
Viewer
•
Updated
•
10.4k
•
52
Viewer
•
Updated
•
1.89k
•
5
Viewer
•
Updated
•
11
•
5
Updated
•
13.1k
•
3
Viewer
•
Updated
•
403
•
4
Viewer
•
Updated
•
2
prometheus-eval/Perception-Collection
Viewer
•
Updated
•
6
lmsys/mt_bench_human_judgments
Updated
•
20.7k
•
76
arnoudbuzing/linear-equation-training
Viewer
•
Updated
•
2
•
1
arnoudbuzing/quadratic-equation-training
Viewer
•
Updated
•
2
•
1
arnoudbuzing/cubic-equation-training
Viewer
•
Updated
•
2
•
1
Viewer
•
Updated
•
12.8k
•
118
AGBonnet/augmented-clinical-notes
Viewer
•
Updated
•
183
•
12
HuggingFaceH4/orca_dpo_pairs
Viewer
•
Updated
•
1.66k
•
15
CohereForAI/aya_collection
Viewer
•
Updated
•
523
•
135
snorkelai/Snorkel-Mistral-PairRM-DPO
Text Generation
•
Updated
•
4.4k
•
103
snorkelai/Snorkel-Mistral-PairRM-DPO-Dataset
Viewer
•
Updated
•
453
•
33
NousResearch/Genstruct-7B
Text Generation
•
Updated
•
473
•
352
Preview
•
Updated
•
183
•
657
Viewer
•
Updated
•
865
•
93
deepseek-ai/deepseek-llm-67b-base
Text Generation
•
Updated
•
6.19k
•
102
Qwen/Qwen1.5-72B
Text Generation
•
Updated
•
7.7k
•
55
Viewer
•
Updated
•
197
•
18
GenVRadmin/Samvaad-Mixed-Language-2
Viewer
•
Updated
•
4
ResplendentAI/Alpaca_NSFW_Shuffled
Viewer
•
Updated
•
28
•
1
NousResearch/Nous-Hermes-2-Yi-34B
Text Generation
•
Updated
•
80.3k
•
230
OpenPipe/mistral-ft-optimized-1227
Text Generation
•
Updated
•
12.8k
•
77
Yi: Open Foundation Models by 01.AI
Paper
•
2403.04652
•
Published
•
58
open-phi/textbooks_grounded
Viewer
•
Updated
•
1
•
3
Viewer
•
Updated
•
26
•
77
dreamgen/opus-v1-34b
Text Generation
•
Updated
•
2.67k
•
15
roborovski/superprompt-v1
Text2Text Generation
•
Updated
•
4.56k
•
68
roborovski/synthetic-tool-calls-v2-dpo-pairs
Viewer
•
Updated
•
2
roborovski/synthetic-tool-calls-v2
Viewer
•
Updated
•
2
Crystalcareai/CodeFeedback-Alpaca
Viewer
•
Updated
•
3
Crystalcareai/slimorca-dedup-alpaca-100k
Viewer
•
Updated
•
9
•
1
Crystalcareai/synthetic_reasoning_natural_Alpaca_Combined
Viewer
•
Updated
•
20
•
1
Crystalcareai/truthyDPO-intel
Viewer
•
Updated
•
3
•
2
Crystalcareai/Natural-Instructions-Small-Alpaca
Viewer
•
Updated
•
3
•
2
Viewer
•
Updated
•
1.96k
•
27
Viewer
•
Updated
•
41
•
6
Crystalcareai/OH2.5strict
Viewer
•
Updated
•
1
🔥 chat-ui
Viewer
•
Updated
•
150
•
38
jondurbin/gutenberg-dpo-v0.1
Viewer
•
Updated
•
2.19k
•
60
jondurbin/cinematika-v0.1
Viewer
•
Updated
•
272
•
45
ParisNeo/lollms_aware_dataset
Viewer
•
Updated
•
4
grimulkan/LimaRP-augmented
Viewer
•
Updated
•
223
•
15
Viewer
•
Updated
•
10.9k
•
183
Viewer
•
Updated
•
24
Viewer
•
Updated
•
6.42k
•
347
Viewer
•
Updated
•
189
•
92
Viewer
•
Updated
•
24.2k
•
16
tinyBenchmarks/tinyWinogrande
Viewer
•
Updated
•
17.1k
•
1
tinyBenchmarks/tinyAI2_arc
Viewer
•
Updated
•
13.5k
•
3
tinyBenchmarks/tinyHellaswag
Viewer
•
Updated
•
8.64k
•
2
tinyBenchmarks/tinyTruthfulQA
Viewer
•
Updated
•
5.09k
•
2
tinyBenchmarks/tinyAlpacaEval
Viewer
•
Updated
•
42
•
2
Viewer
•
Updated
•
392
•
4
cognitivecomputations/samantha-data
Updated
•
56
•
106
roborovski/synthetic-tool-calls
Viewer
•
Updated
•
1
roborovski/glaive-tool-usage-dpo
Viewer
•
Updated
•
2
Viewer
•
Updated
•
2
roborovski/glaive-function-calling-v2-conversation
Viewer
•
Updated
•
2
Viewer
•
Updated
•
1
Viewer
•
Updated
•
239
•
8
vilm/Quyen-Pro-v0.1
Text Generation
•
Updated
•
713
•
8
01-ai/Yi-9B-200K
Text Generation
•
Updated
•
7.15k
•
71
coseal/CodeUltraFeedback_binarized
Viewer
•
Updated
•
103
•
11
Viewer
•
Updated
•
10
•
20
KTO: Model Alignment as Prospect Theoretic Optimization
Paper
•
2402.01306
•
Published
•
11
Viewer
•
Updated
•
4.09k
•
19
Rephrase and Respond: Let Large Language Models Ask Better Questions for Themselves
Paper
•
2311.04205
•
Published
•
5
Multilingual Instruction Tuning With Just a Pinch of Multilinguality
Paper
•
2401.01854
•
Published
•
9
Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models
Paper
•
2401.01335
•
Published
•
61
GAIA: a benchmark for General AI Assistants
Paper
•
2311.12983
•
Published
•
171
Self-Instruct: Aligning Language Model with Self Generated Instructions
Paper
•
2212.10560
•
Published
•
5
HuggingFaceH4/self-instruct-seed
Viewer
•
Updated
•
96
•
18
ToolTalk: Evaluating Tool-Usage in a Conversational Setting
Paper
•
2311.10775
•
Published
•
7
Dynamic Planning with a LLM
Paper
•
2308.06391
•
Published
•
2
FreedomIntelligence/SocraticChat
Viewer
•
Updated
•
5
Large Language Model as a User Simulator
Paper
•
2308.11534
•
Published
•
2
Natural Language Embedded Programs for Hybrid Language Symbolic Reasoning
Paper
•
2309.10814
•
Published
•
3
AlpaGasus: Training A Better Alpaca with Fewer Data
Paper
•
2307.08701
•
Published
•
21
Viewer
•
Updated
•
115
•
7
AgentTuning: Enabling Generalized Agent Abilities for LLMs
Paper
•
2310.12823
•
Published
•
33
Viewer
•
Updated
•
711
•
173
Diversity of Thought Improves Reasoning Abilities of Large Language Models
Paper
•
2310.07088
•
Published
•
4
SmartPlay : A Benchmark for LLMs as Intelligent Agents
Paper
•
2310.01557
•
Published
•
12
Large Language Models Cannot Self-Correct Reasoning Yet
Paper
•
2310.01798
•
Published
•
30
MINT: Evaluating LLMs in Multi-turn Interaction with Tools and Language Feedback
Paper
•
2309.10691
•
Published
•
4
LLM+P: Empowering Large Language Models with Optimal Planning Proficiency
Paper
•
2304.11477
•
Published
•
2
Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking
Paper
•
2403.09629
•
Published
•
54
SelfCheck: Using LLMs to Zero-Shot Check Their Own Step-by-Step Reasoning
Paper
•
2308.00436
•
Published
•
20
📢 UGI Leaderboard
MuSR: Testing the Limits of Chain-of-thought with Multistep Soft Reasoning
Paper
•
2310.16049
•
Published
•
3
Instruction-Following Evaluation for Large Language Models
Paper
•
2311.07911
•
Published
•
17
Viewer
•
Updated
•
7
•
4
UNcommonsense Reasoning: Abductive Reasoning about Uncommon Situations
Paper
•
2311.08469
•
Published
•
10
Flows: Building Blocks of Reasoning and Collaborating AI
Paper
•
2308.01285
•
Published
•
2
aiflows/CCFlows
Learning to Reason and Memorize with Self-Notes
Paper
•
2305.00833
•
Published
•
4
Verify-and-Edit: A Knowledge-Enhanced Chain-of-Thought Framework
Paper
•
2305.03268
•
Published
•
2
Making Large Language Models Better Reasoners with Alignment
Paper
•
2309.02144
•
Published
•
2
Reason for Future, Act for Now: A Principled Framework for Autonomous LLM Agents with Provable Sample Efficiency
Paper
•
2309.17382
•
Published
•
4
ALERT: Adapting Language Models to Reasoning Tasks
Paper
•
2212.08286
•
Published
•
2
CodeIt: Self-Improving Language Models with Prioritized Hindsight Replay
Paper
•
2402.04858
•
Published
•
13
Viewer
•
Updated
•
44
•
10
LLMs in the Imaginarium: Tool Learning through Simulated Trial and Error
Paper
•
2403.04746
•
Published
•
21
Learning to Decode Collaboratively with Multiple Language Models
Paper
•
2403.03870
•
Published
•
17
Large Language Models as Zero-shot Dialogue State Tracker through Function Calling
Paper
•
2402.10466
•
Published
•
16
SynthDST: Synthetic Data is All You Need for Few-Shot Dialog State Tracking
Paper
•
2402.02285
•
Published
•
1
When Scaling Meets LLM Finetuning: The Effect of Data, Model and Finetuning Method
Paper
•
2402.17193
•
Published
•
23
Towards Optimal Learning of Language Models
Paper
•
2402.17759
•
Published
•
16
Evaluating Very Long-Term Conversational Memory of LLM Agents
Paper
•
2402.17753
•
Published
•
17
Viewer
•
Updated
•
1
Generative Representational Instruction Tuning
Paper
•
2402.09906
•
Published
•
50
Synthetic Data (Almost) from Scratch: Generalized Instruction Tuning for Language Models
Paper
•
2402.13064
•
Published
•
45
OpenCodeInterpreter: Integrating Code Generation with Execution and Refinement
Paper
•
2402.14658
•
Published
•
77
Beyond A*: Better Planning with Transformers via Search Dynamics Bootstrapping
Paper
•
2402.14083
•
Published
•
43
PerLTQA: A Personal Long-Term Memory Dataset for Memory Classification, Retrieval, and Synthesis in Question Answering
Paper
•
2402.16288
•
Published
•
1
FarReelAILab/Machine_Mindset_MBTI_dataset
Viewer
•
Updated
•
36
Viewer
•
Updated
•
1.62k
•
254
totally-not-an-llm/sharegpt-hyperfiltered-3k
Viewer
•
Updated
•
62
•
11
Viewer
•
Updated
•
14.3k
•
483
argilla/ultrafeedback-binarized-preferences-cleaned
Viewer
•
Updated
•
7.39k
•
97
dmayhem93/self-critiquing-refine
Viewer
•
Updated
•
1
dmayhem93/self-critiquing-critique-and-refine
Viewer
•
Updated
•
1
morzecrew/RefinedPersonaChat
Viewer
•
Updated
•
23
•
2
beratcmn/rephrased-instruction-turkish-poems
Viewer
•
Updated
•
4
Birchlabs/openai-prm800k-stepwise-critic
Viewer
•
Updated
•
3.59k
•
16
theblackcat102/evol-codealpaca-v1
Viewer
•
Updated
•
930
•
135
Viewer
•
Updated
•
13
•
11
Viewer
•
Updated
•
936
•
18
glaiveai/glaive-code-assistant-v2
Viewer
•
Updated
•
198
•
40
Towards General Computer Control: A Multimodal Agent for Red Dead Redemption II as a Case Study
Paper
•
2403.03186
•
Published
•
3
PROC2PDDL: Open-Domain Planning Representations from Texts
Paper
•
2403.00092
•
Published
•
1
btan2/cappy-large
Text Classification
•
Updated
•
272
•
19
kaist-ai/mistral-orpo-beta
Text Generation
•
Updated
•
2.69k
•
34
mixedbread-ai/mxbai-colbert-large-v1
Updated
•
9.04k
•
38
Viewer
•
Updated
•
156
•
39
ContextualAI/Contextual_KTO_Mistral_PairRM
Text Generation
•
Updated
•
3.08k
•
24
QizhiPei/BioT5_finetune_dataset
Viewer
•
Updated
•
8
•
8
GenVRadmin/Aryabhatta-Orca-Maths-Hindi
Viewer
•
Updated
•
3
Corex: Pushing the Boundaries of Complex Reasoning through Multi-Model Collaboration
Paper
•
2310.00280
•
Published
•
3
JARVIS-1: Open-World Multi-task Agents with Memory-Augmented Multimodal Language Models
Paper
•
2311.05997
•
Published
•
34
Updated
•
14.3k
•
9
argilla/zephyr-7b-spin-iter3-v0
Text Generation
•
Updated
•
4
•
9
argilla/distilabel-capybara-kto-15k-binarized
Viewer
•
Updated
•
82
•
4
argilla/ultrafeedback-binarized-preferences-cleaned-kto
Viewer
•
Updated
•
48
•
4
argilla/distilabel-intel-orca-kto
Viewer
•
Updated
•
10
•
4
Viewer
•
Updated
•
15
•
9
KnutJaegersberg/dolphin_orca_clustered
GAIR/autoj-13b
Text Generation
•
Updated
•
96
•
8
GAIR/autoj-scenario-classifier
Text Generation
•
Updated
•
12
•
3
Orca 2: Teaching Small Language Models How to Reason
Paper
•
2311.11045
•
Published
•
68
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
Paper
•
2403.03507
•
Published
•
172
Ask Optimal Questions: Aligning Large Language Models with Retriever's Preference in Conversational Search
Paper
•
2402.11827
•
Published
•
1
Grounding Language Model with Chunking-Free In-Context Retrieval
Paper
•
2402.09760
•
Published
Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models
Paper
•
2403.12881
•
Published
•
14
Preview
•
Updated
•
6
Preview
•
Updated
•
81
•
41
Viewer
•
Updated
•
13
•
9
llava-hf/llava-v1.6-mistral-7b-hf
Image-Text-to-Text
•
Updated
•
3.68M
•
120
fabiochiu/medium-articles
Preview
•
Updated
•
123
•
18
Reverse Training to Nurse the Reversal Curse
Paper
•
2403.13799
•
Published
•
12
Preview
•
Updated
•
25
•
4
Viewer
•
Updated
•
2.7k
•
2
BAAI/bge-reranker-v2-m3
Text Classification
•
Updated
•
603k
•
75
Nexusflow/Starling-RM-34B
Updated
•
2.9k
•
69
Viewer
•
Updated
•
4.28k
•
39
Sorted LLaMA: Unlocking the Potential of Intermediate Layers of Large Language Models for Dynamic Inference Using Sorted Fine-Tuning (SoFT)
Paper
•
2309.08968
•
Published
•
22
Salesforce/xLAM-v0.1-r
Text Generation
•
Updated
•
2.94k
•
17
In-Context Learning Creates Task Vectors
Paper
•
2310.15916
•
Published
•
39
Are Emergent Abilities in Large Language Models just In-Context Learning?
Paper
•
2309.01809
•
Published
•
3
kaist-ai/mistral-orpo-capybara-7k
Text Generation
•
Updated
•
2.07k
•
26
prometheus-eval/prometheus-7b-v2.0
Text2Text Generation
•
Updated
•
4.34k
•
39
mistral-community/Mistral-7B-v0.2
Text Generation
•
Updated
•
50k
•
221
Preview
•
Updated
•
46
•
67
LLM Agent Operating System
Paper
•
2403.16971
•
Published
•
62
ORPO: Monolithic Preference Optimization without Reference Model
Paper
•
2403.07691
•
Published
•
54
princeton-nlp/QuRatedPajama-260B
Viewer
•
Updated
•
9
•
5
princeton-nlp/QuRater-1.3B
Text Classification
•
Updated
•
52
•
6
Arcee's MergeKit: A Toolkit for Merging Large Language Models
Paper
•
2403.13257
•
Published
•
16
Can large language models explore in-context?
Paper
•
2403.15371
•
Published
•
30
Locutusque/OpenCerebrum-dpo
Viewer
•
Updated
•
13
•
6
Doctor-Shotgun/theory-of-mind-dpo
Viewer
•
Updated
•
53
•
12
Viewer
•
Updated
•
15
•
4
Viewer
•
Updated
•
10
•
13
KrisPi/PythonTutor-Evol-1k-DPO-GPT4_vs_35
Viewer
•
Updated
•
3
•
9
zerolink/zsql-postgres-dpo
Viewer
•
Updated
•
6
Lakera/gandalf_ignore_instructions
Viewer
•
Updated
•
648
•
19
mrm8488/unnatural-instructions-full
Viewer
•
Updated
•
20
•
13
Updated
•
12
•
1
databricks/dbrx-base
Text Generation
•
Updated
•
2.17k
•
540
databricks/dbrx-instruct
Text Generation
•
Updated
•
30.1k
•
1.07k
ai21labs/Jamba-v0.1
Text Generation
•
Updated
•
42.4k
•
1.12k
NilanE/SmallParallelDocs-Ja_En-6k
Viewer
•
Updated
•
2
Long-form factuality in large language models
Paper
•
2403.18802
•
Published
•
23
NousResearch/OLMo-Bitnet-1B
Text Generation
•
Updated
•
393
•
105
pyp1/VoiceCraft
Text-to-Speech
•
Updated
•
322
•
198
CarperAI/openai_summarize_comparisons
Viewer
•
Updated
•
2.39k
•
33
Updated
•
766
•
181
ivanleomk/gpt4-chain-of-density
Preview
•
Updated
•
6
•
6
AIRI-NLP/cnli_memory_extracted
Viewer
•
Updated
•
1
PowerInfer/Bamboo-base-v0_1
Feature Extraction
•
Updated
•
11
•
20
Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs
Paper
•
2311.05657
•
Published
•
26
openbmb/UltraInteract_sft
Viewer
•
Updated
•
766
•
87
openbmb/UltraInteract_pair
Viewer
•
Updated
•
903
•
82
openbmb/Eurus-70b-sft
Text Generation
•
Updated
•
513
•
4
openbmb/Eurus-70b-nca
Text Generation
•
Updated
•
272
•
10
Noise Contrastive Alignment of Language Models with Explicit Rewards
Paper
•
2402.05369
•
Published
•
1
mlabonne/Jambatypus-v0.1
Text Generation
•
Updated
•
9
•
36
ai2lumos/lumos_multimodal_ground_iterative
Viewer
•
Updated
•
1
ai2lumos/lumos_multimodal_plan_iterative
Viewer
•
Updated
•
2
ai2lumos/lumos_complex_qa_plan_onetime
Viewer
•
Updated
•
4
•
3
ai2lumos/lumos_complex_qa_ground_onetime
Viewer
•
Updated
•
3
ai2lumos/lumos_complex_qa_ground_iterative
Viewer
•
Updated
•
19
•
2
ai2lumos/lumos_unified_plan_iterative
Viewer
•
Updated
•
2
ai2lumos/lumos_complex_qa_plan_iterative
Viewer
•
Updated
•
18
•
2
ai2lumos/lumos_unified_ground_iterative
Viewer
•
Updated
•
2
ai2lumos/lumos_web_agent_ground_iterative
Viewer
•
Updated
•
69
•
2
ai2lumos/lumos_web_agent_plan_iterative
Viewer
•
Updated
•
46
•
3
ai2lumos/lumos_maths_ground_iterative
Viewer
•
Updated
•
5
•
1
ai2lumos/lumos_maths_ground_onetime
Viewer
•
Updated
•
1
ai2lumos/lumos_maths_plan_onetime
Viewer
•
Updated
•
2
Symbol-LLM/Symbol-LLM-7B-Instruct
Text Generation
•
Updated
•
4
•
9
MoritzLaurer/deberta-v3-large-zeroshot-v2.0
Zero-Shot Classification
•
Updated
•
868k
•
37
MoritzLaurer/bge-m3-zeroshot-v2.0
Zero-Shot Classification
•
Updated
•
3.79k
•
16
What Makes Good Data for Alignment? A Comprehensive Study of Automatic Data Selection in Instruction Tuning
Paper
•
2312.15685
•
Published
•
16
Qwen/Qwen1.5-32B
Text Generation
•
Updated
•
7.75k
•
71
Viewer
•
Updated
•
3
Viewer
•
Updated
•
34
•
1
vicgalle/configurable-system-prompt-multitask
Viewer
•
Updated
•
100
•
10
paraloq/json_data_extraction
Viewer
•
Updated
•
103
•
11
Viewer
•
Updated
•
3
iamtarun/python_code_instructions_18k_alpaca
Viewer
•
Updated
•
3.46k
•
120
LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement
Paper
•
2403.15042
•
Published
•
24
Viewer
•
Updated
•
7
•
1
manishiitg/CogStack-Tasks
Viewer
•
Updated
•
14
•
1
Viewer
•
Updated
•
7
•
1
Paper
•
2402.12219
•
Published
•
15
Viewer
•
Updated
•
468
•
16
HuggingFaceH4/zephyr-orpo-141b-A35b-v0.1
Text Generation
•
Updated
•
2.16k
•
230
Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention
Paper
•
2404.07143
•
Published
•
92
Viewer
•
Updated
•
360
•
162
Rho-1: Not All Tokens Are What You Need
Paper
•
2404.07965
•
Published
•
79
Viewer
•
Updated
•
109
•
6
OpenLLM-France/Claire-Dialogue-French-0.1
Viewer
•
Updated
•
6
•
33
Viewer
•
Updated
•
287
•
13
Viewer
•
Updated
•
1.08k
•
81
Updated
•
8.36k
•
199
Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences
Paper
•
2404.03715
•
Published
•
57
Learn Your Reference Model for Real Good Alignment
Paper
•
2404.09656
•
Published
•
79
CantTalkAboutThis: Aligning Language Models to Stay on Topic in Dialogues
Paper
•
2404.03820
•
Published
•
20
mlabonne/orpo-dpo-mix-40k
Viewer
•
Updated
•
11.7k
•
152
Viewer
•
Updated
•
769
•
271
FreedomIntelligence/evol-instruct-hindi
Viewer
•
Updated
•
1
FreedomIntelligence/OVM-process
Viewer
•
Updated
•
1
Viewer
•
Updated
•
6.43k
•
11
totally-not-an-llm/EverythingLM-data-V3
Viewer
•
Updated
•
85
•
30
RUCAIBox/Story-Generation
Updated
•
18
•
10
Viewer
•
Updated
•
2
•
2
jerryjalapeno/nart-100k-synthetic
Viewer
•
Updated
•
159
•
36
Sao10K/Claude-3-Opus-Instruct-15K
Updated
•
60
•
75
Viewer
•
Updated
•
987
•
28
euclaise/ReMask-3B
Text Generation
•
Updated
•
2.72k
•
13
namespace-Pt/activation-beacon-mistral-7b
Text Generation
•
Updated
•
357
•
1
google/Synthetic-Persona-Chat
Viewer
•
Updated
•
210
•
36
gradientai/Llama-3-8B-Instruct-Gradient-1048k
Text Generation
•
Updated
•
30.3k
•
557
neural-bridge/rag-dataset-12000
Viewer
•
Updated
•
3.05k
•
67
HannahRoseKirk/prism-alignment
Viewer
•
Updated
•
429
•
23
Gigax/NPC-LLM-3_8B
Text Generation
•
Updated
•
79
•
18
Viewer
•
Updated
•
171
•
4
cognitivecomputations/SystemChat-1.2
Viewer
•
Updated
•
5
•
6
mlabonne/arena-preferences
Viewer
•
Updated
•
157
•
8
INTERS: Unlocking the Power of Large Language Models in Search with Instruction Tuning
Paper
•
2401.06532
•
Published
•
10
Flexibly Scaling Large Language Models Contexts Through Extensible Tokenization
Paper
•
2401.07793
•
Published
•
3
Preview
•
Updated
•
12
•
8
winglian/Llama-3-8b-64k-PoSE
Text Generation
•
Updated
•
4.23k
•
68
THUDM/CogAgent
Updated
•
12
urchade/gliner_large-v2.1
Token Classification
•
Updated
•
10.9k
•
11
Viewer
•
Updated
•
18
•
25
nvidia/ChatQA-Training-Data
Viewer
•
Updated
•
2.17k
•
108
Viewer
•
Updated
•
181
•
28
Efficient-Large-Model/Llama-3-VILA1.5-8B
Text Generation
•
Updated
•
3.64k
•
9
tenyx/Llama3-TenyxChat-70B
Text Generation
•
Updated
•
1.69k
•
55
ibm-granite/granite-34b-code-base
Text Generation
•
Updated
•
456
•
12
Viewer
•
Updated
•
33.7k
•
47
glaiveai/glaive-code-assistant-v3
Viewer
•
Updated
•
683
•
19
Viewer
•
Updated
•
9
•
7
davanstrien/cosmopedia_chat
Viewer
•
Updated
•
7
Viewer
•
Updated
•
923
•
11
Viewer
•
Updated
•
1
Updated
•
2.07k
•
39
Viewer
•
Updated
•
3.41k
•
45
Viewer
•
Updated
•
301
•
11
Viewer
•
Updated
•
36
•
8
Viewer
•
Updated
•
6
MemGPT/MemGPT-DPO-Dataset
Viewer
•
Updated
•
174
•
6
lmsys/lmsys-arena-human-preference-55k
Viewer
•
Updated
•
700
•
60
TIGER-Lab/MAmmoTH2-8x7B-Plus
Text Generation
•
Updated
•
193
•
6
princeton-nlp/QuRating-GPT3.5-Judgments
Viewer
•
Updated
•
12
•
4
princeton-nlp/AutoCompressor-Llama-2-7b-6k
Updated
•
1.34k
•
2
refuelai/Llama-3-Refueled
Text Generation
•
Updated
•
2.5k
•
164
Viewer
•
Updated
•
26
•
13
EleutherAI/lichess-puzzles
Viewer
•
Updated
•
571
•
18
Viewer
•
Updated
•
306
•
140
selfrag/selfrag_train_data
Viewer
•
Updated
•
407
•
56
Viewer
•
Updated
•
3.91k
•
44
Viewer
•
Updated
•
1.23k
•
104