K - a diwank Collection

Try HuggingChat to chat with AI

jondurbin/py-dpo-v0.1

Viewer • Updated Jan 11, 2024 • 9.47k • 179 • 49

jondurbin/gutenberg-dpo-v0.1

Viewer • Updated Jan 12, 2024 • 918 • 1.56k • 136

jondurbin/cinematika-v0.1

Viewer • Updated Apr 11, 2024 • 47.1k • 679 • 54

ParisNeo/lollms_aware_dataset

Viewer • Updated Oct 27, 2023 • 464 • 169 • 5

grimulkan/LimaRP-augmented

Viewer • Updated Jan 24, 2024 • 804 • 47 • 29

TIGER-Lab/MathInstruct

Viewer • Updated May 15, 2024 • 262k • 6.41k • 273

christopher/rosetta-code

Viewer • Updated Sep 24, 2023 • 79k • 422 • 35

b-mc2/sql-create-context

Viewer • Updated Jan 25, 2024 • 78.6k • 7.79k • 451

migtissera/Synthia-v1.3

Viewer • Updated Nov 16, 2023 • 119k • 105 • 99

tinyBenchmarks/tinyMMLU

Viewer • Updated Jul 8, 2024 • 385 • 7.06k • 18

tinyBenchmarks/tinyWinogrande

Preview • Updated May 25, 2024 • 2.02k • 4

tinyBenchmarks/tinyAI2_arc

Preview • Updated May 25, 2024 • 1.32k • 3

tinyBenchmarks/tinyHellaswag

Viewer • Updated May 25, 2024 • 50k • 2.48k • 4

tinyBenchmarks/tinyTruthfulQA

Preview • Updated May 25, 2024 • 1.56k • 3

tinyBenchmarks/tinyAlpacaEval

Viewer • Updated Apr 19, 2024 • 100 • 209 • 5

tinyBenchmarks/tinyGSM8k

Preview • Updated May 25, 2024 • 1.12k • 5

cognitivecomputations/samantha-data

Updated Mar 29, 2024 • 1.24k • 127

roborovski/synthetic-tool-calls

Viewer • Updated Mar 5, 2024 • 6.01k • 106 • 1

roborovski/glaive-tool-usage-dpo

Viewer • Updated Feb 29, 2024 • 42k • 146 • 2

kalomaze/StackMix-v0.1

Viewer • Updated Feb 28, 2024 • 30 • 81 • 2

roborovski/glaive-function-calling-v2-conversation

Viewer • Updated Feb 19, 2024 • 113k • 53 • 2

mlabonne/truthy-dpo-v0.1

Viewer • Updated Feb 18, 2024 • 1.02k • 56 • 1

ai4bharat/indic-align

Viewer • Updated Jul 25, 2024 • 97.4M • 1.94k • 12

coseal/CodeUltraFeedback_binarized

Viewer • Updated Mar 18, 2024 • 9.5k • 192 • 17

coseal/CodeUltraFeedback

Viewer • Updated Mar 15, 2024 • 10k • 128 • 26

KTO: Model Alignment as Prospect Theoretic Optimization

Paper • 2402.01306 • Published Feb 2, 2024 • 16

ai4bharat/sangraha

Viewer • Updated 4 days ago • 268M • 23.2k • 39

Rephrase and Respond: Let Large Language Models Ask Better Questions for Themselves

Paper • 2311.04205 • Published Nov 7, 2023 • 5

Multilingual Instruction Tuning With Just a Pinch of Multilinguality

Paper • 2401.01854 • Published Jan 3, 2024 • 11

Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models

Paper • 2401.01335 • Published Jan 2, 2024 • 65

GAIA: a benchmark for General AI Assistants

Paper • 2311.12983 • Published Nov 21, 2023 • 192

Self-Instruct: Aligning Language Model with Self Generated Instructions

Paper • 2212.10560 • Published Dec 20, 2022 • 9

HuggingFaceH4/self-instruct-seed

Viewer • Updated Jan 31, 2023 • 175 • 105 • 27

ToolTalk: Evaluating Tool-Usage in a Conversational Setting

Paper • 2311.10775 • Published Nov 15, 2023 • 10

Dynamic Planning with a LLM

Paper • 2308.06391 • Published Aug 11, 2023 • 2

FreedomIntelligence/SocraticChat

Viewer • Updated Oct 12, 2023 • 50.7k • 91 • 9

Large Language Model as a User Simulator

Paper • 2308.11534 • Published Aug 21, 2023 • 2

Natural Language Embedded Programs for Hybrid Language Symbolic Reasoning

Paper • 2309.10814 • Published Sep 19, 2023 • 3

AlpaGasus: Training A Better Alpaca with Fewer Data

Paper • 2307.08701 • Published Jul 17, 2023 • 23

mlabonne/alpagasus

Viewer • Updated Aug 3, 2023 • 9.23k • 131 • 8

AgentTuning: Enabling Generalized Agent Abilities for LLMs

Paper • 2310.12823 • Published Oct 19, 2023 • 35

THUDM/AgentInstruct

Viewer • Updated Oct 23, 2023 • 1.87k • 425 • 203

Diversity of Thought Improves Reasoning Abilities of Large Language Models

Paper • 2310.07088 • Published Oct 11, 2023 • 5

SmartPlay : A Benchmark for LLMs as Intelligent Agents

Paper • 2310.01557 • Published Oct 2, 2023 • 13

Large Language Models Cannot Self-Correct Reasoning Yet

Paper • 2310.01798 • Published Oct 3, 2023 • 35

MINT: Evaluating LLMs in Multi-turn Interaction with Tools and Language Feedback

Paper • 2309.10691 • Published Sep 19, 2023 • 4

LLM+P: Empowering Large Language Models with Optimal Planning Proficiency

Paper • 2304.11477 • Published Apr 22, 2023 • 3

Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking

Paper • 2403.09629 • Published Mar 14, 2024 • 77

SelfCheck: Using LLMs to Zero-Shot Check Their Own Step-by-Step Reasoning

Paper • 2308.00436 • Published Aug 1, 2023 • 22

703

UGI Leaderboard

📢

Display a leaderboard with UGI scores

MuSR: Testing the Limits of Chain-of-thought with Multistep Soft Reasoning

Paper • 2310.16049 • Published Oct 24, 2023 • 4

Instruction-Following Evaluation for Large Language Models

Paper • 2311.07911 • Published Nov 14, 2023 • 20

allenai/UNcommonsense

Viewer • Updated Jan 19, 2024 • 18.3k • 246 • 10

UNcommonsense Reasoning: Abductive Reasoning about Uncommon Situations

Paper • 2311.08469 • Published Nov 14, 2023 • 11

Flows: Building Blocks of Reasoning and Collaborating AI

Paper • 2308.01285 • Published Aug 2, 2023 • 2

aiflows/CCFlows

Updated Dec 10, 2023 • 2

Learning to Reason and Memorize with Self-Notes

Paper • 2305.00833 • Published May 1, 2023 • 5

Verify-and-Edit: A Knowledge-Enhanced Chain-of-Thought Framework

Paper • 2305.03268 • Published May 5, 2023 • 2

Making Large Language Models Better Reasoners with Alignment

Paper • 2309.02144 • Published Sep 5, 2023 • 2

Reason for Future, Act for Now: A Principled Framework for Autonomous LLM Agents with Provable Sample Efficiency

Paper • 2309.17382 • Published Sep 29, 2023 • 5

ALERT: Adapting Language Models to Reasoning Tasks

Paper • 2212.08286 • Published Dec 16, 2022 • 2

CodeIt: Self-Improving Language Models with Prioritized Hindsight Replay

Paper • 2402.04858 • Published Feb 7, 2024 • 15

Vivacem/MMIQC

Viewer • Updated Jan 20, 2024 • 2.29M • 111 • 16

LLMs in the Imaginarium: Tool Learning through Simulated Trial and Error

Paper • 2403.04746 • Published Mar 7, 2024 • 24

Learning to Decode Collaboratively with Multiple Language Models

Paper • 2403.03870 • Published Mar 6, 2024 • 21

Large Language Models as Zero-shot Dialogue State Tracker through Function Calling

Paper • 2402.10466 • Published Feb 16, 2024 • 19

SynthDST: Synthetic Data is All You Need for Few-Shot Dialog State Tracking

Paper • 2402.02285 • Published Feb 3, 2024 • 1

When Scaling Meets LLM Finetuning: The Effect of Data, Model and Finetuning Method

Paper • 2402.17193 • Published Feb 27, 2024 • 25

Towards Optimal Learning of Language Models

Paper • 2402.17759 • Published Feb 27, 2024 • 18

Evaluating Very Long-Term Conversational Memory of LLM Agents

Paper • 2402.17753 • Published Feb 27, 2024 • 20

Aman279/Locomo

Viewer • Updated Mar 7, 2024 • 35 • 16 • 1

Generative Representational Instruction Tuning

Paper • 2402.09906 • Published Feb 15, 2024 • 54

Synthetic Data (Almost) from Scratch: Generalized Instruction Tuning for Language Models

Paper • 2402.13064 • Published Feb 20, 2024 • 48

OpenCodeInterpreter: Integrating Code Generation with Execution and Refinement

Paper • 2402.14658 • Published Feb 22, 2024 • 82

Beyond A*: Better Planning with Transformers via Search Dynamics Bootstrapping

Paper • 2402.14083 • Published Feb 21, 2024 • 48

PerLTQA: A Personal Long-Term Memory Dataset for Memory Classification, Retrieval, and Synthesis in Question Answering

Paper • 2402.16288 • Published Feb 26, 2024 • 1

pandalla/Machine_Mindset_MBTI_dataset

Viewer • Updated Jun 4, 2024 • 161k • 418 • 58

berkeley-nest/Nectar

Viewer • Updated Mar 20, 2024 • 183k • 1.83k • 288

totally-not-an-llm/sharegpt-hyperfiltered-3k

Viewer • Updated Jul 13, 2023 • 3.24k • 150 • 14

HuggingFaceTB/cosmopedia

Viewer • Updated Aug 12, 2024 • 31.1M • 22.8k • 591

argilla/ultrafeedback-binarized-preferences-cleaned

Viewer • Updated Dec 11, 2023 • 60.9k • 4.98k • 137

dmayhem93/self-critiquing-refine

Viewer • Updated Apr 8, 2023 • 39.2k • 46 • 1

dmayhem93/self-critiquing-critique-and-refine

Viewer • Updated Apr 8, 2023 • 39.2k • 52 • 1

morzecrew/RefinedPersonaChat

Viewer • Updated Aug 7, 2023 • 207k • 63 • 2

beratcmn/rephrased-instruction-turkish-poems

Viewer • Updated Dec 16, 2023 • 4.96k • 78 • 4

Birchlabs/openai-prm800k-stepwise-critic

Viewer • Updated Jun 3, 2023 • 1.09M • 143 • 44

theblackcat102/evol-codealpaca-v1

Viewer • Updated Mar 10, 2024 • 111k • 875 • 158

meta-math/GSM8K_Backward

Viewer • Updated Nov 10, 2023 • 1.27k • 169 • 16

meta-math/MetaMathQA-40K

Viewer • Updated Nov 10, 2023 • 40k • 459 • 24

glaiveai/glaive-code-assistant-v2

Viewer • Updated Apr 4, 2024 • 215k • 297 • 44

Towards General Computer Control: A Multimodal Agent for Red Dead Redemption II as a Case Study

Paper • 2403.03186 • Published Mar 5, 2024 • 5

PROC2PDDL: Open-Domain Planning Representations from Texts

Paper • 2403.00092 • Published Feb 29, 2024 • 1

btan2/cappy-large

Text Classification • Updated Dec 7, 2023 • 165 • 20

VMware/open-instruct

Viewer • Updated Jul 12, 2023 • 143k • 180 • 44

QizhiPei/BioT5_finetune_dataset

Viewer • Updated Sep 2, 2024 • 33 • 1.23k • 6

Tensoic/gooftagoo

Viewer • Updated Mar 16, 2024 • 16.2k • 93 • 9

GenVRadmin/Aryabhatta-Orca-Maths-Hindi

Viewer • Updated Mar 18, 2024 • 200k • 73 • 3

Corex: Pushing the Boundaries of Complex Reasoning through Multi-Model Collaboration

Paper • 2310.00280 • Published Sep 30, 2023 • 3

JARVIS-1: Open-World Multi-task Agents with Memory-Augmented Multimodal Language Models

Paper • 2311.05997 • Published Nov 10, 2023 • 37

wangwilliamyang/wikihow

Updated Jan 18, 2024 • 8

argilla/distilabel-capybara-kto-15k-binarized

Viewer • Updated Mar 19, 2024 • 15.1k • 94 • 5

argilla/ultrafeedback-binarized-preferences-cleaned-kto

Viewer • Updated Mar 19, 2024 • 231k • 309 • 9

argilla/distilabel-intel-orca-kto

Viewer • Updated Mar 19, 2024 • 23.1k • 90 • 7

argilla/kto-mix-15k

Viewer • Updated Apr 19, 2024 • 15.3k • 147 • 13

KnutJaegersberg/dolphin_orca_clustered

Updated Sep 14, 2023 • 64 • 1

GAIR/autoj-scenario-classifier

Text Generation • Updated Oct 9, 2023 • 206 • 5

Orca 2: Teaching Small Language Models How to Reason

Paper • 2311.11045 • Published Nov 18, 2023 • 73

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Paper • 2403.03507 • Published Mar 6, 2024 • 185

Ask Optimal Questions: Aligning Large Language Models with Retriever's Preference in Conversational Search

Paper • 2402.11827 • Published Feb 19, 2024 • 1

Grounding Language Model with Chunking-Free In-Context Retrieval

Paper • 2402.09760 • Published Feb 15, 2024

Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models

Paper • 2403.12881 • Published Mar 19, 2024 • 17

BAAI/OPI

Preview • Updated Nov 6, 2024 • 559 • 8

internlm/Agent-FLAN

Preview • Updated Mar 20, 2024 • 142 • 72

kaist-ai/selfee-train

Viewer • Updated May 31, 2023 • 178k • 123 • 9

fabiochiu/medium-articles

Preview • Updated Jul 17, 2022 • 412 • 23

Reverse Training to Nurse the Reversal Curse

Paper • 2403.13799 • Published Mar 20, 2024 • 13

voidful/MuSiQue

Preview • Updated May 20, 2023 • 7 • 4

BAAI/bge-reranker-v2-m3

Text Classification • Updated Jun 24, 2024 • 936k • • 548

allenai/reward-bench

Viewer • Updated Sep 9, 2024 • 8.11k • 7.09k • 87

Sorted LLaMA: Unlocking the Potential of Intermediate Layers of Large Language Models for Dynamic Inference Using Sorted Fine-Tuning (SoFT)

Paper • 2309.08968 • Published Sep 16, 2023 • 23

In-Context Learning Creates Task Vectors

Paper • 2310.15916 • Published Oct 24, 2023 • 43

Are Emergent Abilities in Large Language Models just In-Context Learning?

Paper • 2309.01809 • Published Sep 4, 2023 • 3

ZenMoore/RoleBench

Preview • Updated Nov 23, 2023 • 815 • 76

LLM Agent Operating System

Paper • 2403.16971 • Published Mar 25, 2024 • 66

ORPO: Monolithic Preference Optimization without Reference Model

Paper • 2403.07691 • Published Mar 12, 2024 • 65

princeton-nlp/QuRatedPajama-260B

Viewer • Updated Apr 16, 2024 • 254M • 5.23k • 7

Arcee's MergeKit: A Toolkit for Merging Large Language Models

Paper • 2403.13257 • Published Mar 20, 2024 • 20

Can large language models explore in-context?

Paper • 2403.15371 • Published Mar 22, 2024 • 33

Locutusque/OpenCerebrum-dpo

Viewer • Updated Mar 26, 2024 • 21.1k • 82 • 6

Doctor-Shotgun/theory-of-mind-dpo

Viewer • Updated Mar 14, 2024 • 539 • 76 • 16

Locutusque/arc-cot-dpo

Viewer • Updated Mar 26, 2024 • 957 • 94 • 6

fblgit/simple-math-DPO

Viewer • Updated Aug 1, 2024 • 800k • 300 • 17

KrisPi/PythonTutor-Evol-1k-DPO-GPT4_vs_35

Viewer • Updated Nov 18, 2023 • 943 • 78 • 14

zerolink/zsql-postgres-dpo

Viewer • Updated Feb 2, 2024 • 259k • 134 • 8

Lakera/gandalf_ignore_instructions

Viewer • Updated 9 days ago • 1k • 350 • 27

mrm8488/unnatural-instructions-full

Viewer • Updated Dec 21, 2022 • 66k • 118 • 16

NilanE/SmallParallelDocs-Ja_En-6k

Viewer • Updated Mar 5, 2024 • 6.32k • 160 • 2

Long-form factuality in large language models

Paper • 2403.18802 • Published Mar 27, 2024 • 25

NousResearch/OLMo-Bitnet-1B

Text Generation • Updated Apr 11, 2024 • 219 • 118

pyp1/VoiceCraft

Text-to-Speech • Updated Aug 21, 2024 • 58 • 212

CarperAI/openai_summarize_comparisons

Viewer • Updated Feb 27, 2023 • 260k • 2.3k • 40

PygmalionAI/PIPPA

Updated Sep 7, 2023 • 174 • 211

ivanleomk/gpt4-chain-of-density

Preview • Updated Nov 12, 2023 • 177 • 6

AIRI-NLP/cnli_memory_extracted

Viewer • Updated Mar 22, 2024 • 8.23k • 84 • 1

Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs

Paper • 2311.05657 • Published Nov 9, 2023 • 32

openbmb/UltraInteract_sft

Viewer • Updated Apr 5, 2024 • 289k • 14.5k • 122

openbmb/UltraInteract_pair

Viewer • Updated Apr 5, 2024 • 220k • 595 • 108

openbmb/Eurus-70b-nca

Text Generation • Updated Apr 12, 2024 • 27 • 11

Noise Contrastive Alignment of Language Models with Explicit Rewards

Paper • 2402.05369 • Published Feb 8, 2024 • 1

ai2lumos/lumos_multimodal_ground_iterative

Viewer • Updated Mar 19, 2024 • 15.9k • 81 • 1

ai2lumos/lumos_multimodal_plan_iterative

Viewer • Updated Mar 19, 2024 • 15.9k • 107 • 2

ai2lumos/lumos_complex_qa_plan_onetime

Viewer • Updated Mar 19, 2024 • 19.4k • 100 • 3

ai2lumos/lumos_complex_qa_ground_onetime

Viewer • Updated Mar 19, 2024 • 19.2k • 73 • 3

ai2lumos/lumos_complex_qa_ground_iterative

Viewer • Updated Mar 19, 2024 • 19.1k • 98 • 2

ai2lumos/lumos_unified_plan_iterative

Viewer • Updated Mar 19, 2024 • 55.4k • 118 • 2

ai2lumos/lumos_complex_qa_plan_iterative

Viewer • Updated Mar 18, 2024 • 19k • 77 • 6

ai2lumos/lumos_unified_ground_iterative

Viewer • Updated Mar 19, 2024 • 55.5k • 96 • 2

ai2lumos/lumos_web_agent_ground_iterative

Viewer • Updated Mar 18, 2024 • 1.01k • 103 • 2

ai2lumos/lumos_web_agent_plan_iterative

Viewer • Updated Mar 18, 2024 • 1.01k • 97 • 6

ai2lumos/lumos_maths_ground_iterative

Viewer • Updated Mar 18, 2024 • 19.5k • 69 • 3

ai2lumos/lumos_maths_ground_onetime

Viewer • Updated Mar 18, 2024 • 19.8k • 45 • 1

ai2lumos/lumos_maths_plan_onetime

Viewer • Updated Mar 18, 2024 • 19.8k • 94 • 2

Symbol-LLM/Symbol-LLM-7B-Instruct

Text Generation • Updated Jun 23, 2024 • 67 • 13

MoritzLaurer/deberta-v3-large-zeroshot-v2.0

Zero-Shot Classification • Updated Apr 11, 2024 • 65.4k • • 93

MoritzLaurer/bge-m3-zeroshot-v2.0

Zero-Shot Classification • Updated Apr 22, 2024 • 99.7k • 48

What Makes Good Data for Alignment? A Comprehensive Study of Automatic Data Selection in Instruction Tuning

Paper • 2312.15685 • Published Dec 25, 2023 • 16

Pavithree/eli5

Viewer • Updated Apr 23, 2022 • 229k • 276 • 2

vicgalle/configurable-system-prompt-multitask

Viewer • Updated Apr 23, 2024 • 1.95k • 151 • 24

paraloq/json_data_extraction

Viewer • Updated Mar 25, 2024 • 484 • 147 • 20

livecodebench/execution

Viewer • Updated Mar 12, 2024 • 479 • 296 • 4

iamtarun/python_code_instructions_18k_alpaca

Viewer • Updated Jul 27, 2023 • 18.6k • 1.37k • 288

LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement

Paper • 2403.15042 • Published Mar 22, 2024 • 27

manishiitg/CogStack-QA

Viewer • Updated Feb 9, 2024 • 24.7k • 48 • 1

manishiitg/CogStack-Tasks

Viewer • Updated Feb 9, 2024 • 4.69k • 61 • 1

manishiitg/CogStack-Conv

Viewer • Updated Feb 9, 2024 • 2.35k • 57 • 1

Reformatted Alignment

Paper • 2402.12219 • Published Feb 19, 2024 • 18

abacusai/SystemChat-1.1

Viewer • Updated Apr 11, 2024 • 20.2k • 117 • 32

Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention

Paper • 2404.07143 • Published Apr 10, 2024 • 107

Anthropic/persuasion

Viewer • Updated Apr 9, 2024 • 3.94k • 566 • 184

Rho-1: Not All Tokens Are What You Need

Paper • 2404.07965 • Published Apr 11, 2024 • 90

M4-ai/prm_dpo_pairs

Viewer • Updated Jul 1, 2024 • 93.9k • 76 • 7

OpenLLM-France/Claire-Dialogue-French-0.1

Viewer • Updated Dec 5, 2023 • 37k • 517 • 44

amaydle/npc-dialogue

Viewer • Updated Mar 25, 2023 • 1.92k • 205 • 16

facebook/empathetic_dialogues

Updated Jan 18, 2024 • 1.61k • 97

Salesforce/dialogstudio

Updated Jan 24 • 437 • 220

Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences

Paper • 2404.03715 • Published Apr 4, 2024 • 61

microsoft/Taskbench

Viewer • Updated Aug 21, 2024 • 17.3k • 693 • 25

Learn Your Reference Model for Real Good Alignment

Paper • 2404.09656 • Published Apr 15, 2024 • 84

CantTalkAboutThis: Aligning Language Models to Stay on Topic in Dialogues

Paper • 2404.03820 • Published Apr 4, 2024 • 26

mlabonne/orpo-dpo-mix-40k

Viewer • Updated Oct 17, 2024 • 44.2k • 801 • 278

allenai/persona-bias

Updated Feb 5, 2024 • 27 • 11

PleIAs/YouTube-Commons

Updated Jun 26, 2024 • 3.78k • 342

FreedomIntelligence/evol-instruct-hindi

Viewer • Updated Aug 6, 2023 • 59k • 58 • 2

FreedomIntelligence/OVM-process

Viewer • Updated Apr 1, 2024 • 7.47k • 57 • 1

nuprl/CanItEdit

Viewer • Updated Mar 19, 2024 • 105 • 249 • 12

totally-not-an-llm/EverythingLM-data-V3

Viewer • Updated Sep 11, 2023 • 1.07k • 142 • 31

RUCAIBox/Story-Generation

Updated Mar 3, 2023 • 134 • 12

fabraz/writingPromptAug

Viewer • Updated Oct 14, 2023 • 24.1k • 224 • 2

jerryjalapeno/nart-100k-synthetic

Viewer • Updated Jul 16, 2023 • 99.1k • 256 • 41

jat-project/jat-dataset

Viewer • Updated Feb 16, 2024 • 258M • 489k • 37

euclaise/ReMask-3B

Text Generation • Updated Aug 10, 2024 • 79 • 15

google/Synthetic-Persona-Chat

Viewer • Updated Mar 1, 2024 • 10.9k • 1.57k • 100

google/cvss

Updated Feb 10, 2024 • 210 • 13

neural-bridge/rag-dataset-12000

Viewer • Updated Feb 5, 2024 • 12k • 1.93k • 131

HannahRoseKirk/prism-alignment

Viewer • Updated Apr 25, 2024 • 77.9k • 1.57k • 82

Gigax/NPC-LLM-3_8B

Text Generation • Updated May 14, 2024 • 54 • 24

nuprl/MultiPL-T

Viewer • Updated Aug 20, 2024 • 215k • 598 • 7

cognitivecomputations/SystemChat-1.2

Viewer • Updated Apr 30, 2024 • 52 • 45 • 6

mlabonne/arena-preferences

Viewer • Updated Apr 27, 2024 • 2.69k • 78 • 9

INTERS: Unlocking the Power of Large Language Models in Search with Instruction Tuning

Paper • 2401.06532 • Published Jan 12, 2024 • 12

Flexibly Scaling Large Language Models Contexts Through Extensible Tokenization

Paper • 2401.07793 • Published Jan 15, 2024 • 3

yutaozhu94/INTERS

Preview • Updated Feb 19, 2024 • 747 • 12

THUDM/CogAgent

Updated Dec 18, 2023 • 18

urchade/gliner_large-v2.1

Token Classification • Updated Apr 10, 2024 • 6.53k • 31

shachardon/ShareLM

Viewer • Updated Aug 6, 2024 • 331k • 875 • 29

nvidia/ChatQA-Training-Data

Viewer • Updated Jun 4, 2024 • 442k • 1.99k • 167

lightblue/tagengo-gpt4

Viewer • Updated Jun 2, 2024 • 78.1k • 177 • 63

Efficient-Large-Model/Llama-3-VILA1.5-8B

Text Generation • Updated Aug 16, 2024 • 3.28k • 31

bigcode/commitpackft

Viewer • Updated Aug 20, 2023 • 702k • 9.44k • 66

glaiveai/glaive-code-assistant-v3

Viewer • Updated May 20, 2024 • 950k • 239 • 46

davanstrien/cosmochat

Viewer • Updated May 10, 2024 • 199 • 119 • 12

davanstrien/cosmopedia_chat

Viewer • Updated Mar 8, 2024 • 1.19k • 87 • 7

MemGPT/MSC-Self-Instruct

Viewer • Updated Nov 2, 2023 • 500 • 209 • 11

MemGPT/qa_data

Viewer • Updated Feb 6, 2024 • 18.6k • 47 • 1

google/imageinwords

Updated May 25, 2024 • 473 • 118

grammarly/coedit

Viewer • Updated Oct 21, 2023 • 70.8k • 1.63k • 69

bea2019st/wi_locness

Updated Jan 18, 2024 • 202 • 14

GEM/FairytaleQA

Viewer • Updated Oct 25, 2022 • 10.6k • 385 • 8

grammarly/medit

Viewer • Updated Oct 1, 2024 • 113k • 342 • 13

MemGPT/MemGPT-DPO-Dataset

Viewer • Updated Apr 18, 2024 • 42.3k • 210 • 9

lmarena-ai/arena-human-preference-55k

Viewer • Updated May 17, 2024 • 57.5k • 825 • 142

princeton-nlp/QuRating-GPT3.5-Judgments

Viewer • Updated Mar 29, 2024 • 250k • 117 • 6

princeton-nlp/AutoCompressor-Llama-2-7b-6k

Updated Nov 22, 2023 • 32 • 2

H-D-T/Select-Stack

Viewer • Updated Sep 2, 2024 • 1.46M • 112 • 16

EleutherAI/lichess-puzzles

Viewer • Updated May 9, 2024 • 1.48M • 1.07k • 21

selfrag/selfrag_train_data

Viewer • Updated Oct 31, 2023 • 146k • 140 • 70

community-datasets/yahoo_answers_topics

Viewer • Updated Jun 24, 2024 • 1.46M • 2.71k • 54

TIGER-Lab/MMLU-Pro

Viewer • Updated Nov 27, 2024 • 12.1k • 43.1k • 327

ylacombe/expresso

Viewer • Updated Apr 30, 2024 • 11.6k • 518 • 40

microsoft/MeetingBank-QA-Summary

Viewer • Updated May 16, 2024 • 862 • 137 • 14

microsoft/MeetingBank-LLMCompressed

Viewer • Updated May 16, 2024 • 5.17k • 124 • 15

nvidia/ChatRAG-Bench

Viewer • Updated May 24, 2024 • 34.6k • 2.56k • 108

xingyaoww/code-act

Viewer • Updated Feb 5, 2024 • 78.4k • 233 • 55

kaist-ai/Multifaceted-Collection-ORPO

Viewer • Updated Jul 1, 2024 • 64.6k • 128 • 10

Alibaba-NLP/gte-Qwen2-7B-instruct

hwjiang/Real3D

Image-to-3D • Updated Jun 14, 2024 • 9 • 19

nvidia/Aegis-AI-Content-Safety-Dataset-1.0

Viewer • Updated Jun 28, 2024 • 12k • 1.01k • 50

ProGamerGov/synthetic-dataset-1m-dalle3-high-quality-captions

Updated Oct 30, 2024 • 4.07k • 126

OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text

Paper • 2406.08418 • Published Jun 12, 2024 • 29

facebook/multi-token-prediction

Updated Jun 18, 2024 • 364

TIGER-Lab/M-BEIR

Viewer • Updated Aug 7, 2024 • 2.86M • 2.21k • 17

tomg-group-umd/pixelprose

Viewer • Updated Jun 23, 2024 • 15.6M • 485 • 144

mit-han-lab/ShareGPT4V

Preview • Updated Feb 22, 2024 • 262 • 3

mit-han-lab/litepose

Updated Jun 5, 2024 • 1

mit-han-lab/Llama-3-8B-Instruct-QServe-g128

Text Generation • Updated May 6, 2024 • 19 • 1

internlm/internlm-xcomposer2-vl-7b

Visual Question Answering • Updated Apr 12, 2024 • 1.85k • 80

OpenGVLab/InternViT-6B-448px-V1-5

Image Feature Extraction • Updated Dec 9, 2024 • 442 • 79

OpenGVLab/InternVL-Chat-V1-5

Image-Text-to-Text • Updated Feb 5 • 3.15k • 410

OpenGVLab/Mini-InternVL-Chat-4B-V1-5

Image-Text-to-Text • Updated Feb 5 • 590 • 62

openbmb/MiniCPM-Llama3-V-2_5

Image-Text-to-Text • Updated Jan 15 • 24.4k • 1.39k

microsoft/Florence-2-large

Image-Text-to-Text • Updated Dec 8, 2024 • 3.68M • 1.45k

llava-hf/LLaVA-NeXT-Video-7B-DPO-hf

Video-Text-to-Text • Updated Jan 27 • 7.51k • 9

arcee-ai/BAAI-Infinity-Instruct-System

Viewer • Updated Jun 24, 2024 • 2.36M • 221 • 15

hpcai-tech/OpenSora-VAE-v1.2

Updated Jun 17, 2024 • 41.7k • 57

hpcai-tech/OpenSora-STDiT-v3

Updated Jun 17, 2024 • 32.1k • 46

liuqi6777/RankGPT-msmarco-100k-clean

Viewer • Updated Feb 6, 2024 • 87.3k • 85 • 1

failspy/Meta-Llama-3-70B-Instruct-abliterated-v3.5

Text Generation • Updated May 30, 2024 • 2.12k • 43

ResplendentAI/NSFW_RP_Format_DPO

Viewer • Updated Mar 17, 2024 • 400 • 89 • 67

microsoft/msr_text_compression

Updated Jan 18, 2024 • 85 • 8

microsoft/msr_sqa

Updated Jan 18, 2024 • 153 • 4

microsoft/crd3

Updated Jan 18, 2024 • 243 • 24

nvidia/domain-classifier

Updated Jan 24 • 107k • 77

jhu-clsp/FollowIR-train

Viewer • Updated Mar 25, 2024 • 1.78k • 139 • 5

vicgalle/Phudge-3

Text Classification • Updated May 30, 2024 • 14 • 3

TWO/sutra-mlt256-v2

Updated May 24, 2024 • 10

AutoRAG-HP: Automatic Online Hyper-Parameter Tuning for Retrieval-Augmented Generation

Paper • 2406.19251 • Published Jun 27, 2024 • 9

aiana94/xMINDlarge

Viewer • Updated Oct 25, 2024 • 4.12M • 432 • 4

OpenCo7/UpVoteWeb

Viewer • Updated Jul 17, 2024 • 557M • 683 • 94

davanstrien/magpie-preference

Viewer • Updated 2 days ago • 534 • 1.54k • 13

FunAudioLLM/SenseVoiceSmall

Updated Jul 31, 2024 • 1.36k • 231

euclaise/gsm8k_multiturn

Viewer • Updated Jul 6, 2024 • 8.79k • 78 • 13

internlm/internlm-xcomposer2d5-7b

Visual Question Answering • Updated Jul 22, 2024 • 4.82k • 204

dell-research-harvard/newswire

Viewer • Updated Jul 2, 2024 • 1.44M • 572 • 71

alexshengzhili/SciGraphQA-295K-train

Viewer • Updated Aug 8, 2023 • 296k • 172 • 11

xinsir/controlnet-union-sdxl-1.0

Text-to-Image • Updated Jul 30, 2024 • 109k • 1.33k

T-FREE: Tokenizer-Free Generative LLMs via Sparse Representations for Memory-Efficient Embeddings

Paper • 2406.19223 • Published Jun 27, 2024 • 9

laion/links_to_pocasts_lecture_and_shows_for_tts

Viewer • Updated May 29, 2024 • 331k • 120 • 8

laion/datacomp-hq

Viewer • Updated Mar 13, 2024 • 20.7M • 351 • 12

laion/Subjects-for-curricular

Viewer • Updated Dec 20, 2023 • 3.99M • 165 • 5

laion/strategic_game_maze

Viewer • Updated Oct 20, 2023 • 345M • 70.6k • 11

mlabonne/llmtwin

Viewer • Updated Aug 27, 2024 • 3.34k • 192 • 10

Multimodal Self-Instruct: Synthetic Abstract Image and Visual Reasoning Instruction Using Language Model

Paper • 2407.07053 • Published Jul 9, 2024 • 45

NovaSearch/stella_en_400M_v5

NovaSearch/stella_en_1.5B_v5

RhapsodyAI/MiniCPM-V-Embedding-preview

Feature Extraction • Updated Aug 20, 2024 • 154 • 49

agentsea/wave-ui-25k

Viewer • Updated Jul 3, 2024 • 25k • 361 • 26

TencentARC/StoryStream

Preview • Updated Jul 17, 2024 • 371 • 27

apple/DCLM-7B

Updated Jul 26, 2024 • 488 • 835

HuggingFaceTB/smollm-corpus

Viewer • Updated Sep 6, 2024 • 237M • 12.4k • 309

HuggingFaceTB/bisac-topics

Viewer • Updated Apr 3, 2024 • 5.5k • 19 • 2

From GaLore to WeLore: How Low-Rank Weights Non-uniformly Emerge from Low-Rank Gradients

Paper • 2407.11239 • Published Jul 15, 2024 • 8

mistralai/Mistral-Nemo-Base-2407

Text Generation • Updated Nov 6, 2024 • 28.7k • 297

TencentARC/SEED-Story

Text-to-Image • Updated Aug 26, 2024 • 13 • 27

xlangai/BRIGHT

Viewer • Updated 8 days ago • 1.35M • 5.11k • 23

glaiveai/RAG-v1

Viewer • Updated Jun 25, 2024 • 51.4k • 327 • 72

QuietImpostor/Claude-3-Opus-Claude-3.5-Sonnnet-9k

Viewer • Updated Jun 30, 2024 • 9.94k • 115 • 20

PawanKrd/gpt-4o-200k

Viewer • Updated Jun 29, 2024 • 200k • 23 • 24

kalomaze/Opus_Instruct_3k

Viewer • Updated Jul 19, 2024 • 2.95k • 94 • 25

Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backbone

Paper • 2206.07643 • Published Jun 15, 2022 • 1

Active Self-Supervised Learning: A Few Low-Cost Relationships Are All You Need

Paper • 2303.15256 • Published Mar 27, 2023 • 1

fireworks-ai/llama-3-firefunction-v2

Text Generation • Updated Jun 18, 2024 • 161 • 143

Stateful Memory-Augmented Transformers for Dialogue Modeling

Paper • 2209.07634 • Published Sep 15, 2022 • 1

cognitivecomputations/SystemChat-2.0

Preview • Updated May 31, 2024 • 189 • 58

CollectiveCognition/chats-data-2023-10-16

Viewer • Updated Oct 16, 2023 • 200 • 65 • 21

Izazk/Sequence-of-action-prediction-mind2web

Viewer • Updated Feb 22, 2024 • 68.9k • 87 • 4

BigAction/mind2web_clean

Viewer • Updated Apr 25, 2024 • 199 • 151 • 4

osunlp/Mind2Web

Viewer • Updated Jul 19, 2023 • 253 • 655 • 100

magicgh/MT-Mind2Web

Viewer • Updated Feb 23, 2024 • 259 • 224 • 2

TencentARC/PhotoMaker-V2

Text-to-Image • Updated Jul 22, 2024 • 30.8k • 138

KevSun/Personality_LM

Text Classification • Updated Jul 29, 2024 • 3.88k • 21

256

Infinite Dataset Hub

♾

Search and save datasets generated with a LLM in real time

chargoddard/SlimOrcaDedupCleaned-Sonnet3.5-DPO

Viewer • Updated Jul 23, 2024 • 168k • 73 • 7

nvidia/Minitron-8B-Base

Text Generation • Updated 23 days ago • 8.65k • 64

mlfoundations/MINT-1T-HTML

Viewer • Updated Sep 21, 2024 • 623M • 320k • 82

mlfoundations/MINT-1T-ArXiv

Viewer • Updated Sep 19, 2024 • 5.6M • 656 • 48

mlfoundations/MINT-1T-PDF-CC-2024-18

Updated Sep 19, 2024 • 7.94k • 19

AI-MO/NuminaMath-TIR

Viewer • Updated Nov 25, 2024 • 72.5k • 22.4k • 115

DistilDIRE: A Small, Fast, Cheap and Lightweight Diffusion Synthesized Deepfake Detection

Paper • 2406.00856 • Published Jun 2, 2024 • 12

mlabonne/FineTome-100k

Viewer • Updated Jul 29, 2024 • 100k • 13.5k • 172

LiruiZhao/Diffree

Image-to-Image • Updated Jul 29, 2024 • 37 • 18

BAAI/bge-multilingual-gemma2

Feature Extraction • Updated Jul 31, 2024 • 165k • 173

BAAI/bge-reranker-v2.5-gemma2-lightweight

Text Classification • Updated Sep 6, 2024 • 2.09k • 46

BAAI/IndustryCorpus

Viewer • Updated Jul 23, 2024 • 595M • 1.97k • 53

jspringer/echo-mistral-7b-instruct-lasttoken

Feature Extraction • Updated Feb 26, 2024 • 176 • 6

BAAI/bge-en-icl

Feature Extraction • Updated Jan 15 • 24.6k • 125

AlekseyKorshuk/full_user_edit_responses-clean

Viewer • Updated Mar 30, 2023 • 364k • 51 • 1

m-a-p/MMRA

Viewer • Updated Jul 31, 2024 • 1.02k • 93 • 13

m-a-p/II-Bench

Viewer • Updated Jun 29, 2024 • 1.43k • 432 • 10

BEE-spoke-data/fineweb-1000_64k

Viewer • Updated Jun 23, 2024 • 2k • 81 • 4

Salesforce/xgen-mm-phi3-mini-instruct-r-v1

Image-Text-to-Text • Updated Feb 3 • 1.41k • 184

black-forest-labs/FLUX.1-dev

Text-to-Image • Updated Aug 16, 2024 • 2.62M • • 9.25k

black-forest-labs/FLUX.1-schnell

Text-to-Image • Updated Aug 16, 2024 • 1.51M • • 3.49k

numind/NuExtract

Text Generation • Updated Oct 17, 2024 • 1.55k • 219

numind/NuSentiment-multilingual

Feature Extraction • Updated Jan 26, 2024 • 144 • 12

HuggingFaceM4/Idefics3-8B-Llama3

Image-Text-to-Text • Updated Dec 2, 2024 • 46.5k • 271

aipicasso/megalith-10m-florence2

Viewer • Updated Jul 31, 2024 • 9.14M • 175 • 23

ZhengPeng7/BiRefNet

Image Segmentation • Updated 4 days ago • 555k • 329

nvidia/quality-classifier-deberta

Updated Jan 31 • 13.7k • 56

Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters

Paper • 2408.04093 • Published Aug 7, 2024 • 4

tiiuae/falcon-mamba-7b-4bit

Text Generation • Updated Oct 10, 2024 • 58 • 11

nisten/all-human-diseases

Viewer • Updated Aug 19, 2024 • 2.2k • 175 • 106

THUDM/LongWriter-6k

Viewer • Updated Aug 14, 2024 • 6k • 296 • 175

anthracite-org/Stheno-Data-Filtered

Viewer • Updated Aug 18, 2024 • 31.1k • 35 • 14

anthracite-org/kalo-opus-instruct-22k-no-refusal

Viewer • Updated Aug 13, 2024 • 22.3k • 228 • 28

anthracite-org/nopm_claude_writing_fixed

Viewer • Updated Aug 18, 2024 • 6.35k • 151 • 13

microsoft/Phi-3.5-vision-instruct

Image-Text-to-Text • Updated Sep 26, 2024 • 338k • • 671

microsoft/Phi-3.5-MoE-instruct

Text Generation • Updated 1 day ago • 39.2k • • 554

fal/AuraFace-v1

Updated Aug 26, 2024 • 89

NexaAIDev/Squid

Updated Sep 3, 2024 • 58 • 34

Dolphin: Long Context as a New Modality for Energy-Efficient On-Device Language Models

Paper • 2408.15518 • Published Aug 28, 2024 • 43

HuggingFaceTB/everyday-conversations-llama3.1-2k

Viewer • Updated Jan 29 • 2.38k • 664 • 98

NousResearch/hermes-function-calling-v1

Viewer • Updated Aug 30, 2024 • 11.6k • 1.89k • 267

multimodalart/product-design

Text-to-Image • Updated Sep 22, 2024 • 11.7k • • 36

novateur/WavTokenizer

Text-to-Speech • Updated Dec 2, 2024 • 50

facebook/sapiens

Updated Sep 20, 2024 • 26 • 235

Shakker-Labs/AWPortrait-FL

Text-to-Image • Updated Sep 5, 2024 • 122k • 449

sequelbox/Supernova

Viewer • Updated Sep 27, 2024 • 178k • 165 • 8

544

Vision Arena (Testing VLMs side-by-side)

🖼

Analyze images to detect and label objects

mattshumer/Reflection-Llama-3.1-70B

Text Generation • Updated Sep 24, 2024 • 635 • 1.72k

deepseek-ai/DeepSeek-V2.5

Text Generation • Updated Dec 11, 2024 • 3.79k • 700

deepseek-ai/ESFT-vanilla-lite

Text Generation • Updated Jul 23, 2024 • 441 • 11

yifeihu/TB-OCR-preview-0.1

Image-Text-to-Text • Updated Sep 6, 2024 • 367 • 130

gabrielmbmb/distilabel-reflection-tuning

Viewer • Updated Sep 6, 2024 • 5 • 127 • 56

TencentARC/Open-MAGVIT2

Image Feature Extraction • Updated Sep 9, 2024 • 12

openbmb/MiniCPM3-4B

Text Generation • Updated 10 days ago • 21.8k • 407

THUDM/LongCite-glm4-9b

Text Generation • Updated Dec 16, 2024 • 150 • 30

jinaai/reader-lm-1.5b

Text Generation • Updated Jan 17 • 954 • • 589

Vchitect/Vchitect-2.0-2B

Text-to-Video • Updated Sep 15, 2024 • 41 • 38

tencent/DepthCrafter

Depth Estimation • Updated Sep 24, 2024 • 264k • 83

mistralai/Pixtral-12B-2409

Image-Text-to-Text • Updated Dec 26, 2024 • • 619

stepfun-ai/GOT-OCR2_0

Image-Text-to-Text • Updated Feb 4 • 80.6k • 1.41k

StoryMaker: Towards Holistic Consistent Characters in Text-to-image Generation

Paper • 2409.12576 • Published Sep 19, 2024 • 16

THUdyh/Oryx-7B

Text Generation • Updated Sep 25, 2024 • 132 • 11

THUdyh/Oryx-7B-Image

Text Generation • Updated Sep 23, 2024 • 21 • 3

THUdyh/Oryx-ViT

Image Feature Extraction • Updated 8 days ago • 5

BAAI/SegGPT

Updated Apr 21, 2023 • 18

Salesforce/fineweb_deduplicated

Viewer • Updated Feb 3 • 6.43B • 9.69k • 35

KbsdJames/Omni-MATH

Viewer • Updated Oct 12, 2024 • 4.43k • 5.04k • 87

BAAI/Emu3-Gen

Any-to-Any • Updated Oct 23, 2024 • 1.38k • 209

CultriX/elitebabes-flux

Text-to-Image • Updated Sep 20, 2024 • 87 • • 16

RED-AIGC/StoryMaker

Text-to-Image • Updated Nov 9, 2024 • 163 • 76

google/frames-benchmark

Viewer • Updated Oct 15, 2024 • 824 • 2.13k • 188

Anthropic/discrim-eval

Viewer • Updated Jan 5, 2024 • 18.9k • 1.06k • 45

facebook/sam2.1-hiera-large

Mask Generation • Updated Sep 24, 2024 • 561k • 72

Zyphra/Zamba2-2.7B-instruct

Text Generation • Updated 24 days ago • 553 • 82

princeton-nlp/Llama-3-8B-ProLong-512k-Instruct

Updated Oct 31, 2024 • 2.39k • 20

jxm/cde-small-v1

Feature Extraction • Updated Jan 21 • 1.82k • 285

PrincetonPLI/Instruct-SkillMix-SDD

Viewer • Updated Sep 9, 2024 • 8k • 77 • 5

THUDM/cogvlm2-llama3-caption

Video-Text-to-Text • Updated Jan 22 • 6.25k • 86

julien040/hacker-news-posts

Viewer • Updated Jun 6, 2023 • 4.01M • 169 • 6

princeton-nlp/Llama-3-8B-ProLong-512k-Base

Updated Oct 31, 2024 • 2.06k • 9

LLM360/TxT360

Preview • Updated Nov 8, 2024 • 606k • 224

bingbangboom/flux-waterscape

Text-to-Image • Updated Oct 10, 2024 • 644 • • 14

facebook/Self-taught-evaluator-DPO-data

Viewer • Updated about 1 month ago • 57.5k • 90 • 33

facebook/layerskip-llama2-13B

Text Generation • Updated Oct 19, 2024 • 1.05k • 5

ibm-granite/granite-8b-code-instruct-accelerator

Updated May 29, 2024 • 12 • 1

peakji/steiner-32b-preview

Updated Oct 21, 2024 • 19 • 44

CohereForAI/aya-expanse-32b

Text Generation • Updated 7 days ago • 112k • 232

CohereForAI/aya-expanse-8b

Text Generation • Updated 7 days ago • 32k • 345

Mono-InternVL: Pushing the Boundaries of Monolithic Multimodal Large Language Models with Endogenous Visual Pre-training

Paper • 2410.08202 • Published Oct 10, 2024 • 4

McGill-NLP/FaithDial

Viewer • Updated Feb 5, 2023 • 32.3k • 607 • 17

relaxml/Llama-3.1-8b-Instruct-QTIP-4Bit

Updated Oct 28, 2024 • 95 • 2

Dualformer: Controllable Fast and Slow Thinking by Learning with Randomized Reasoning Traces

Paper • 2410.09918 • Published Oct 13, 2024 • 3

GAIR/o1-journey

Viewer • Updated Oct 16, 2024 • 327 • 291 • 134

marcelbinz/Psych-101

Viewer • Updated Nov 2, 2024 • 60.1k • 253 • 43

nvidia/Nemotron-4-Mini-Hindi-4B-Base

Updated Oct 23, 2024 • 91 • 12

nvidia/Nemotron-4-Mini-Hindi-4B-Instruct

Updated Nov 15, 2024 • 103 • 18

Etched/oasis-500m

Updated Nov 4, 2024 • 171 • 448

HuggingFaceTB/SmolLM2-1.7B-Instruct

Text Generation • Updated 3 days ago • 375k • • 573

tencent/Tencent-Hunyuan-Large

Text Generation • Updated Jan 19 • 218 • 568

THUDM/webrl-llama-3.1-8b

Updated Nov 6, 2024 • 426 • 3

THUDM/webrl-glm-4-9b

Updated Nov 5, 2024 • 29 • 8

hbseong/HarmAug-Guard

Text Classification • Updated 11 days ago • 275 • 38

BAAI/IndustryCorpus2

Viewer • Updated Dec 17, 2024 • 826M • 2.53k • 47

di-zhang-fdu/OpenLongCoT-Pretrain

Viewer • Updated Oct 28, 2024 • 103k • 105 • 87

microsoft/maira-2

Text Generation • Updated 18 days ago • 72.4k • 49

LLM2CLIP: Powerful Language Model Unlock Richer Visual Representation

Paper • 2411.04997 • Published Nov 7, 2024 • 37

microsoft/orca-agentinstruct-1M-v1

Viewer • Updated Nov 1, 2024 • 1.05M • 9.39k • 431

Nexusflow/Athene-V2-Chat

Text Generation • Updated Nov 26, 2024 • 8.01k • 285

Nexusflow/Athene-V2-Agent

Text Generation • Updated Nov 21, 2024 • 423 • 127

numind/NuExtract-1.5-tiny

Text Generation • Updated Nov 18, 2024 • 2.13k • • 18

Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models

Paper • 2411.04996 • Published Nov 7, 2024 • 51

allenai/ACE2-ERA5

Updated Nov 21, 2024 • 2

Do I Know This Entity? Knowledge Awareness and Hallucinations in Language Models

Paper • 2411.14257 • Published Nov 21, 2024 • 13

nvidia/Hymba-1.5B-Base

Text Generation • Updated Jan 2 • 3.59k • 139

AIDC-AI/Marco-o1

Text Generation • Updated Nov 23, 2024 • 8.54k • 711

allenai/Llama-3.1-Tulu-3-70B

Text Generation • Updated 27 days ago • 4.4k • 54

nachoyawn/three-million-bluesky

Viewer • Updated Nov 28, 2024 • 3.01M • 196 • 10

huihui-ai/QwQ-32B-Preview-abliterated

Text Generation • Updated Nov 28, 2024 • 324 • 99

data-is-better-together/open-image-preferences-v1

Viewer • Updated Dec 9, 2024 • 8.67k • 33.6k • 24

showlab/ShowUI-desktop

Viewer • Updated Dec 17, 2024 • 7.5k • 1.25k • 23

o1-Coder: an o1 Replication for Coding

Paper • 2412.00154 • Published Nov 29, 2024 • 43

nvidia/multilingual-domain-classifier

Updated Jan 24 • 17.3k • 16

TencentARC/Divot

Updated Dec 10, 2024 • 19 • 6

microsoft/RedStone

Updated Dec 5, 2024 • 68 • 33

ruliad/deepthought-8b-llama-v0.01-alpha

Text Generation • Updated Dec 7, 2024 • 127 • 144

TIGER-Lab/ScholarCopilot-v1

Updated Dec 8, 2024 • 28 • 4

TIGER-Lab/ScholarCopilot-Data-v1

Viewer • Updated Dec 15, 2024 • 677k • 147 • 2

facebook/sparsh-dino-base

Updated Oct 21, 2024 • 5

deepseek-ai/DeepSeek-V2.5-1210

Text Generation • Updated Dec 11, 2024 • 2.1k • 252

facebook/metamotivo-M-1

Updated Dec 12, 2024 • 825 • 7

deepseek-ai/DeepSeek-Prover-V1.5-RL

Updated Aug 29, 2024 • 22.1k • 52

tiiuae/Falcon3-10B-Base

Text Generation • Updated Dec 18, 2024 • 26.2k • 35

answerdotai/ModernBERT-base

Fill-Mask • Updated Jan 15 • 6.09M • 783

HuggingFaceTB/finemath

Viewer • Updated Feb 6 • 48.3M • 11.5k • 292

google/reveal

Viewer • Updated Apr 9, 2024 • 6.1k • 79 • 30

showlab/ShowUI-web

Viewer • Updated 5 days ago • 22k • 41.7k • 11

Writer/omniact

Updated Apr 29, 2024 • 1.13k • 34

NovaSky-AI/Sky-T1-32B-Preview

Text Generation • Updated Jan 13 • 5.22k • 539

notdiamond/notdiamond-0001

Text Classification • Updated Jul 30, 2024 • 220 • • 138

MiniMaxAI/MiniMax-Text-01

Text Generation • Updated 13 days ago • 1.63k • 544

EvaByte/EvaByte-SFT

Updated 9 days ago • 62 • 35

bespokelabs/Bespoke-Stratos-17k

Viewer • Updated Jan 31 • 16.7k • 73.8k • 289

kyutai/hibiki-2b-rs-bf16

Translation • Updated 28 days ago • 3

perplexity-ai/r1-1776-distill-llama-70b

Text Generation • Updated 11 days ago • 7.38k • 98

nbeerbower/EVA-Gutenberg3-Qwen2.5-32B

Text Generation • Updated Jan 19 • 56 • 4

desklib/ai-text-detector-v1.01

Text Classification • Updated 20 days ago • 3.99k • 4

chillies/mistral-7b-ielts-evaluator-q4

Updated May 27, 2024 • 102 • 11

moonshotai/Moonlight-16B-A3B

Text Generation • Updated 11 days ago • 1.9k • 73

yuan-yang/ReWild

Preview • Updated Jun 26, 2024 • 96 • 2

moonshotai/Moonlight-16B-A3B-Instruct

Text Generation • Updated 6 days ago • 4.09k • 126

GSAI-ML/LLaDA-8B-Instruct

Text Generation • Updated 11 days ago • 18.6k • 189

ai21labs/AI21-Jamba-Large-1.6

Text Generation • Updated 3 days ago • 195 • 45

ai21labs/AI21-Jamba-Mini-1.6

Text Generation • Updated 3 days ago • 830 • 29