LLMs

Generate text responses from prompts

abacusai/Smaug-72B-v0.1

Text Generation • Updated Feb 23, 2024 • 2.57k • 468

google/gemma-7b

Text Generation • Updated Jun 27, 2024 • 64.5k • 3.12k

google/gemma-2b

Text Generation • Updated Sep 27, 2024 • 277k • 967

google/gemma-7b-it

Text Generation • Updated Aug 14, 2024 • 90.9k • 1.15k

google/gemma-2b-it

Text Generation • Updated Sep 27, 2024 • 137k • 706

mlabonne/OrcaGemma-2B

Text Generation • Updated Feb 26, 2024 • 138 • 6

mlabonne/NeuralHermes-2.5-Mistral-7B

Text Generation • Updated Apr 8, 2024 • 391 • 154

mlabonne/AlphaMonarch-7B

Text Generation • Updated Mar 28, 2024 • 12.1k • 149

h2oai/h2o-danube-1.8b-chat

Text Generation • Updated Apr 3, 2024 • 562 • 54

TIGER-Lab/StructLM-7B

Text Generation • Updated Nov 8, 2024 • 79 • 22

TIGER-Lab/StructLM-13B

Text Generation • Updated Oct 19, 2024 • 13 • 9

abideen/starcoder2-chat

Text Generation • Updated Mar 5, 2024 • 63 • 2

bigcode/starcoder2-7b

Text Generation • Updated Jun 11, 2024 • 32.7k • 168

HuggingFaceH4/ultrachat_200k

Viewer • Updated Oct 16, 2024 • 515k • 11.7k • 508

HuggingFaceH4/zephyr-7b-beta

Text Generation • Updated Oct 16, 2024 • 319k • 1.65k

Open-Orca/OpenOrca-Platypus2-13B

Text Generation • Updated Sep 24, 2023 • 7.54k • 225

openai/whisper-large-v3

Automatic Speech Recognition • Updated Aug 12, 2024 • 3.79M • 4.05k

ByteDance/SDXL-Lightning

Text-to-Image • Updated Apr 3, 2024 • 150k • 1.98k

gorilla-llm/gorilla-openfunctions-v2

Text Generation • Updated Apr 18, 2024 • 445 • 222

mixedbread-ai/mxbai-embed-large-v1

Feature Extraction • Updated Nov 26, 2024 • 1.55M • 613

HuggingFaceTB/cosmo-1b

Text Generation • Updated Jul 8, 2024 • 708 • 130

NousResearch/Genstruct-7B

Text Generation • Updated Mar 7, 2024 • 6.61k • 373

bigcode/starcoder2-15b

Text Generation • Updated Jun 5, 2024 • 27.9k • 582

CohereForAI/c4ai-command-r-v01

Text Generation • Updated Sep 27, 2024 • 9.82k • 1.08k

microsoft/udop-large

Image-Text-to-Text • Updated Mar 11, 2024 • 5.34k • 111

elinas/chronos-13b

Text Generation • Updated Jun 23, 2023 • 29 • 33

amazon/chronos-t5-large

Time Series Forecasting • Updated Nov 27, 2024 • 217k • 134

allenai/OLMo-1B

Text Generation • Updated Jul 16, 2024 • 1.4k • 109

allenai/OLMo-7B

Text Generation • Updated Jul 16, 2024 • 12.5k • 631

allenai/OLMo-7B-Instruct

Text Generation • Updated Oct 15, 2024 • 1.54k • 51

xai-org/grok-1

Text Generation • Updated Mar 28, 2024 • 859 • 2.24k

Efficient-Large-Model/VILA-7b

Text Generation • Updated Mar 4, 2024 • 254 • 26

mistralai/Mixtral-8x7B-Instruct-v0.1

Text Generation • Updated Aug 19, 2024 • 511k • 4.3k

MaziyarPanahi/Mistral-11B-Instruct-v0.2-Mistral-7B-Instruct-v0.2-slerp

Text Generation • Updated Jan 10, 2024 • 32 • 2

mistralai/Mistral-7B-Instruct-v0.2

Text Generation • Updated Sep 27, 2024 • 3.61M • 2.66k

stabilityai/stable-code-instruct-3b

Text Generation • Updated Jul 10, 2024 • 2.27k • 170

mlx-community/Mistral-7B-v0.2-4bit

Updated Mar 25, 2024 • 54 • 8

mistral-community/Mistral-7B-v0.2

Text Generation • Updated Jul 1, 2024 • 30.4k • 232

mlx-community/stable-code-instruct-3b-4bit

Text Generation • Updated Mar 25, 2024 • 96 • 5

databricks/dbrx-instruct

Text Generation • Updated Apr 19, 2024 • 13k • 1.11k

mlx-community/dbrx-instruct-4bit

Updated Apr 9, 2024 • 22 • 48

jetmoe/jetmoe-8b

Text Generation • Updated Apr 15, 2024 • 2.96k • 245

miqudev/miqu-1-70b

Updated Feb 4, 2024 • 412 • 985

mistral-community/Mixtral-8x22B-v0.1-4bit

Text Generation • Updated Apr 10, 2024 • 251 • 54

mlx-community/codegemma-7b-it-8bit

Text Generation • Updated Apr 9, 2024 • 327 • 4

urchade/gliner_multi-v2.1

Token Classification • Updated Apr 10, 2024 • 33.8k • 111

lightblue/Karasu-Mixtral-8x22B-v0.1

Text Generation • Updated Apr 11, 2024 • 25 • 62

mistral-community/Mixtral-8x22B-v0.1

Text Generation • Updated Jul 1, 2024 • 4.38k • 674

mlx-community/Mixtral-8x22B-4bit

Updated Apr 10, 2024 • 24 • 51

HuggingFaceH4/zephyr-orpo-141b-A35b-v0.1

Text Generation • Updated Apr 18, 2024 • 558 • 266

Snowflake/snowflake-arctic-embed-m

alpindale/WizardLM-2-8x22B

Text Generation • Updated Sep 14, 2024 • 7.15k • 397

lucyknada/microsoft_WizardLM-2-7B

Text Generation • Updated Apr 16, 2024 • 173 • 49

microsoft/layoutlmv3-base

Updated Apr 10, 2024 • 2.2M • 364

mlabonne/Llama-3-linear-8B

Text Generation • Updated Apr 18, 2024 • 15 • 6

meta-llama/Meta-Llama-3-8B

Text Generation • Updated Sep 27, 2024 • 607k • 6.03k

meta-llama/Meta-Llama-3-8B-Instruct

Text Generation • Updated Sep 27, 2024 • 1.91M • 3.82k

Snowflake/snowflake-arctic-instruct

Text Generation • Updated May 21, 2024 • 14.8k • 351

Snowflake/snowflake-arctic-base

Text Generation • Updated May 14, 2024 • 285 • 66

apple/OpenELM

Updated May 2, 2024 • 1.43k

Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences

Paper • 2404.03715 • Published Apr 4, 2024 • 61

BAAI/Bunny-Llama-3-8B-V

Text Generation • Updated Jun 24, 2024 • 726 • 84

prometheus-eval/prometheus-7b-v2.0

Text2Text Generation • Updated Nov 29, 2024 • 52.8k • 87

prometheus-eval/prometheus-8x7b-v2.0

Text2Text Generation • Updated Nov 29, 2024 • 3.45k • 49

NousResearch/Hermes-2-Pro-Llama-3-8B

Text Generation • Updated Sep 14, 2024 • 37.1k • 418

mlabonne/Meta-Llama-3-120B-Instruct

Text Generation • Updated Jul 18, 2024 • 62 • 200

ibm-granite/granite-8b-code-base-4k

Text Generation • Updated Sep 2, 2024 • 1.4k • 29

valpy/prompt-classification

Text Classification • Updated Apr 18, 2024 • 117 • 7

google/timesfm-1.0-200m

Time Series Forecasting • Updated May 17, 2024 • 1.94k • 720

bigcode/starcoder2-15b-instruct-v0.1

Text Generation • Updated Nov 3, 2024 • 1.27k • 101

numind/NuNER_Zero-span

Token Classification • Updated 13 days ago • 138 • 15

google/paligemma-3b-pt-896

Image-Text-to-Text • Updated Jul 19, 2024 • 13.9k • 117

abacusai/Smaug-Llama-3-70B-Instruct

Text Generation • Updated Jun 4, 2024 • 4.44k • 147

mistralai/Mistral-7B-Instruct-v0.3

Text Generation • Updated Aug 21, 2024 • 500k • 1.36k

Qwen/Qwen2-72B-Instruct

Text Generation • Updated Oct 8, 2024 • 45k • 705

meta-llama/Meta-Llama-3-70B-Instruct

Text Generation • Updated Dec 15, 2024 • 201k • 1.46k

google/recurrentgemma-9b

Text Generation • Updated Aug 7, 2024 • 187 • 58

google/recurrentgemma-9b-it

Text Generation • Updated Aug 7, 2024 • 10.2k • 50

681

Qwen2 72B Instruct

💻

Chat with Qwen2-72B-instruct using a system prompt

microsoft/Phi-3-vision-128k-instruct

Text Generation • Updated Aug 20, 2024 • 152k • 949

stabilityai/stable-audio-open-1.0

Text-to-Audio • Updated Jul 31, 2024 • 21.5k • 1.06k

apple/AIM-7B

Image Classification • Updated Jan 19, 2024 • 96 • 24

tomaarsen/span-marker-roberta-large-ontonotes5

Token Classification • Updated Sep 22, 2023 • 530 • 12

mlabonne/NeuralDaredevil-8B-abliterated

Text Generation • Updated Aug 27, 2024 • 20k • 180

nvidia/NV-Embed-v1

Updated Nov 30, 2024 • 4.78k • 425

deepseek-ai/DeepSeek-Coder-V2-Instruct

Text Generation • Updated Aug 21, 2024 • 6.5k • 574

deepseek-ai/DeepSeek-Coder-V2-Base

Text Generation • Updated Jul 3, 2024 • 848 • 67

deepseek-ai/DeepSeek-Coder-V2-Lite-Instruct

Text Generation • Updated Jul 3, 2024 • 77.7k • 386

microsoft/Florence-2-large

Image-Text-to-Text • Updated Dec 8, 2024 • 725k • 1.41k

deepseek-ai/deepseek-math-7b-rl

Text Generation • Updated Mar 19, 2024 • 1.81k • 74

EPFL-VILAB/4M-7_B_CC12M

Any-to-Any • Updated Oct 7, 2024 • 490 • 16

google/gemma-2-9b-it

Text Generation • Updated Aug 27, 2024 • 560k • 659

nomic-ai/nomic-embed-vision-v1.5

Image Feature Extraction • Updated about 1 month ago • 28.6k • 142

CAMB-AI/MARS5-TTS

Text-to-Speech • Updated Jul 5, 2024 • 210 • 448

internlm/internlm-xcomposer2d5-7b

Visual Question Answering • Updated Jul 22, 2024 • 687k • 199

apple/DCLM-7B

Updated Jul 26, 2024 • 722 • 831

meta-llama/Llama-3.1-8B-Instruct

Text Generation • Updated Sep 25, 2024 • 6.05M • 3.62k

meta-llama/Llama-3.1-8B

Text Generation • Updated Oct 16, 2024 • 1.19M • 1.42k

meta-llama/Llama-3.1-70B-Instruct

Text Generation • Updated Dec 15, 2024 • 376k • 786

meta-llama/Llama-3.1-405B-Instruct

Text Generation • Updated Sep 25, 2024 • 72.2k • 568

mistralai/Mistral-Large-Instruct-2407

Updated Oct 16, 2024 • 11.3k • 823

meta-llama/Prompt-Guard-86M

Text Classification • Updated Jul 25, 2024 • 19.1k • 218

unsloth/Mistral-Large-Instruct-2407-bnb-4bit

Text Generation • Updated Sep 11, 2024 • 151 • 9

mlabonne/FineLlama-3.1-8B-GGUF

Updated Aug 27, 2024 • 158 • 7

OpenGVLab/InternVL2-40B

Image-Text-to-Text • Updated 11 days ago • 1.5k • 95

McGill-NLP/LLM2Vec-Meta-Llama-3-8B-Instruct-mntp-supervised

mlabonne/Meta-Llama-3.1-8B-Instruct-abliterated-GGUF

Updated Aug 3, 2024 • 10.9k • 132

aiola/whisper-medusa-v1

Updated Aug 3, 2024 • 117 • 178

nomic-ai/nomic-bert-2048

Fill-Mask • Updated 5 days ago • 167k • 36

Magpie-Align/Llama-3.1-8B-Magpie-Align-v0.1

Text Generation • Updated Aug 19, 2024 • 2.1k • 3

Snowflake/snowflake-arctic-embed-xs

black-forest-labs/FLUX.1-schnell

Text-to-Image • Updated Aug 16, 2024 • 1.05M • 3.38k

mobiuslabsgmbh/Llama-3.1-70b-instruct_4bitgs64_hqq

Text Generation • Updated 11 days ago • 27 • 31

nvidia/Minitron-4B-Base

Text Generation • Updated 1 day ago • 3.28k • 130

nvidia/Minitron-8B-Base

Text Generation • Updated 1 day ago • 7.99k • 63

neuralmagic/Meta-Llama-3.1-70B-Instruct-quantized.w8a8

Text Generation • Updated 5 days ago • 8.16k • 19

neuralmagic/Meta-Llama-3.1-8B-Instruct-FP8

Text Generation • Updated Oct 9, 2024 • 153k • 38

mlabonne/Hermes-3-Llama-3.1-8B-lorablated

Text Generation • Updated Aug 17, 2024 • 28.4k • 30

mlabonne/Hermes-3-Llama-3.1-70B-lorablated

Text Generation • Updated Oct 16, 2024 • 72 • 26

Mozilla/whisperfile

Updated Oct 2, 2024 • 2.18k • 241

microsoft/Phi-3.5-MoE-instruct

Text Generation • Updated Oct 24, 2024 • 34.4k • 555

microsoft/Phi-3.5-mini-instruct

Text Generation • Updated Sep 18, 2024 • 911k • 805

microsoft/Phi-3.5-vision-instruct

Image-Text-to-Text • Updated Sep 26, 2024 • 289k • 662

cerebras/Llama3-DocChat-1.0-8B

Text Generation • Updated Aug 16, 2024 • 122 • 68

opendatalab/PDF-Extract-Kit

Updated Sep 14, 2024 • 64

NousResearch/Hermes-3-Llama-3.1-70B

Text Generation • Updated Sep 8, 2024 • 4.88k • 106

nvidia/Mistral-NeMo-Minitron-8B-Base

Text Generation • Updated Aug 22, 2024 • 13.7k • 170

ai21labs/AI21-Jamba-1.5-Mini

Text Generation • Updated Sep 17, 2024 • 9.28k • 261

meta-llama/Llama-3.1-70B

Text Generation • Updated Sep 25, 2024 • 65.4k • 346

abacusai/Dracarys-72B-Instruct

Text Generation • Updated Sep 27, 2024 • 2.3k • 21

cartesia-ai/Rene-v0.1-1.3b-pytorch

Updated Aug 28, 2024 • 55 • 55

IntelLabs/LlavaOLMoBitnet1B

Updated Aug 30, 2024 • 20 • 30

gpt-omni/mini-omni

Text-to-Speech • Updated Sep 4, 2024 • 416

mattshumer/Reflection-Llama-3.1-70B

Text Generation • Updated Sep 24, 2024 • 578 • 1.71k

upstage/solar-pro-preview-instruct

Text Generation • Updated Sep 20, 2024 • 15.6k • 445

fishaudio/fish-speech-1.4

Text-to-Speech • Updated Nov 5, 2024 • 917 • 448

ICTNLP/Llama-3.1-8B-Omni

Updated Nov 14, 2024 • 2.5k • 394

google/datagemma-rag-27b-it

Text Generation • Updated Sep 12, 2024 • 8.47k • 182

google/datagemma-rig-27b-it

Text Generation • Updated Sep 16, 2024 • 469 • 102

google/shieldgemma-9b

Text Generation • Updated Sep 6, 2024 • 445 • 20

stepfun-ai/GOT-OCR2_0

Image-Text-to-Text • Updated 12 days ago • 267k • 1.37k

mistralai/Pixtral-12B-2409

Image-Text-to-Text • Updated Dec 26, 2024 • 603

ibm-nasa-geospatial/Prithvi-WxC-1.0-2300M

Updated Dec 12, 2024 • 110 • 67

allenai/Molmo-72B-0924

Image-Text-to-Text • Updated Oct 10, 2024 • 4.67k • 280

meta-llama/Llama-3.2-1B

Text Generation • Updated Oct 24, 2024 • 11.1M • 1.57k

meta-llama/Llama-3.2-1B-Instruct

Text Generation • Updated Oct 24, 2024 • 1.66M • 756

meta-llama/Llama-Guard-3-1B-INT4

Text Generation • Updated Sep 25, 2024 • 27

meta-llama/Llama-3.2-11B-Vision-Instruct

Image-Text-to-Text • Updated Dec 4, 2024 • 1.54M • 1.32k

meta-llama/Llama-Guard-3-11B-Vision

Image-Text-to-Text • Updated Nov 18, 2024 • 1.16k • 55

nvidia/Llama-3_1-Nemotron-51B-Instruct

Text Generation • Updated Oct 13, 2024 • 90.9k • 204

rhymes-ai/Aria

Image-Text-to-Text • Updated 20 days ago • 25.5k • 614

BSC-LT/salamandra-2b

Text Generation • Updated 3 days ago • 10.8k • 22

arcee-ai/SuperNova-Medius

Text Generation • Updated Oct 28, 2024 • 2.74k • 201

numind/NuExtract-1.5

Text Generation • Updated Nov 18, 2024 • 116k • 196

62

NuExtract 1.5

👀

Playground for NuExtract-v1.5

nvidia/Llama-3.1-Nemotron-70B-Instruct-HF

Text Generation • Updated Oct 25, 2024 • 127k • 2.02k

mistralai/Ministral-8B-Instruct-2410

Updated Dec 6, 2024 • 50.6k • 426

TheFinAI/finma-7b-full

Text Generation • Updated Mar 28, 2024 • 197 • 7

Haon-Chen/speed-embedding-7b-instruct

Feature Extraction • Updated Nov 3, 2024 • 434 • 5

opendatalab/PDF-Extract-Kit-1.0

Updated 26 days ago • 31

sentence-transformers-testing/stsb-bert-tiny-lora

numind/NuExtract-1.5-smol

Text Generation • Updated Nov 18, 2024 • 211 • 54

apple/coreml-mobileclip

Updated Nov 19, 2024 • 279 • 40

unsloth/Qwen2.5-Coder-32B-Instruct-128K-GGUF

Updated Nov 15, 2024 • 17.6k • 60

neuralmagic/Sparse-Llama-3.1-8B-2of4

Text Generation • Updated Dec 16, 2024 • 9.9k • 61

Skywork/Skywork-o1-Open-Llama-3.1-8B

Text Generation • Updated 2 days ago • 873 • 108

utter-project/EuroLLM-9B-Instruct

Text Generation • Updated Dec 9, 2024 • 25.8k • 146

utter-project/EuroLLM-9B

Text Generation • Updated Dec 9, 2024 • 4.28k • 66

mlabonne/BigLlama-3.1-1T-Instruct

Text Generation • Updated Aug 8, 2024 • 164 • 78

meta-llama/Llama-3.3-70B-Instruct

Text Generation • Updated Dec 21, 2024 • 536k • 1.96k

Personalized Multimodal Large Language Models: A Survey

Paper • 2412.02142 • Published Dec 3, 2024 • 14

Snowflake/snowflake-arctic-embed-l-v2.0

Snowflake/snowflake-arctic-embed-m-v2.0

Qwen/Qwen2.5-Coder-32B-Instruct

Text Generation • Updated Jan 12 • 101k • 1.61k

FinGPT/fingpt-mt_llama3-8b_lora

Updated Oct 6, 2024 • 18

taohu/mask

Updated 10 days ago • 5

[MASK] is All You Need

Paper • 2412.06787 • Published Dec 9, 2024 • 3

Qwen/Qwen2.5-72B-Instruct

Text Generation • Updated Jan 12 • 296k • 728

fishaudio/fish-speech-1.5

Text-to-Speech • Updated Dec 3, 2024 • 9.26k • 462

62

StoryStar

💬

Fantasy story generator

huihui-ai/QwQ-32B-Preview-abliterated

Text Generation • Updated Nov 28, 2024 • 471 • 97

GoodiesHere/Apollo-LMMs-Apollo-7B-t32

Video-Text-to-Text • Updated Dec 18, 2024 • 294 • 50

ibm-granite/granite-3.1-8b-instruct

Text Generation • Updated 16 days ago • 89.3k • 146

RUC-NLPIR/OmniEval-HallucinationEvaluator

Text Generation • Updated Dec 18, 2024 • 1

answerdotai/ModernBERT-base

Fill-Mask • Updated Jan 15 • 9.54M • 745

answerdotai/ModernBERT-large

Fill-Mask • Updated Jan 15 • 2.2M • 351

LGAI-EXAONE/EXAONE-3.0-7.8B-Instruct

Text Generation • Updated Aug 8, 2024 • 34.2k • 402

Qwen/QwQ-32B-Preview

Text Generation • Updated Jan 12 • 211k • 1.61k

881

QwQ-32B-Preview

🔍

QwQ-32B-Preview

1.36k

Qwen2.5 Coder Artifacts

🐢

Generate code from a description

MixLLM: LLM Quantization with Global Mixed-precision between Output-features and Highly-efficient System Design

Paper • 2412.14590 • Published Dec 19, 2024 • 14

CohereForAI/c4ai-command-r7b-12-2024

Text Generation • Updated 18 days ago • 11.8k • 356

Ensembling Large Language Models with Process Reward-Guided Tree Search for Better Complex Reasoning

Paper • 2412.15797 • Published Dec 20, 2024 • 18

deepseek-ai/DeepSeek-V3-Base

Updated 23 days ago • 140k • 1.56k

deepseek-ai/DeepSeek-V3

Text Generation • Updated 23 days ago • 1.75M • 3.45k

PowerInfer/SmallThinker-3B-Preview

Text Generation • Updated Jan 16 • 112k • 383

nomic-ai/modernbert-embed-base

Qwen/Qwen2-VL-7B-Instruct

Image-Text-to-Text • Updated 10 days ago • 1.59M • 1.12k

XiaoduoAILab/Xmodel_VLM

Text Generation • Updated Jun 7, 2024 • 163 • 12

Snowflake/snowflake-arctic-embed-m-v1.5

diffbot/Llama-3.3-Diffbot-Small-XL-2412

Updated Jan 8 • 164 • 6

nvidia/Cosmos-1.0-Diffusion-14B-Text2World

Updated Jan 10 • 78k • 49

nvidia/Cosmos-1.0-Guardrail

Updated Jan 10 • 4.29k • 43

nvidia/Cosmos-1.0-Autoregressive-13B-Video2World

Updated 8 days ago • 610 • 31

nvidia/Cosmos-1.0-Tokenizer-DV8x16x16

Updated Jan 12 • 1.13k • 14

nvidia/Cosmos-1.0-Diffusion-7B-Text2World

Updated Jan 10 • 228k • 203

microsoft/phi-4

Text Generation • Updated 12 days ago • 592k • 1.73k

unsloth/DeepSeek-V3-GGUF

Updated Jan 12 • 170k • 116

ds4sd/docling-models

Updated Dec 10, 2024 • 143k • 79

mistralai/Mistral-Nemo-Instruct-2407

Text Generation • Updated Nov 6, 2024 • 214k • 1.46k

meta-llama/Llama-3.2-3B-Instruct

Text Generation • Updated Oct 24, 2024 • 1.53M • 1.02k

meta-llama/Llama-2-7b-chat-hf

Text Generation • Updated Apr 17, 2024 • 1.3M • 4.22k

lmstudio-community/phi-4-GGUF

Text Generation • Updated Jan 11 • 54.7k • 42

HIT-TMG/KaLM-embedding-multilingual-mini-instruct-v1

HIT-TMG/KaLM-embedding-multilingual-mini-instruct-v1.5

HIT-TMG/KaLM-embedding-multilingual-max-instruct-v1

Updated Jan 7 • 8

openbmb/MiniCPM-o-2_6

Any-to-Any • Updated 5 days ago • 536k • 955

internlm/internlm3-8b-instruct

Text Generation • Updated 5 days ago • 36.6k • 196

jinaai/ReaderLM-v2

Text Generation • Updated 10 days ago • 34k • 503

NovaSky-AI/Sky-T1-32B-Preview

Text Generation • Updated Jan 13 • 16.3k • 531

flowaicom/Flow-Judge-v0.1

Text Generation • Updated Oct 7, 2024 • 620 • 53

Foundations of Large Language Models

Paper • 2501.09223 • Published Jan 16 • 2

MiniMaxAI/MiniMax-Text-01

Text Generation • Updated about 1 month ago • 5.99k • 521

unsloth/phi-4

Text Generation • Updated Jan 13 • 22.4k • 70

mistralai/Codestral-22B-v0.1

Text Generation • Updated Jul 31, 2024 • 21.8k • 1.21k

deepseek-ai/DeepSeek-R1

Text Generation • Updated 7 days ago • 3.98M • 9.07k

100

MiniMaxText01

💬

Communicate with a multimodal chatbot

Alibaba-NLP/gte-reranker-modernbert-base

onnx-community/Kokoro-82M-ONNX

Text-to-Speech • Updated 9 days ago • 23.3k • 121

unsloth/DeepSeek-R1-Distill-Qwen-32B-GGUF

Updated 22 days ago • 523k • 109

Qwen/Qwen2.5-14B-Instruct-1M

Text Generation • Updated 18 days ago • 28.4k • 242

Qwen/Qwen2.5-7B-Instruct-1M

Text Generation • Updated 18 days ago • 62.7k • 217

deepseek-ai/DeepSeek-R1-Distill-Qwen-32B

Text Generation • Updated 7 days ago • 785k • 1.07k

unsloth/DeepSeek-R1-GGUF

Text Generation • Updated 3 days ago • 2.2M • 807

meta-llama/Llama-3.2-3B

Text Generation • Updated Oct 24, 2024 • 299k • 504

mistralai/Mistral-Small-24B-Instruct-2501

Text Generation • Updated 14 days ago • 640k • 759

allenai/Llama-3.1-Tulu-3-405B

Text Generation • Updated 6 days ago • 1.29k • 97

mistralai/Mistral-Small-24B-Base-2501

Text Generation • Updated 17 days ago • 16.6k • 216

allenai/Llama-3.1-Tulu-3-8B-RM

Text Classification • Updated 17 days ago • 6.67k • 16

allenai/Llama-3.1-Tulu-3-8B

Text Generation • Updated 3 days ago • 15k • 149

TIGER-Lab/Qwen2.5-Math-7B-CFT

Text Generation • Updated 14 days ago • 92 • 6

TIGER-Lab/Qwen2.5-32B-Instruct-CFT

Text Generation • Updated 14 days ago • 130 • 5

bartowski/DeepSeek-R1-Distill-Qwen-32B-GGUF

Text Generation • Updated 25 days ago • 2.85M • 210

unsloth/DeepSeek-R1-Distill-Llama-8B-GGUF

Updated 3 days ago • 521k • 229

simplescaling/s1-32B

Text Generation • Updated 5 days ago • 8.47k • 270

simplescaling/step-conditional-control-old

Text Generation • Updated 13 days ago • 209 • 2

open-r1/OpenR1-Math-220k

Viewer • Updated 4 days ago • 450k • 2.43k • 301

simplescaling/s1.1-32B

Text Generation • Updated 4 days ago • 2.55k • 45

agentica-org/DeepScaleR-1.5B-Preview

Updated 5 days ago • 8.91k • 354

NousResearch/DeepHermes-3-Llama-3-8B-Preview-GGUF

Updated 3 days ago • 7.22k • 49

microsoft/OmniParser-v2.0

Image-Text-to-Text • Updated 2 days ago • 230 • 202

FinGPT Forecaster

RWKV-Gradio-2
