Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Inference Providers
Select all
Novita
fal
Fireworks
Replicate
Together AI
Cerebras
Nebius AI Studio
SambaNova
Cohere
Hyperbolic
HF Inference API
Misc
Reset Misc
GRPO
Inference Endpoints
text-generation-inference
AutoTrain Compatible
Merge
4-bit precision
custom_code
Misc with no match
Eval Results
8-bit precision
text-embeddings-inference
Carbon Emissions
Mixture of Experts
Apply filters
Models
91
Full-text search
Edit filters
Sort: Trending
Active filters:
GRPO
Clear all
Ihor/Text2Graph-R1-Qwen2.5-0.5b
Text Generation
•
Updated
Jan 30
•
334
•
20
mradermacher/Deep-Reason-SMALL-V0-GGUF
Updated
Feb 9
•
43
•
2
mradermacher/Deep-Reason-SMALL-V0-i1-GGUF
Updated
Feb 9
•
33
•
1
alpha-ai/llama-3.2-3B-Reason-Reflect-Lite-GGUF
Updated
Feb 26
•
85
•
1
alpha-ai/Reason-With-Choice-3B-GGUF
Updated
Feb 26
•
105
•
1
mradermacher/Captain-Eris_Violet-GRPO-v0.420-GGUF
Updated
Feb 18
•
254
•
4
mradermacher/Captain-Eris_Violet-GRPO-v0.420-i1-GGUF
Updated
Feb 18
•
998
•
3
Nitrals-Quants/Captain-Eris_Violet-GRPO-v0.420-4bpw-exl2
Text Generation
•
Updated
Feb 19
•
7
•
1
stranger47/SmolLM2-1.7B-Instruct-Lora
Text Generation
•
Updated
Mar 10
•
11
•
1
skyimple/SmolGRPO-135M
Text Generation
•
Updated
Mar 12
•
6
•
1
Jarrodbarnes/Cortex-1-mini
Text Generation
•
Updated
Mar 13
•
7
•
1
NuclearAi/Nuke_X_Gemma3_1B_Reasoner_Testing
Text Generation
•
Updated
25 days ago
•
89
•
2
mradermacher/Nuke_X_Gemma3_1B_Reasoner_Testing-GGUF
Updated
23 days ago
•
355
•
1
mradermacher/Nuke_X_Gemma3_1B_Reasoner_Testing-i1-GGUF
Updated
23 days ago
•
3.52k
•
1
NuclearAi/Nuke_X_Gemma3_1B_Reasoner_v1.0
Text Generation
•
Updated
17 days ago
•
46
•
1
prithivMLmods/Bellatrix-Tiny-1B-R1
Text Generation
•
Updated
Feb 2
•
33
•
1
mradermacher/Bellatrix-Tiny-1B-R1-GGUF
Updated
Feb 3
•
48
mradermacher/Bellatrix-Tiny-1B-R1-i1-GGUF
Updated
Feb 3
•
110
Novaciano/Bellatrix-1B-R1_Erotiquant3_IQ4_XS-GGUF
Text Generation
•
Updated
Feb 3
•
20
Novaciano/Bellatrix-1B-R1_Erotiquant3_Q5_K_M-GGUF
Text Generation
•
Updated
Feb 3
•
34
Triangle104/Bellatrix-Tiny-1B-R1-Q4_K_S-GGUF
Text Generation
•
Updated
Feb 3
•
6
Triangle104/Bellatrix-Tiny-1B-R1-Q4_K_M-GGUF
Text Generation
•
Updated
Feb 3
•
6
Triangle104/Bellatrix-Tiny-1B-R1-Q5_K_S-GGUF
Text Generation
•
Updated
Feb 3
•
10
Triangle104/Bellatrix-Tiny-1B-R1-Q5_K_M-GGUF
Text Generation
•
Updated
Feb 3
•
6
Triangle104/Bellatrix-Tiny-1B-R1-Q6_K-GGUF
Text Generation
•
Updated
Feb 3
•
7
Triangle104/Bellatrix-Tiny-1B-R1-Q8_0-GGUF
Text Generation
•
Updated
Feb 3
•
6
tecosys/Nutaan-RL1
Reinforcement Learning
•
Updated
Feb 7
•
4
mradermacher/Text2Graph-R1-Qwen2.5-0.5b-GGUF
Updated
Feb 9
•
52
mradermacher/Text2Graph-R1-Qwen2.5-0.5b-i1-GGUF
Updated
Feb 9
•
75
alpha-ai/Deep-Reason-SMALL-V0-GGUF
Updated
Feb 26
•
31
•
1
Previous
1
2
3
4
Next