Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Inference Providers
Select all
fal
Cerebras
Replicate
Nebius AI Studio
Fireworks
Novita
Hyperbolic
SambaNova
Together AI
HF Inference API
Misc
Reset Misc
GRPO
Inference Endpoints
text-generation-inference
AutoTrain Compatible
Merge
4-bit precision
custom_code
Misc with no match
Eval Results
8-bit precision
text-embeddings-inference
Carbon Emissions
Mixture of Experts
Apply filters
Models
67
Full-text search
Edit filters
Sort: Trending
Active filters:
GRPO
Clear all
mradermacher/Text2Graph-R1-Qwen2.5-0.5b-i1-GGUF
Updated
Feb 9
•
93
alpha-ai/Deep-Reason-SMALL-V0-GGUF
Updated
16 days ago
•
255
•
1
alpha-ai/Deep-Reason-SMALL-V0
Text Generation
•
Updated
16 days ago
•
35
•
2
alpha-ai/qwen2.5-reason-thought-lite-GGUF
Updated
16 days ago
•
301
alpha-ai/qwen2.5-reason-thought-lite
Text Generation
•
Updated
16 days ago
•
33
alpha-ai/llama-3.2-3B-Reason-Reflect-Lite
Text Generation
•
Updated
16 days ago
•
46
Daemontatox/Cogito-R1
Text Generation
•
Updated
23 days ago
•
307
•
5
mradermacher/Cogito-R1-GGUF
Updated
29 days ago
•
904
accuracy-maker/Llama-3.2-1B-GRPO-gsm8k
Text Generation
•
Updated
about 1 month ago
•
16
mradermacher/Cogito-R1-i1-GGUF
Updated
29 days ago
•
1.85k
alpha-ai/Reason-With-Choice-3B-GGUF
Updated
16 days ago
•
616
alpha-ai/Reason-With-Choice-3B
Text Generation
•
Updated
16 days ago
•
66
mradermacher/Reason-With-Choice-3B-GGUF
Updated
25 days ago
•
374
Daemontatox/PathFinderAI-S1
Text Generation
•
Updated
23 days ago
•
258
mradermacher/SmolLM2_135M_Grpo_Checkpoint-GGUF
Updated
22 days ago
•
283
mradermacher/SmolLM2_135M_Grpo_Gsm8k-GGUF
Updated
23 days ago
•
261
mradermacher/SmolLM2_135M_Grpo_Gsm8k-i1-GGUF
Updated
23 days ago
•
486
mradermacher/PathFinderAI-S1-GGUF
Updated
22 days ago
•
704
TimeLordRaps/PathFinderAI-S1-Q4_K_M-GGUF
Text Generation
•
Updated
23 days ago
•
38
mradermacher/SmolLM2_135M_Grpo_Checkpoint-i1-GGUF
Updated
22 days ago
•
464
mradermacher/PathFinderAI-S1-i1-GGUF
Updated
22 days ago
•
1.05k
Rivaidan/Captain-Eris_Violet-GRPO-v0.420-Q8_0-GGUF
Updated
18 days ago
•
32
nharshavardhana/SmolGRPO-135M
Text Generation
•
Updated
10 days ago
•
10
TheMelonGod/Captain-Eris_Violet-GRPO-v0.420-exl2
Text Generation
•
Updated
4 days ago
•
100
Lingyue1/SmolGRPO-135M
Text Generation
•
Updated
9 days ago
•
3
t2190/SmolGRPO-135M
Text Generation
•
Updated
8 days ago
•
23
t2190/GRPO_1
Text Generation
•
Updated
1 day ago
•
11
kaweizhenpi/SmolGRPO-135M
Text Generation
•
Updated
7 days ago
•
8
Shumatsurontek/SmolGRPO-135M
Text Generation
•
Updated
4 days ago
alperenyildiz/SmolGRPO-135M
Text Generation
•
Updated
3 days ago
•
19
Previous
1
2
3
Next