Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Inference Providers
Select all
Fireworks
Nebius AI Studio
fal
Cerebras
Together AI
Cohere
Replicate
Hyperbolic
Novita
SambaNova
HF Inference API
Misc
Reset Misc
Reinforcement Learning
Inference Endpoints
text-generation-inference
Eval Results
Misc with no match
Merge
4-bit precision
custom_code
8-bit precision
text-embeddings-inference
Carbon Emissions
Mixture of Experts
Apply filters
Models
10
Full-text search
Edit filters
Sort: Trending
Active filters:
Reinforcement Learning
Clear all
mrlijun/SMR-R1
Updated
Apr 2
•
8
•
2
HUANG1993/GreedRL-VRP-pretrained-v1
Reinforcement Learning
•
Updated
Apr 26, 2023
•
4
Hawk91/PongNoFrameskip-v4_DQN
Updated
Aug 21, 2023
•
1
ledmands/ALE-Pacman-v5
Reinforcement Learning
•
Updated
Jun 2, 2024
•
50
•
1
Daemontatox/Cogito-R1
Text Generation
•
Updated
Feb 19
•
5
•
5
mradermacher/Cogito-R1-GGUF
Updated
Feb 12
•
388
mradermacher/Cogito-R1-i1-GGUF
Updated
Feb 13
•
477
omreab/SoccerTwos
Updated
Apr 3
•
12
mradermacher/SMR-R1-GGUF
Updated
24 days ago
•
161
mradermacher/SMR-R1-i1-GGUF
Updated
24 days ago
•
269