Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Inference Providers
Select all
Hyperbolic
Nebius AI Studio
Together AI
fal
Fireworks
Replicate
Novita
SambaNova
HF Inference API
Misc
Reset Misc
GenerativeRL
Eval Results
Misc with no match
Inference Endpoints
AutoTrain Compatible
text-generation-inference
Merge
4-bit precision
8-bit precision
custom_code
text-embeddings-inference
Carbon Emissions
Mixture of Experts
Apply filters
Models
1
Full-text search
Edit filters
Sort: Trending
Active filters:
GenerativeRL
Clear all
OpenDILabCommunity/LunarLanderContinuous-v2-QGPO
Reinforcement Learning
•
Updated
Dec 4, 2024