Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Inference Providers
Select all
fal
Hyperbolic
Nebius AI Studio
Together AI
Novita
Replicate
SambaNova
Fireworks
HF Inference API
Misc
Reset Misc
deep-rl-course
Eval Results
Inference Endpoints
Misc with no match
AutoTrain Compatible
text-generation-inference
Merge
4-bit precision
8-bit precision
custom_code
text-embeddings-inference
Carbon Emissions
Mixture of Experts
Apply filters
Models
1,586
Full-text search
Edit filters
Sort: Trending
Active filters:
deep-rl-course
Clear all
Zionamsalem/LLV2
Reinforcement Learning
•
Updated
8 days ago
AriYusa/ppo-implementation
Reinforcement Learning
•
Updated
8 days ago
volfy/huggingface_rl_unit8_ppo-CartPole-v1
Reinforcement Learning
•
Updated
8 days ago
volfy/huggingface_rl_unit8_ppo-LunarLander-v3
Reinforcement Learning
•
Updated
8 days ago
MartinRedWhite/unit8-LunarLander-v2-unit8
Reinforcement Learning
•
Updated
8 days ago
volfy/huggingface_rl_unit8_ppo-LunarLander-v2
Reinforcement Learning
•
Updated
8 days ago
Vanheart/ppoCRL-LunarLander-v2
Reinforcement Learning
•
Updated
8 days ago
JuanjoGT13/ppo-CartPole-v1
Reinforcement Learning
•
Updated
7 days ago
amostof/ppoScratch-LunarLander-v2
Reinforcement Learning
•
Updated
about 7 hours ago
twofacejr/ppo-CartPole-v1
Reinforcement Learning
•
Updated
4 days ago
vinhdq842/ppo-LunarLander-v2-scratch
Reinforcement Learning
•
Updated
6 days ago
francescosabbarese/ppo-CartPole-v1
Reinforcement Learning
•
Updated
5 days ago
francescosabbarese/ppo-LunarLander-v2-unit8-pt1
Reinforcement Learning
•
Updated
5 days ago
nasnoussi/ppo-CartPole-v1
Reinforcement Learning
•
Updated
1 day ago
baronase/ppo-cleanrl-CartPole-v1
Reinforcement Learning
•
Updated
3 days ago
baronase/ppo-cleanrl-CartPole-v1_2
Reinforcement Learning
•
Updated
3 days ago
baronase/ppo-cleanrl-LunarLander-v2_1
Reinforcement Learning
•
Updated
3 days ago
baronase/ppo-cleanrl-LunarLander-v2_200k
Reinforcement Learning
•
Updated
3 days ago
lucas-palmiro/ppo-LunarLander-v3
Reinforcement Learning
•
Updated
2 days ago
lucas-palmiro/ppo-early-stopping-LunarLander-v3
Reinforcement Learning
•
Updated
2 days ago
sighmon/ppo-cleanrl-LunarLander-v2
Reinforcement Learning
•
Updated
2 days ago
mrinaldi86/ppo-CartPole-v1
Reinforcement Learning
•
Updated
2 days ago
mrinaldi86/ppo-LunarLander-v3
Reinforcement Learning
•
Updated
2 days ago
nasnoussi/ppo-Pixelcopter-v1
Reinforcement Learning
•
Updated
1 day ago
dragovoid/ppo-LunarLander-v2-u8
Reinforcement Learning
•
Updated
about 10 hours ago
amostof/ppoScratchTest-LunarLander-v2
Reinforcement Learning
•
Updated
about 6 hours ago
Previous
1
...
51
52
53
Next