Hugging Face
Models
27,540
Sort: Most downloads
Active filters:
deep-reinforcement-learning
sdpkjc/Walker2d-v4-sac_continuous_action-seed2 • Reinforcement Learning • Updated Dec 19, 2023
sdpkjc/Ant-v4-sac_continuous_action-seed2 • Reinforcement Learning • Updated Dec 19, 2023
sdpkjc/HalfCheetah-v4-sac_continuous_action-seed4 • Reinforcement Learning • Updated Dec 19, 2023
gchindemi/a2c-PandaReachDense-v3 • Reinforcement Learning • Updated Dec 19, 2023
OpenDILabCommunity/CartPole-v0-SampledEfficientZero • Reinforcement Learning • Updated Dec 19, 2023
sdpkjc/Walker2d-v4-sac_continuous_action-seed4 • Reinforcement Learning • Updated Dec 19, 2023
sdpkjc/Ant-v4-sac_continuous_action-seed3 • Reinforcement Learning • Updated Dec 19, 2023
sdpkjc/Walker2d-v4-sac_continuous_action-seed3 • Reinforcement Learning • Updated Dec 19, 2023
sdpkjc/Walker2d-v4-sac_continuous_action-seed5 • Reinforcement Learning • Updated Dec 19, 2023
sdpkjc/Ant-v4-sac_continuous_action-seed5 • Reinforcement Learning • Updated Dec 19, 2023
sdpkjc/Humanoid-v4-sac_continuous_action-seed5 • Reinforcement Learning • Updated Dec 19, 2023
sdpkjc/Humanoid-v4-sac_continuous_action-seed4 • Reinforcement Learning • Updated Dec 19, 2023
sdpkjc/Humanoid-v4-sac_continuous_action-seed3 • Reinforcement Learning • Updated Dec 19, 2023
sdpkjc/Ant-v4-sac_continuous_action-seed4 • Reinforcement Learning • Updated Dec 19, 2023
thebrownfrog/ppo-LunarLander-v2 • Reinforcement Learning • Updated Dec 22, 2023
Riad/A2C • Reinforcement Learning • Updated Dec 20, 2023
ahaque12/a2c-PandaReachDense-v3 • Reinforcement Learning • Updated Dec 20, 2023
Riad/A2D • Reinforcement Learning • Updated Dec 20, 2023
Vanheart/ppo-LunarLander-v2 • Reinforcement Learning • Updated Dec 20, 2023
wuwx/ppo-LunarLander-v2-from-scratch • Reinforcement Learning • Updated Dec 20, 2023
OpenDILabCommunity/CartPole-v0-GumbelMuZero • Reinforcement Learning • Updated Dec 20, 2023
OpenDILabCommunity/CartPole-v0-EfficientZero • Reinforcement Learning • Updated Dec 20, 2023
wuwx/rl_course_vizdoom_health_gathering_supreme • Reinforcement Learning • Updated Dec 20, 2023
OpenDILabCommunity/TicTacToe-play-with-bot-MuZero • Reinforcement Learning • Updated Dec 20, 2023
OpenDILabCommunity/TicTacToe-play-with-bot-AlphaZero • Reinforcement Learning • Updated Dec 20, 2023
dude121/a2c-PandaReachDense-v3 • Reinforcement Learning • Updated Dec 20, 2023
hpourmodheji/torch-ppo-LunarLander-v2 • Reinforcement Learning • Updated Jan 25
lutzvdb/ppo-LunarLander-v2 • Reinforcement Learning • Updated Dec 21, 2023
OpenDILabCommunity/Pendulum-v1-MuZero • Reinforcement Learning • Updated Dec 21, 2023
shunnaidder/ppo-LunaLander-v2 • Reinforcement Learning • Updated Dec 21, 2023