Hugging Face
Models: 41,963
Sort: Most downloads
Active filters: reinforcement-learning
paularusti78/q-FrozenLake-v1-4x4-noSlippery • Reinforcement Learning • Updated Apr 6
paularusti78/tax1-v3-1 • Reinforcement Learning • Updated Apr 6
kelvinho8/q-FrozenLake-v1-4x4-noSlippery-utopia • Reinforcement Learning • Updated Apr 6
kelvinho8/q-Taxi-v3-utopia • Reinforcement Learning • Updated Apr 6
osman93/q-FrozenLake-v1-8x8-noSlippery • Reinforcement Learning • Updated Apr 6
osman93/q-FrozenLake-v1-8x8 • Reinforcement Learning • Updated Apr 6
Pongsathorn/Reinforce • Reinforcement Learning • Updated Apr 6
joslinthomas/q-FrozenLake-v1-4x4-noSlippery • Reinforcement Learning • Updated Apr 6
trsdimi/a2c-PandaReachDense-v3 • Reinforcement Learning • Updated Apr 6
NicolasYn/vizdoom_health_gathering_supreme • Reinforcement Learning • Updated Apr 6
Pongsathorn/PixelCopter • Reinforcement Learning • Updated Apr 6
Gonke/ppo-LunarLander-v2-rewritten • Reinforcement Learning • Updated Apr 6
joslinthomas/q-Taxi-v3 • Reinforcement Learning • Updated Apr 6
qgallouedec/Hopper-v4-ppo_continuous_action-seed1 • Reinforcement Learning • Updated Apr 6
qgallouedec/MsPacmanNoFrameskip-v4-dqn_atari-seed1 • Reinforcement Learning • Updated Apr 6
NicolasYn/ppo-sf-LunarLander-v2 • Reinforcement Learning • Updated Apr 7
basil-ahmad/Reinforce-cart-pole-v2 • Reinforcement Learning • Updated Apr 7
basil-ahmad/Reinforce-Pixelcopter-PLE-v0 • Reinforcement Learning • Updated Apr 7
Pongsathorn/a2c-PandaReachDense-v3 • Reinforcement Learning • Updated Apr 7
bunnyTech/rl_course_vizdoom_health_gathering_supreme • Reinforcement Learning • Updated Apr 7
LLParallax/sf_finetuning_forgetting_human_monk • Reinforcement Learning • Updated Apr 7
mrbesher/Reinforce-CartPole-v1 • Reinforcement Learning • Updated Apr 7
infinitix/CartPole-v1 • Reinforcement Learning • Updated Apr 7
infinitix/Pixelcopter-PLE-v0 • Reinforcement Learning • Updated Apr 7
mrbesher/Reinforce-Pixelcopter-PLE-v0 • Reinforcement Learning • Updated Apr 7
feysahin/Reinforce-PixelCopter • Reinforcement Learning • Updated Apr 7
Sassy21/q-FrozenLake-v1-4x4-noSlippery • Reinforcement Learning • Updated Apr 7
ADG-2353/rl_course_vizdoom_health_gathering_supreme • Reinforcement Learning • Updated Apr 7
Sassy21/Taxi-v3 • Reinforcement Learning • Updated Apr 7
davidkh/a2c-PandaReachDense-v3 • Reinforcement Learning • Updated Apr 7