Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
Edit Models filters
Tasks
1
Libraries
Datasets
Languages
Licenses
Other
Reset Tasks
Multimodal
Image-Text-to-Text
Visual Question Answering
Document Question Answering
Computer Vision
Depth Estimation
Image Classification
Object Detection
Image Segmentation
Text-to-Image
Image-to-Text
Image-to-Image
Image-to-Video
Unconditional Image Generation
Video Classification
Text-to-Video
Zero-Shot Image Classification
Mask Generation
Zero-Shot Object Detection
Text-to-3D
Image-to-3D
Image Feature Extraction
Natural Language Processing
Text Classification
Token Classification
Table Question Answering
Question Answering
Zero-Shot Classification
Translation
Summarization
Feature Extraction
Text Generation
Text2Text Generation
Fill-Mask
Sentence Similarity
Audio
Text-to-Speech
Text-to-Audio
Automatic Speech Recognition
Audio-to-Audio
Audio Classification
Voice Activity Detection
Tabular
Tabular Classification
Tabular Regression
Time Series Forecasting
Reinforcement Learning
Reinforcement Learning
Robotics
Other
Graph Machine Learning
Apply filters
Models
42,349
Full-text search
Edit filters
Sort: Trending
Active filters:
reinforcement-learning
Clear all
PKU-Alignment/beaver-7b-v2.0
Reinforcement Learning
•
Updated
20 days ago
•
262
DaniElAbrazos/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated
Apr 19
PKU-Alignment/beaver-7b-v2.0-reward
Reinforcement Learning
•
Updated
Apr 20
•
7
PKU-Alignment/beaver-7b-v2.0-cost
Reinforcement Learning
•
Updated
Apr 20
•
4
PKU-Alignment/beaver-7b-v3.0
Reinforcement Learning
•
Updated
20 days ago
•
174
PKU-Alignment/beaver-7b-v3.0-reward
Reinforcement Learning
•
Updated
Apr 20
•
2.65k
PKU-Alignment/beaver-7b-v3.0-cost
Reinforcement Learning
•
Updated
Apr 20
•
2.05k
DaniElAbrazos/Taxiv3
Reinforcement Learning
•
Updated
Apr 19
PKU-Alignment/beaver-7b-unified-reward
Reinforcement Learning
•
Updated
Apr 20
•
1.16k
PKU-Alignment/beaver-7b-unified-cost
Reinforcement Learning
•
Updated
Apr 20
•
395
BWangila/a2c-PandaReachDense-v3
Reinforcement Learning
•
Updated
Apr 19
•
1
moczard/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated
Apr 19
moczard/q-Taxi-v3
Reinforcement Learning
•
Updated
Apr 19
Kommunarus/Reinforce-pixelcopter
Reinforcement Learning
•
Updated
Apr 21
rahil1206/ppo-LunarLander-v2
Reinforcement Learning
•
Updated
Apr 19
•
4
conlan/ppo-SnowballTarget
Reinforcement Learning
•
Updated
Apr 19
•
15
CrispyJLoHalo/dqn-CartPole-v1
Reinforcement Learning
•
Updated
Apr 19
•
1
eulpicard/Reinforce-Pixelcopter-PLE-v1
Reinforcement Learning
•
Updated
Apr 19
CrispyJLoHalo/dqn-CartPole-v1_2
Reinforcement Learning
•
Updated
Apr 19
•
1
joen2010/rl_course_vizdoom_health_gathering_supreme
Reinforcement Learning
•
Updated
Apr 19
ahGadji/Reinforce-0
Reinforcement Learning
•
Updated
Apr 19
ahGadji/Reinforce-CartPole-v1
Reinforcement Learning
•
Updated
Apr 20
minindu-liya99/ppo-SnowballTarget
Reinforcement Learning
•
Updated
Apr 19
•
31
damienbenveniste/mistral-ppo
Reinforcement Learning
•
Updated
Apr 25
conlan/ML-Agents-Pyramids
Reinforcement Learning
•
Updated
Apr 20
•
12
izaznov/qrdqn-SpaceInvadersNoFrameskip-v4
Reinforcement Learning
•
Updated
Apr 20
•
5
stuvx/ppo-LunarLander-v2
Reinforcement Learning
•
Updated
Apr 20
•
1
yunkimmy/ppo-LunarLander-v2
Reinforcement Learning
•
Updated
Apr 20
•
1
dallonf/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated
Apr 20
dallonf/Q-Taxi-v3
Reinforcement Learning
•
Updated
Apr 20
Previous
1
...
1,348
1,349
1,350
1,351
1,352
...
1,412
Next