Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Reset Other
custom-implementation
Eval Results
Inference Endpoints
Other with no match
AutoTrain Compatible
text-generation-inference
4-bit precision
Merge
text-embeddings-inference
custom_code
8-bit precision
Carbon Emissions
Mixture of Experts
Apply filters
Models
18,129
Full-text search
Edit filters
Sort: Trending
Active filters:
custom-implementation
Clear all
kmpartner/Reinforce-Pixelcopter-PLE-v0
Reinforcement Learning
•
Updated
May 13
tomTs/ppo-Pixelcopter-PLE-v0
Reinforcement Learning
•
Updated
May 13
AlkQ/Reinforce-CartPole-v1
Reinforcement Learning
•
Updated
May 13
Unclad3610/ppo-scratch-LunarLander-v2
Reinforcement Learning
•
Updated
May 13
Whiskas0663/taxi-v3
Reinforcement Learning
•
Updated
May 13
rgny/unit8p1
Reinforcement Learning
•
Updated
May 13
mlsby/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated
May 13
mlsby/q-drl-taxi
Reinforcement Learning
•
Updated
May 13
Hevagog/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated
May 13
Hevagog/q-learning-eps-greedy
Reinforcement Learning
•
Updated
May 13
evonc/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated
May 13
evonc/taxu-v3-unit2
Reinforcement Learning
•
Updated
May 13
gruhit-patel/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated
May 13
ulasfiliz954/ppo-LunarLander-v1
Reinforcement Learning
•
Updated
May 13
gruhit-patel/taxi-v3
Reinforcement Learning
•
Updated
May 13
gasperjw/Reinforce-Cartpole-Policy
Reinforcement Learning
•
Updated
May 13
rvukasin/Reinforce-CartPole-v1
Reinforcement Learning
•
Updated
May 13
rvukasin/Reinforce-CartPole-v1-Local
Reinforcement Learning
•
Updated
May 14
RomBor/Q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated
May 14
RomBor/Q-Taxis-v3
Reinforcement Learning
•
Updated
May 14
zee0110/CartPole-v1
Reinforcement Learning
•
Updated
May 14
Tyhcbs/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated
May 14
Tyhcbs/taxi_try
Reinforcement Learning
•
Updated
May 14
AlkQ/Reinforce-PixelCopter
Reinforcement Learning
•
Updated
May 20
pdx97/Lunarlander-v2
Reinforcement Learning
•
Updated
May 14
pdx97/Lunarlander-v2_Unit8_part1
Reinforcement Learning
•
Updated
May 14
•
1
davideaguglia/ppo-LunarLander-v2-fromscratch
Reinforcement Learning
•
Updated
May 14
liqiu0202/Reinforce-Pixelcopter
Reinforcement Learning
•
Updated
May 14
rvukasin/Reinforce-Pixelcopter-PLE-v0-local
Reinforcement Learning
•
Updated
May 14
Fetanos/Reinforce-CartPole-v1
Reinforcement Learning
•
Updated
May 14
Previous
1
...
561
562
563
564
565
...
605
Next