Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Reset Other
custom-implementation
Eval Results
Inference Endpoints
Other with no match
AutoTrain Compatible
text-generation-inference
Merge
4-bit precision
custom_code
8-bit precision
Carbon Emissions
Mixture of Experts
Apply filters
Models
17,186
Full-text search
Edit filters
Sort: Trending
Active filters:
custom-implementation
Clear all
davideaguglia/Taxi-v3
Reinforcement Learning
•
Updated
28 days ago
Max87152/PixelCopter
Reinforcement Learning
•
Updated
28 days ago
ulasfiliz954/Reinforce-1
Reinforcement Learning
•
Updated
28 days ago
ulasfiliz954/Reinforce-2
Reinforcement Learning
•
Updated
28 days ago
sddgs/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated
28 days ago
tomTs/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated
28 days ago
Blues-Monster/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated
28 days ago
Blues-Monster/q-Taxi-v3
Reinforcement Learning
•
Updated
28 days ago
AlkQ/Taxi-v3
Reinforcement Learning
•
Updated
28 days ago
joosma/Reinforce-pixelcopter2
Reinforcement Learning
•
Updated
28 days ago
pdejong/ref2-Pixelcopter-PLE-v0
Reinforcement Learning
•
Updated
28 days ago
BWangila/ppo-CartPole-v1
Reinforcement Learning
•
Updated
28 days ago
BWangila/ppo-LunarLander-v2
Reinforcement Learning
•
Updated
27 days ago
stuvx/Reinforce-pixelcopter-02
Reinforcement Learning
•
Updated
28 days ago
MorganWKen/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated
28 days ago
MorganWKen/q-Taxi-v3
Reinforcement Learning
•
Updated
28 days ago
RobertoFuentesRisco/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated
27 days ago
Lingrui1/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated
27 days ago
Lingrui1/taxi
Reinforcement Learning
•
Updated
27 days ago
pietroorlandi/ppo-CartPole-from-scratch
Reinforcement Learning
•
Updated
27 days ago
elisamammi/ppo-CartPole-v1
Reinforcement Learning
•
Updated
27 days ago
pietroorlandi/ppo-LunarLander-from-scratch
Reinforcement Learning
•
Updated
27 days ago
elisamammi/ppo-LunarLander_v2
Reinforcement Learning
•
Updated
27 days ago
tomTs/Taxi-v3
Reinforcement Learning
•
Updated
27 days ago
suryaanthony/Taxi-v3
Reinforcement Learning
•
Updated
27 days ago
elisamammi/CartPoleReinforce
Reinforcement Learning
•
Updated
27 days ago
jphyun2019/Reinforce-1
Reinforcement Learning
•
Updated
27 days ago
pietroorlandi/reinforce-cartpole
Reinforcement Learning
•
Updated
27 days ago
erikbritto/Reinforce-PixelCopter9
Reinforcement Learning
•
Updated
27 days ago
RobertoFuentesRisco/q-Taxi-v3
Reinforcement Learning
•
Updated
27 days ago
Previous
1
...
558
559
560
561
562
...
573
Next