Hugging Face
Models: 17,365
Sort: Trending
Active filters: custom-implementation
ashwanth18/taxi • Reinforcement Learning • Updated Apr 5
erfan1380/taxi-v3 • Reinforcement Learning • Updated Apr 5
Kommunarus/q-FrozenLake-v1-4x4-noSlippery • Reinforcement Learning • Updated Apr 5
Kommunarus/q-Taxi • Reinforcement Learning • Updated Apr 6
ethan-lam/Reinforce-cartpolev2 • Reinforcement Learning • Updated Apr 5
gyaan/Reinforce-CartPole-v1 • Reinforcement Learning • Updated Apr 6
dyu200206/q-FrozenLake-v1-4x4-noSlippery • Reinforcement Learning • Updated Apr 6
dyu200206/q-FrozenLake-v1-8x8-noSlippery • Reinforcement Learning • Updated Apr 6
ztchir/q-FrozenLake-v1-4x4-noSlippery • Reinforcement Learning • Updated Apr 6
ztchir/q-Taxi-v3 • Reinforcement Learning • Updated Apr 6
dyu200206/q-FrozenLake-v1-8x8-Slippery • Reinforcement Learning • Updated Apr 6
kelvinho8/q-FrozenLake-v1-4x4-noSlippery • Reinforcement Learning • Updated Apr 6
kelvinho8/q-Taxi-v3 • Reinforcement Learning • Updated Apr 6
paularusti78/q-FrozenLake-v1-4x4-noSlippery • Reinforcement Learning • Updated Apr 6
paularusti78/tax1-v3-1 • Reinforcement Learning • Updated Apr 6
kelvinho8/q-FrozenLake-v1-4x4-noSlippery-utopia • Reinforcement Learning • Updated Apr 6
kelvinho8/q-Taxi-v3-utopia • Reinforcement Learning • Updated Apr 6
osman93/q-FrozenLake-v1-8x8-noSlippery • Reinforcement Learning • Updated Apr 6
osman93/q-FrozenLake-v1-8x8 • Reinforcement Learning • Updated Apr 6
Pongsathorn/Reinforce • Reinforcement Learning • Updated Apr 6
joslinthomas/q-FrozenLake-v1-4x4-noSlippery • Reinforcement Learning • Updated Apr 6
Pongsathorn/PixelCopter • Reinforcement Learning • Updated Apr 6
Gonke/ppo-LunarLander-v2-rewritten • Reinforcement Learning • Updated Apr 6
joslinthomas/q-Taxi-v3 • Reinforcement Learning • Updated Apr 6
qgallouedec/Hopper-v4-ppo_continuous_action-seed1 • Reinforcement Learning • Updated Apr 6
qgallouedec/MsPacmanNoFrameskip-v4-dqn_atari-seed1 • Reinforcement Learning • Updated Apr 6
basil-ahmad/Reinforce-cart-pole-v2 • Reinforcement Learning • Updated Apr 7
basil-ahmad/Reinforce-Pixelcopter-PLE-v0 • Reinforcement Learning • Updated Apr 7
mrbesher/Reinforce-CartPole-v1 • Reinforcement Learning • Updated Apr 7
infinitix/CartPole-v1 • Reinforcement Learning • Updated Apr 7