Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Reset Other
custom-implementation
Eval Results
Inference Endpoints
Other with no match
AutoTrain Compatible
text-generation-inference
Merge
4-bit precision
custom_code
8-bit precision
Carbon Emissions
Mixture of Experts
Apply filters
Models
16,934
Full-text search
Edit filters
Sort: Most downloads
Active filters:
custom-implementation
Clear all
izaznov/Reinforce-policy_pixel_copter
Reinforcement Learning
•
Updated
19 days ago
bhutchings/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated
23 days ago
bhutchings/q-Taxi-v3
Reinforcement Learning
•
Updated
23 days ago
filodoxia/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated
23 days ago
filodoxia/Taxi-v3
Reinforcement Learning
•
Updated
23 days ago
ProrabVasili/Reinforce-CartPole-v1
Reinforcement Learning
•
Updated
23 days ago
conlan/ppo-LunarLander-v3
Reinforcement Learning
•
Updated
23 days ago
phoenixaiden33/Reinforce-00
Reinforcement Learning
•
Updated
23 days ago
phoenixaiden33/Reinforce-Pixelcopter-PLE-v0
Reinforcement Learning
•
Updated
23 days ago
amine-01/Pixelcopter
Reinforcement Learning
•
Updated
23 days ago
jeliasherrero/Reinforce-Policy-Gradient-cartpole-v1
Reinforcement Learning
•
Updated
23 days ago
mbartholet/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated
23 days ago
nvasko/Reinforce-pixelcopter-moe-1
Reinforcement Learning
•
Updated
21 days ago
MLIsaac/ppo_from_scratch-LunarLander-v2
Reinforcement Learning
•
Updated
23 days ago
jeliasherrero/Reinforce-Policy-Gradient-PixelCopter-v1
Reinforcement Learning
•
Updated
23 days ago
rahil1206/q-Taxi-v3
Reinforcement Learning
•
Updated
23 days ago
mezzy33/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated
23 days ago
loudinthecloud/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated
23 days ago
loudinthecloud/q-Taxi-v3
Reinforcement Learning
•
Updated
23 days ago
bhutchings/Reinforce-cartpole
Reinforcement Learning
•
Updated
22 days ago
Licwit/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated
22 days ago
Licwit/Taximodel
Reinforcement Learning
•
Updated
22 days ago
ImenMasmoudiEm/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated
22 days ago
ImenMasmoudiEm/Taxi-v3
Reinforcement Learning
•
Updated
22 days ago
phoenixaiden33/PPO-LunarLander-v2
Reinforcement Learning
•
Updated
22 days ago
mbartholet/taxi_qlearn
Reinforcement Learning
•
Updated
22 days ago
DaniElAbrazos/Reinforce-Cartpole
Reinforcement Learning
•
Updated
22 days ago
dhajnes/Reinforce-cartpole_32_1e-3
Reinforcement Learning
•
Updated
22 days ago
DaniElAbrazos/Reinforce-pixelcopter
Reinforcement Learning
•
Updated
22 days ago
cmattoon/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated
22 days ago
Previous
1
...
549
550
551
552
553
...
565
Next