mkahari/RL_testing
Reinforcement Learning
Transformers
Taxi-v3
q-learning
custom-implementation
Eval Results
Inference Endpoints
121803f
RL_testing/mk_ppo_lunar/_stable_baselines3_version
mkahari
PPO LunarLander-v2 model
898f026
over 1 year ago
5 Bytes
1.6.2