Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
sw32-seo
/
ppo-LunarLander_SB_1e6
like
0
Reinforcement Learning
stable-baselines3
LunarLander-v2
deep-reinforcement-learning
Eval Results
Model card
Files
Files and versions
Community
Use this model
ab5207a
ppo-LunarLander_SB_1e6
/
results.json
sw32-seo
PPO training on LunarLander-v2
ab5207a
over 1 year ago
raw
Copy download link
history
blame
165 Bytes
{
"mean_reward"
:
238.52892112531896
,
"std_reward"
:
45.047057629378486
,
"is_deterministic"
:
true
,
"n_eval_episodes"
:
10
,
"eval_datetime"
:
"2023-04-23T22:49:22.569892"
}