LinasKo/RL-lander

Reinforcement Learning
stable-baselines3
LunarLander-v2
deep-reinforcement-learning
Eval Results
RL-lander / a2c-1e6
  • 1 contributor
History: 1 commit
LinasKo
A2C trained on LunarLander-v2 for 1e6 timesteps
9958b53 almost 3 years ago
  • _stable_baselines3_version
    5 Bytes
    A2C trained on LunarLander-v2 for 1e6 timesteps almost 3 years ago
  • data
    15.1 kB
    A2C trained on LunarLander-v2 for 1e6 timesteps almost 3 years ago
  • policy.optimizer.pth
    42.6 kB
    A2C trained on LunarLander-v2 for 1e6 timesteps almost 3 years ago
  • policy.pth
    43.2 kB
    A2C trained on LunarLander-v2 for 1e6 timesteps almost 3 years ago
  • pytorch_variables.pth
    431 Bytes
    Pickle imports: no problematic imports detected
    A2C trained on LunarLander-v2 for 1e6 timesteps almost 3 years ago
  • system_info.txt
    201 Bytes
    A2C trained on LunarLander-v2 for 1e6 timesteps almost 3 years ago