Quentin Gallouédec
Initial commit
67e0be8
!!python/object/apply:collections.OrderedDict
- - - gamma
- 0.9999
- - learning_starts
- 10000
- - n_timesteps
- 1000000.0
- - policy
- MlpPolicy