Quentin Gallouédec
Initial commit
d40877f
!!python/object/apply:collections.OrderedDict
- - - alive_bonus_offset
- 0
- - delta_std
- 0.03
- - learning_rate
- 0.02
- - n_delta
- 32
- - n_envs
- 1
- - n_timesteps
- 75000000.0
- - n_top
- 32
- - normalize
- dict(norm_obs=True, norm_reward=False)
- - policy
- MlpPolicy
- - policy_kwargs
- dict(net_arch=[128, 64])
- - zero_policy
- false