lunar-lander/mlp-1000000/policy.optimizer.pth

Commit History

first trained model from deep rl course unit 1
e1f5387

jackson-lucas committed on