Edit model card

PPO Agent playing SpaceInvadersNoFrameskip-v4

This is a trained model of a PPO agent playing SpaceInvadersNoFrameskip-v4 using the stable-baselines3 library.

Evaluation Results

mean_reward=960.00 +/- 483.4252786108728

Usage (with Stable-baselines3)

TODO: Add your code

Downloads last month
0
Hosted inference API

Unable to determine this model’s pipeline type. Check the docs .