Spaces:
Running
on
CPU Upgrade
A newer version of the Gradio SDK is available:
5.5.0
The Open RL Leaderboard is a community-driven benchmark for reinforcement learning models.
π How to have your agent evaluated?
The Open RL Leaderboard constantly scans the π€ Hub to detect new models to be evaluated. For your model to be evaluated, it must meet the following criteria.
- The model must be public on the π€ Hub
- The model must contain an
agent.pt
file. - The model must be tagged
reinforcement-learning
- The model must be tagged with the name of the environment you want to evaluate (for example
MountainCar-v0
)
Once your model meets these criteria, it will be automatically evaluated on the Open RL Leaderboard. It usually takes a few minutes for the evaluation to be completed. That's it!
π΅ How are the models evaluated?
The evaluation is done by running the agent on the environment for 50 episodes. You can get the raw evaluation scores in the Leaderboard dataset.
For further information, please refer to the Open RL Leaderboard evaluation script.
The particular case of Atari environments
Atari environments are evaluated on the NoFrameskip-v4
version of the environment. For example, to evaluate an agent on the Pong
environment, you must tag your model with PongNoFrameskip-v4
. The environment is then wrapped to match the standard Atari preprocessing pipeline.
- No-op reset with a maximum of 30 no-ops
- Max and skip with a skip of 4
- Episodic life (although the reported score is for the full episode, not the life)
- Fire reset
- Clip reward (although the reported score is not clipped)
- Resize observation to 84x84
- Grayscale observation
- Frame stack of 4
π Troubleshooting
If you encounter any issue, please open an issue on the Open RL Leaderboard repository.
π Next steps
We are working on adding more environments and metrics to the Open RL Leaderboard. If you have any suggestions, please open an discussion on the Open RL Leaderboard repository.
π Citation
@misc{open-rl-leaderboard,
author = {Quentin GallouΓ©dec and TODO},
title = {Open RL Leaderboard},
year = {2024},
publisher = {Hugging Face},
howpublished = "\url{https://huggingface.co/spaces/open-rl-leaderboard/leaderboard}",
}