Spaces:
Running
Running
Update README.md
Browse files
README.md
CHANGED
@@ -54,7 +54,7 @@ If you want to contact us & join us, you can βοΈ to our team : <opendilab@p
|
|
54 |
| [MuZero](https://arxiv.org/abs/1911.08265) | [β
](https://huggingface.co/OpenDILabCommunity/CartPole-v0-MuZero) | [β
](https://huggingface.co/OpenDILabCommunity/LunarLander-v2-MuZero) | π | [β
](https://huggingface.co/OpenDILabCommunity/Pendulum-v1-MuZero) | [β
](https://huggingface.co/OpenDILabCommunity/PongNoFrameskip-v4-MuZero) | [β
](https://huggingface.co/OpenDILabCommunity/BreakoutNoFrameskip-v4-MuZero) | [β
](https://huggingface.co/OpenDILabCommunity/MsPacmanNoFrameskip-v4-MuZero) | [β
](https://huggingface.co/OpenDILabCommunity/TicTacToe-play-with-bot-MuZero) | | |
|
55 |
| [EfficientZero](https://arxiv.org/abs/2111.00210) | [β
](https://huggingface.co/OpenDILabCommunity/CartPole-v0-EfficientZero) | [β
](https://huggingface.co/OpenDILabCommunity/LunarLander-v2-EfficientZero) | π | [β
](https://huggingface.co/OpenDILabCommunity/Pendulum-v1-EfficientZero) | [β
](https://huggingface.co/OpenDILabCommunity/PongNoFrameskip-v4-EfficientZero) | | [β
](https://huggingface.co/OpenDILabCommunity/MsPacmanNoFrameskip-v4-EfficientZero) | π | | |
|
56 |
| [Gumbel MuZero](https://openreview.net/pdf?id=bERaNdoegnO&) | [β
](https://huggingface.co/OpenDILabCommunity/CartPole-v0-GumbelMuZero) | | π | | | π | π | | | |
|
57 |
-
| [Sampled EfficientZero](https://arxiv.org/abs/2104.06303) | [β
](https://huggingface.co/OpenDILabCommunity/CartPole-v0-SampledEfficientZero) | | | [β
](https://huggingface.co/OpenDILabCommunity/Pendulum-v1-SampledEfficientZero) |
|
58 |
| [Stochastic MuZero](https://openreview.net/pdf?id=X6D9bAHhBQ1) | π | π | π | π | π | π | π | π | | |
|
59 |
|
60 |
</details>
|
|
|
54 |
| [MuZero](https://arxiv.org/abs/1911.08265) | [β
](https://huggingface.co/OpenDILabCommunity/CartPole-v0-MuZero) | [β
](https://huggingface.co/OpenDILabCommunity/LunarLander-v2-MuZero) | π | [β
](https://huggingface.co/OpenDILabCommunity/Pendulum-v1-MuZero) | [β
](https://huggingface.co/OpenDILabCommunity/PongNoFrameskip-v4-MuZero) | [β
](https://huggingface.co/OpenDILabCommunity/BreakoutNoFrameskip-v4-MuZero) | [β
](https://huggingface.co/OpenDILabCommunity/MsPacmanNoFrameskip-v4-MuZero) | [β
](https://huggingface.co/OpenDILabCommunity/TicTacToe-play-with-bot-MuZero) | | |
|
55 |
| [EfficientZero](https://arxiv.org/abs/2111.00210) | [β
](https://huggingface.co/OpenDILabCommunity/CartPole-v0-EfficientZero) | [β
](https://huggingface.co/OpenDILabCommunity/LunarLander-v2-EfficientZero) | π | [β
](https://huggingface.co/OpenDILabCommunity/Pendulum-v1-EfficientZero) | [β
](https://huggingface.co/OpenDILabCommunity/PongNoFrameskip-v4-EfficientZero) | | [β
](https://huggingface.co/OpenDILabCommunity/MsPacmanNoFrameskip-v4-EfficientZero) | π | | |
|
56 |
| [Gumbel MuZero](https://openreview.net/pdf?id=bERaNdoegnO&) | [β
](https://huggingface.co/OpenDILabCommunity/CartPole-v0-GumbelMuZero) | | π | | | π | π | | | |
|
57 |
+
| [Sampled EfficientZero](https://arxiv.org/abs/2104.06303) | [β
](https://huggingface.co/OpenDILabCommunity/CartPole-v0-SampledEfficientZero) | | | [β
](https://huggingface.co/OpenDILabCommunity/Pendulum-v1-SampledEfficientZero) | [β
](https://huggingface.co/OpenDILabCommunity/PongNoFrameskip-v4-SampledEfficientZero) | | [β
](https://huggingface.co/OpenDILabCommunity/MsPacmanNoFrameskip-v4-SampledEfficientZero) | π | | |
|
58 |
| [Stochastic MuZero](https://openreview.net/pdf?id=X6D9bAHhBQ1) | π | π | π | π | π | π | π | π | | |
|
59 |
|
60 |
</details>
|