Spaces:

OpenDILabCommunity
/

README

Running

App Files Files Community

zjowowen commited on Dec 18, 2023

Commit

8bbf056

1 Parent(s): f8452cc

Update README.md

Browse files

Files changed (1) hide show

README.md +15 -0

README.md CHANGED Viewed

@@ -43,6 +43,21 @@ If you want to contact us & join us, you can  ✉️  to our team : <opendilab@p
 </details>
 ### Multi-Agent Reinforcement Learning
 <details close>

 </details>
+### Monte Carlo tree search
+<details open>
+<summary>(Click to Collapse)</summary>
+| Algo.\Env.   | [CartPole](https://di-engine-docs.readthedocs.io/en/latest/13_envs/cartpole.html) | [LunarLander](https://di-engine-docs.readthedocs.io/en/latest/13_envs/lunarlander.html) | [LunarLanderContinuous](https://di-engine-docs.readthedocs.io/en/latest/13_envs/lunarlander.html) | [Pendulum](https://di-engine-docs.readthedocs.io/en/latest/13_envs/pendulum.html) | [Pong](https://di-engine-docs.readthedocs.io/en/latest/13_envs/atari.html) | [Breakout](https://di-engine-docs.readthedocs.io/en/latest/13_envs/atari.html) | [MsPacman](https://di-engine-docs.readthedocs.io/en/latest/13_envs/atari.html) | []() | []() | []() |
+| :-------------: | :-------------: | :-------------: | :------------------------: | :------------: | :--------------: | :------------: | :------------------: | :---------: | :---------: | :---------: |
+| [AlphaZero](https://www.science.org/doi/10.1126/science.aar6404) |  |  |  |  |  |  |  |  |  |  |
+| [Sampled AlphaZero]() |  |  |  |  |  |  |  |  |  |  |
+| [Muzero](https://arxiv.org/abs/1911.08265) | [✅](https://huggingface.co/OpenDILabCommunity/CartPole-v0-MuZero) |  |  |  | [✅](https://huggingface.co/OpenDILabCommunity/PongNoFrameskip-v4-MuZero) |  | [✅](https://huggingface.co/OpenDILabCommunity/MsPacmanNoFrameskip-v4-MuZero) |  |  |  |
+| [EfficientZero](https://arxiv.org/abs/2111.00210) |  |  |  |  |  |  |  |  |  |  |
+| [Gumbel MuZero](https://openreview.net/pdf?id=bERaNdoegnO&) |  |  |  |  |  |  |  |  |  |  |
+| [Sampled EfficientZero]() |  |  |  |  |  |  |  |  |  |  |
+| [Stochastic MuZero]() |  |  |  |  |  |  |  |  |  |  |
+</details>
 ### Multi-Agent Reinforcement Learning
 <details close>