zjowowen commited on
Commit
8bbf056
1 Parent(s): f8452cc

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +15 -0
README.md CHANGED
@@ -43,6 +43,21 @@ If you want to contact us & join us, you can ✉️ to our team : <opendilab@p
43
 
44
  </details>
45
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
46
 
47
  ### Multi-Agent Reinforcement Learning
48
  <details close>
 
43
 
44
  </details>
45
 
46
+ ### Monte Carlo tree search
47
+ <details open>
48
+ <summary>(Click to Collapse)</summary>
49
+
50
+ | Algo.\Env. | [CartPole](https://di-engine-docs.readthedocs.io/en/latest/13_envs/cartpole.html) | [LunarLander](https://di-engine-docs.readthedocs.io/en/latest/13_envs/lunarlander.html) | [LunarLanderContinuous](https://di-engine-docs.readthedocs.io/en/latest/13_envs/lunarlander.html) | [Pendulum](https://di-engine-docs.readthedocs.io/en/latest/13_envs/pendulum.html) | [Pong](https://di-engine-docs.readthedocs.io/en/latest/13_envs/atari.html) | [Breakout](https://di-engine-docs.readthedocs.io/en/latest/13_envs/atari.html) | [MsPacman](https://di-engine-docs.readthedocs.io/en/latest/13_envs/atari.html) | []() | []() | []() |
51
+ | :-------------: | :-------------: | :-------------: | :------------------------: | :------------: | :--------------: | :------------: | :------------------: | :---------: | :---------: | :---------: |
52
+ | [AlphaZero](https://www.science.org/doi/10.1126/science.aar6404) | | | | | | | | | | |
53
+ | [Sampled AlphaZero]() | | | | | | | | | | |
54
+ | [Muzero](https://arxiv.org/abs/1911.08265) | [✅](https://huggingface.co/OpenDILabCommunity/CartPole-v0-MuZero) | | | | [✅](https://huggingface.co/OpenDILabCommunity/PongNoFrameskip-v4-MuZero) | | [✅](https://huggingface.co/OpenDILabCommunity/MsPacmanNoFrameskip-v4-MuZero) | | | |
55
+ | [EfficientZero](https://arxiv.org/abs/2111.00210) | | | | | | | | | | |
56
+ | [Gumbel MuZero](https://openreview.net/pdf?id=bERaNdoegnO&) | | | | | | | | | | |
57
+ | [Sampled EfficientZero]() | | | | | | | | | | |
58
+ | [Stochastic MuZero]() | | | | | | | | | | |
59
+
60
+ </details>
61
 
62
  ### Multi-Agent Reinforcement Learning
63
  <details close>