zjowowen commited on
Commit
09d194f
β€’
1 Parent(s): 0886197

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -3
README.md CHANGED
@@ -47,11 +47,11 @@ If you want to contact us & join us, you can βœ‰οΈ to our team : <opendilab@p
47
  <details open>
48
  <summary>(Click to Collapse)</summary>
49
 
50
- | Algo.\Env. | [CartPole](https://di-engine-docs.readthedocs.io/en/latest/13_envs/cartpole.html) | [LunarLander](https://di-engine-docs.readthedocs.io/en/latest/13_envs/lunarlander.html) | [LunarLanderContinuous](https://di-engine-docs.readthedocs.io/en/latest/13_envs/lunarlander.html) | [Pendulum](https://di-engine-docs.readthedocs.io/en/latest/13_envs/pendulum.html) | [Pong](https://di-engine-docs.readthedocs.io/en/latest/13_envs/atari.html) | [Breakout](https://di-engine-docs.readthedocs.io/en/latest/13_envs/atari.html) | [MsPacman](https://di-engine-docs.readthedocs.io/en/latest/13_envs/atari.html) | []() | []() | []() |
51
  | :-------------: | :-------------: | :-------------: | :------------------------: | :------------: | :--------------: | :------------: | :------------------: | :---------: | :---------: | :---------: |
52
- | [AlphaZero](https://www.science.org/doi/10.1126/science.aar6404) | | | | | | | | | | |
53
  | [Sampled AlphaZero](https://www.science.org/doi/10.1126/science.aar6404) | | | | | | | | | | |
54
- | [Muzero](https://arxiv.org/abs/1911.08265) | [βœ…](https://huggingface.co/OpenDILabCommunity/CartPole-v0-MuZero) | [βœ…](https://huggingface.co/OpenDILabCommunity/LunarLander-v2-MuZero) | | | [βœ…](https://huggingface.co/OpenDILabCommunity/PongNoFrameskip-v4-MuZero) | | [βœ…](https://huggingface.co/OpenDILabCommunity/MsPacmanNoFrameskip-v4-MuZero) | | | |
55
  | [EfficientZero](https://arxiv.org/abs/2111.00210) | [βœ…](https://huggingface.co/OpenDILabCommunity/CartPole-v0-EfficientZero) | | | | | | | | | |
56
  | [Gumbel MuZero](https://openreview.net/pdf?id=bERaNdoegnO&) | [βœ…](https://huggingface.co/OpenDILabCommunity/CartPole-v0-GumbelMuZero) | | | | | | | | | |
57
  | [Sampled EfficientZero](https://arxiv.org/abs/2104.06303) | [βœ…](https://huggingface.co/OpenDILabCommunity/CartPole-v0-SampledEfficientZero) | | | | | | | | | |
 
47
  <details open>
48
  <summary>(Click to Collapse)</summary>
49
 
50
+ | Algo.\Env. | [CartPole](https://di-engine-docs.readthedocs.io/en/latest/13_envs/cartpole.html) | [LunarLander](https://di-engine-docs.readthedocs.io/en/latest/13_envs/lunarlander.html) | [LunarLanderContinuous](https://di-engine-docs.readthedocs.io/en/latest/13_envs/lunarlander.html) | [Pendulum](https://di-engine-docs.readthedocs.io/en/latest/13_envs/pendulum.html) | [Pong](https://di-engine-docs.readthedocs.io/en/latest/13_envs/atari.html) | [Breakout](https://di-engine-docs.readthedocs.io/en/latest/13_envs/atari.html) | [MsPacman](https://di-engine-docs.readthedocs.io/en/latest/13_envs/atari.html) | [TicTacToe]() | []() | []() |
51
  | :-------------: | :-------------: | :-------------: | :------------------------: | :------------: | :--------------: | :------------: | :------------------: | :---------: | :---------: | :---------: |
52
+ | [AlphaZero](https://www.science.org/doi/10.1126/science.aar6404) | | | | | | | | [βœ…](https://huggingface.co/OpenDILabCommunity/TicTacToe-play-with-bot-AlphaZero) | | |
53
  | [Sampled AlphaZero](https://www.science.org/doi/10.1126/science.aar6404) | | | | | | | | | | |
54
+ | [Muzero](https://arxiv.org/abs/1911.08265) | [βœ…](https://huggingface.co/OpenDILabCommunity/CartPole-v0-MuZero) | [βœ…](https://huggingface.co/OpenDILabCommunity/LunarLander-v2-MuZero) | | | [βœ…](https://huggingface.co/OpenDILabCommunity/PongNoFrameskip-v4-MuZero) | | [βœ…](https://huggingface.co/OpenDILabCommunity/MsPacmanNoFrameskip-v4-MuZero) | [βœ…](https://huggingface.co/OpenDILabCommunity/TicTacToe-play-with-bot-MuZero) | | |
55
  | [EfficientZero](https://arxiv.org/abs/2111.00210) | [βœ…](https://huggingface.co/OpenDILabCommunity/CartPole-v0-EfficientZero) | | | | | | | | | |
56
  | [Gumbel MuZero](https://openreview.net/pdf?id=bERaNdoegnO&) | [βœ…](https://huggingface.co/OpenDILabCommunity/CartPole-v0-GumbelMuZero) | | | | | | | | | |
57
  | [Sampled EfficientZero](https://arxiv.org/abs/2104.06303) | [βœ…](https://huggingface.co/OpenDILabCommunity/CartPole-v0-SampledEfficientZero) | | | | | | | | | |