zjowowen committed
Commit 978c0c2 • 1 Parent(s): c303d78

Update README.md

Files changed (1)
  1. README.md +12 -12
README.md CHANGED
@@ -31,18 +31,18 @@ If you want to contact us & join us, you can ✉️ to our team : <opendilab@p
  <details open>
  <summary>(Click to Collapse)</summary>
 
- | Algo.\Env. | [LunarLander](https://di-engine-docs.readthedocs.io/en/latest/13_envs/lunarlander.html) | [BipedalWalker](https://di-engine-docs.readthedocs.io/en/latest/13_envs/bipedalwalker.html) | [Pendulum](https://di-engine-docs.readthedocs.io/en/latest/13_envs/pendulum.html) | [Pong](https://di-engine-docs.readthedocs.io/en/latest/13_envs/atari.html) | [SpaceInvaders](https://di-engine-docs.readthedocs.io/en/latest/13_envs/atari.html) | [Qbert](https://di-engine-docs.readthedocs.io/en/latest/13_envs/atari.html) | [Hopper](https://di-engine-docs.readthedocs.io/en/latest/13_envs/mujoco.html) | [Halfcheetah](https://di-engine-docs.readthedocs.io/en/latest/13_envs/mujoco.html) | [Walker2d](https://di-engine-docs.readthedocs.io/en/latest/13_envs/mujoco.html) |
- | :-------------: | :-------------: | :------------------------: | :------------: | :--------------: | :------------: | :------------------: | :---------: | :---------: | :---------: |
- | [PPO](https://arxiv.org/pdf/1707.06347.pdf) | [✅](https://huggingface.co/OpenDILabCommunity/Lunarlander-v2-PPO) | | | [✅](https://huggingface.co/OpenDILabCommunity/PongNoFrameskip-v4-PPO) | [✅](https://huggingface.co/OpenDILabCommunity/SpaceInvadersNoFrameskip-v4-PPO) | [✅](https://huggingface.co/OpenDILabCommunity/QbertNoFrameskip-v4-PPO) | [✅](https://huggingface.co/OpenDILabCommunity/Hopper-v3-PPO) | | |
- | [PPO(Offpolicy)](https://arxiv.org/pdf/1707.06347.pdf) | [✅](https://huggingface.co/OpenDILabCommunity/LunarLander-v2-PPOOffPolicy) | | | [✅](https://huggingface.co/OpenDILabCommunity/PongNoFrameskip-v4-PPOOffPolicy) | | [✅](https://huggingface.co/OpenDILabCommunity/QbertNoFrameskip-v4-PPOOffPolicy) | | | |
- | [PG](https://proceedings.neurips.cc/paper/1999/file/464d828b85b0bed98e80ade0a5c43b0f-Paper.pdf) | ⏳ | | [✅](https://huggingface.co/OpenDILabCommunity/Pendulum-v1-PG) | | | | ⏳ | | |
- | [A2C](https://arxiv.org/pdf/1602.01783.pdf) | [✅](https://huggingface.co/OpenDILabCommunity/Lunarlander-v2-A2C) | [✅](https://huggingface.co/OpenDILabCommunity/BipedalWalker-v3-A2C) | | | | | [✅](https://huggingface.co/OpenDILabCommunity/Hopper-v3-A2C) | | |
- | [IMPALA](https://arxiv.org/pdf/1802.01561.pdf) | ⏳ | | | | | | ⏳ | | |
- | [DQN](https://storage.googleapis.com/deepmind-media/dqn/DQNNaturePaper.pdf) | [✅](https://huggingface.co/OpenDILabCommunity/Lunarlander-v2-DQN) | 🔓 | 🔓 | ⏳ | [✅](https://huggingface.co/OpenDILabCommunity/SpaceInvadersNoFrameskip-v4-DQN) | ⏳ | 🔓 | 🔓 | 🔓 |
- | [C51](https://arxiv.org/abs/1707.06887) | [✅](https://huggingface.co/OpenDILabCommunity/Lunarlander-v2-C51) | 🔓 | 🔓 | [✅](https://huggingface.co/OpenDILabCommunity/PongNoFrameskip-v4-C51) | [✅](https://huggingface.co/OpenDILabCommunity/SpaceInvadersNoFrameskip-v4-C51) | [✅](https://huggingface.co/OpenDILabCommunity/QbertNoFrameskip-v4-C51) | 🔓 | 🔓 | 🔓 |
- | [DDPG](https://arxiv.org/pdf/1509.02971.pdf) | [✅](https://huggingface.co/OpenDILabCommunity/LunarLander-v2-DDPG) | [✅](https://huggingface.co/OpenDILabCommunity/BipedalWalker-v3-DDPG) | [✅](https://huggingface.co/OpenDILabCommunity/Pendulum-v1-DDPG) | 🔓 | 🔓 | 🔓 | [✅](https://huggingface.co/OpenDILabCommunity/Hopper-v3-DDPG) | [✅](https://huggingface.co/OpenDILabCommunity/HalfCheetah-v3-DDPG) | [✅](https://huggingface.co/OpenDILabCommunity/Walker2d-v3-DDPG) |
- | [TD3](https://arxiv.org/pdf/1802.09477.pdf) | [✅](https://huggingface.co/OpenDILabCommunity/LunarLander-v2-TD3) | [✅](https://huggingface.co/OpenDILabCommunity/BipedalWalker-v3-TD3) | [✅](https://huggingface.co/OpenDILabCommunity/Pendulum-v1-TD3) | 🔓 | 🔓 | 🔓 |[✅](https://huggingface.co/OpenDILabCommunity/Hopper-v3-TD3) | [✅](https://huggingface.co/OpenDILabCommunity/HalfCheetah-v3-TD3) | [✅](https://huggingface.co/OpenDILabCommunity/Walker2d-v3-TD3) |
- | [SAC](https://arxiv.org/pdf/1801.01290.pdf) | [✅](https://huggingface.co/OpenDILabCommunity/LunarLander-v2-SAC) | [✅](https://huggingface.co/OpenDILabCommunity/BipedalWalker-v3-SAC) | [✅](https://huggingface.co/OpenDILabCommunity/Pendulum-v1-SAC) | 🔓 | 🔓 | 🔓 | [✅](https://huggingface.co/OpenDILabCommunity/Hopper-v3-SAC) | [✅](https://huggingface.co/OpenDILabCommunity/HalfCheetah-v3-SAC) | [✅](https://huggingface.co/OpenDILabCommunity/Walker2d-v3-SAC) |
 
  </details>
 
 
  <details open>
  <summary>(Click to Collapse)</summary>
 
+ | Algo.\Env. | [LunarLander](https://di-engine-docs.readthedocs.io/en/latest/13_envs/lunarlander.html) | [LunarLanderContinuous](https://di-engine-docs.readthedocs.io/en/latest/13_envs/lunarlander.html) | [BipedalWalker](https://di-engine-docs.readthedocs.io/en/latest/13_envs/bipedalwalker.html) | [Pendulum](https://di-engine-docs.readthedocs.io/en/latest/13_envs/pendulum.html) | [Pong](https://di-engine-docs.readthedocs.io/en/latest/13_envs/atari.html) | [SpaceInvaders](https://di-engine-docs.readthedocs.io/en/latest/13_envs/atari.html) | [Qbert](https://di-engine-docs.readthedocs.io/en/latest/13_envs/atari.html) | [Hopper](https://di-engine-docs.readthedocs.io/en/latest/13_envs/mujoco.html) | [Halfcheetah](https://di-engine-docs.readthedocs.io/en/latest/13_envs/mujoco.html) | [Walker2d](https://di-engine-docs.readthedocs.io/en/latest/13_envs/mujoco.html) |
+ | :-------------: | :-------------: | :-------------: | :------------------------: | :------------: | :--------------: | :------------: | :------------------: | :---------: | :---------: | :---------: |
+ | [PPO](https://arxiv.org/pdf/1707.06347.pdf) | [✅](https://huggingface.co/OpenDILabCommunity/Lunarlander-v2-PPO) | | | | [✅](https://huggingface.co/OpenDILabCommunity/PongNoFrameskip-v4-PPO) | [✅](https://huggingface.co/OpenDILabCommunity/SpaceInvadersNoFrameskip-v4-PPO) | [✅](https://huggingface.co/OpenDILabCommunity/QbertNoFrameskip-v4-PPO) | [✅](https://huggingface.co/OpenDILabCommunity/Hopper-v3-PPO) | | |
+ | [PPO(Offpolicy)](https://arxiv.org/pdf/1707.06347.pdf) | [✅](https://huggingface.co/OpenDILabCommunity/LunarLander-v2-PPOOffPolicy) | | | | [✅](https://huggingface.co/OpenDILabCommunity/PongNoFrameskip-v4-PPOOffPolicy) | | [✅](https://huggingface.co/OpenDILabCommunity/QbertNoFrameskip-v4-PPOOffPolicy) | | | |
+ | [PG](https://proceedings.neurips.cc/paper/1999/file/464d828b85b0bed98e80ade0a5c43b0f-Paper.pdf) | | ⏳ | | [✅](https://huggingface.co/OpenDILabCommunity/Pendulum-v1-PG) | | | | ⏳ | | |
+ | [A2C](https://arxiv.org/pdf/1602.01783.pdf) | [✅](https://huggingface.co/OpenDILabCommunity/Lunarlander-v2-A2C) | | [✅](https://huggingface.co/OpenDILabCommunity/BipedalWalker-v3-A2C) | | | | | [✅](https://huggingface.co/OpenDILabCommunity/Hopper-v3-A2C) | | |
+ | [IMPALA](https://arxiv.org/pdf/1802.01561.pdf) | ⏳ | | | | | | | ⏳ | | |
+ | [DQN](https://storage.googleapis.com/deepmind-media/dqn/DQNNaturePaper.pdf) | [✅](https://huggingface.co/OpenDILabCommunity/Lunarlander-v2-DQN) | 🔓 | 🔓 | 🔓 | ⏳ | [✅](https://huggingface.co/OpenDILabCommunity/SpaceInvadersNoFrameskip-v4-DQN) | ⏳ | 🔓 | 🔓 | 🔓 |
+ | [C51](https://arxiv.org/abs/1707.06887) | [✅](https://huggingface.co/OpenDILabCommunity/Lunarlander-v2-C51) | 🔓 | 🔓 | 🔓 | [✅](https://huggingface.co/OpenDILabCommunity/PongNoFrameskip-v4-C51) | [✅](https://huggingface.co/OpenDILabCommunity/SpaceInvadersNoFrameskip-v4-C51) | [✅](https://huggingface.co/OpenDILabCommunity/QbertNoFrameskip-v4-C51) | 🔓 | 🔓 | 🔓 |
+ | [DDPG](https://arxiv.org/pdf/1509.02971.pdf) | 🔓 | [✅](https://huggingface.co/OpenDILabCommunity/LunarLander-v2-DDPG) | [✅](https://huggingface.co/OpenDILabCommunity/BipedalWalker-v3-DDPG) | [✅](https://huggingface.co/OpenDILabCommunity/Pendulum-v1-DDPG) | 🔓 | 🔓 | 🔓 | [✅](https://huggingface.co/OpenDILabCommunity/Hopper-v3-DDPG) | [✅](https://huggingface.co/OpenDILabCommunity/HalfCheetah-v3-DDPG) | [✅](https://huggingface.co/OpenDILabCommunity/Walker2d-v3-DDPG) |
+ | [TD3](https://arxiv.org/pdf/1802.09477.pdf) | 🔓 | [✅](https://huggingface.co/OpenDILabCommunity/LunarLander-v2-TD3) | [✅](https://huggingface.co/OpenDILabCommunity/BipedalWalker-v3-TD3) | [✅](https://huggingface.co/OpenDILabCommunity/Pendulum-v1-TD3) | 🔓 | 🔓 | 🔓 |[✅](https://huggingface.co/OpenDILabCommunity/Hopper-v3-TD3) | [✅](https://huggingface.co/OpenDILabCommunity/HalfCheetah-v3-TD3) | [✅](https://huggingface.co/OpenDILabCommunity/Walker2d-v3-TD3) |
+ | [SAC](https://arxiv.org/pdf/1801.01290.pdf) | 🔓 | [✅](https://huggingface.co/OpenDILabCommunity/LunarLander-v2-SAC) | [✅](https://huggingface.co/OpenDILabCommunity/BipedalWalker-v3-SAC) | [✅](https://huggingface.co/OpenDILabCommunity/Pendulum-v1-SAC) | 🔓 | 🔓 | 🔓 | [✅](https://huggingface.co/OpenDILabCommunity/Hopper-v3-SAC) | [✅](https://huggingface.co/OpenDILabCommunity/HalfCheetah-v3-SAC) | [✅](https://huggingface.co/OpenDILabCommunity/Walker2d-v3-SAC) |
 
  </details>