Spaces:
Running
Running
sugarfreez
commited on
Commit
·
4d5391a
1
Parent(s):
8f8a2a0
style(nyz): add naive model zoo table
Browse files
README.md
CHANGED
@@ -20,3 +20,30 @@ As an important part of OpenXLab from Shanghai AI Laboratory, OpenDILab features
|
|
20 |
OpenDILab contributes to the integration of the latest and most comprehensive achievements in academia as well as the standardization of complex problems in the industry. Our future vision is to promote the development of AI **from perceptual intelligence to decision intelligence,** taking AI technology to a higher level of the general intelligence era.
|
21 |
|
22 |
If you want to contact us & join us, you can ✉️ to our team : <opendilab@pjlab.org.cn>.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
20 |
OpenDILab contributes to the integration of the latest and most comprehensive achievements in academia as well as the standardization of complex problems in the industry. Our future vision is to promote the development of AI **from perceptual intelligence to decision intelligence,** taking AI technology to a higher level of the general intelligence era.
|
21 |
|
22 |
If you want to contact us & join us, you can ✉️ to our team : <opendilab@pjlab.org.cn>.
|
23 |
+
|
24 |
+
|
25 |
+
# Overview of Model Zoo
|
26 |
+
|
27 |
+
## Deep Reinforcement Learning
|
28 |
+
|
29 |
+
| Algo.\Env. | LunarLander | BipedalWalker | Pendulum | Atari (Pong) | Atari (SpaceInvaders) | Atari (Qbert) | MuJoCo (Hopper) | MuJoCo (Halfcheetah) | MuJoCo (Walker2d) |
|
30 |
+
| ------------- | ------------- | ------------------------ | ------------ | -------------- | ------------ | ------------------ | --------- | --------- | --------- |
|
31 |
+
| [PPO](https://arxiv.org/abs/1707.06347) | [Model](https://huggingface.co/OpenDILabCommunity/LunarLander-v2-ppo) | | | | | | | | |
|
32 |
+
|
33 |
+
## Multi-Agent Reinforcement Learning
|
34 |
+
<details close>
|
35 |
+
<summary>(Click for Details)</summary>
|
36 |
+
TBD
|
37 |
+
</details>
|
38 |
+
|
39 |
+
## Offline Reinforcement Learning
|
40 |
+
<details close>
|
41 |
+
<summary>(Click for Details)</summary>
|
42 |
+
TBD
|
43 |
+
</details>
|
44 |
+
|
45 |
+
## Model-Based Reinforcement Learning
|
46 |
+
<details close>
|
47 |
+
<summary>(Click for Details)</summary>
|
48 |
+
TBD
|
49 |
+
</details>
|