Zhoues committed edab443 (1 parent: 8755437)

Update README.md

Files changed (1): README.md +4 -5
README.md CHANGED
@@ -6,16 +6,15 @@ language:
  - en
  - zh
  ---
+ # Model Card for *MineDreamer* 🔥

- # Model Card for VAR (Visual AutoRegressive) Transformers 🔥
-
- <!-- Provide a quick summary of what the model is/does. -->
+ <!-- Briefly summarize what the model is/does. -->

  [![arXiv](https://img.shields.io/badge/arXiv%20papr-2403.12037-b31b1b.svg)](https://arxiv.org/abs/2403.12037)

  [![project page](https://img.shields.io/badge/Play%20with%20MineDreamer%21-MineDreamer%20project%20page-lightblue)](https://sites.google.com/view/minedreamer/main)

- MineDreamer is an instructable embodied agent for simulated control and it is developed on top of recent advances in Multimodal Large Language Models (MLLMs) and diffusion models!
+ *MineDreamer* is an instructable embodied agent for simulated control and it is developed on top of recent advances in Multimodal Large Language Models (MLLMs) and diffusion models!



@@ -23,7 +22,7 @@ MineDreamer is an instructable embodied agent for simulated control and it is de
  <img src="https://cdn-uploads.huggingface.co/production/uploads/63f08dc79cf89c9ed1bb89cd/S62I1Tn5qz5qJ3IkgMHH8.png" width=93%>
  <p>

- MineDreamer can follow instructions steadily by employing a Chain-of-Imagination (CoI) mechanism to envision the step-by-step process of executing instructions and translating imaginations into more precise visual prompts tailored to the current state; subsequently, it generates keyboard-and-mouse actions to efficiently achieve these imaginations,
+ *MineDreamer* can follow instructions steadily by employing a Chain-of-Imagination (CoI) mechanism to envision the step-by-step process of executing instructions and translating imaginations into more precise visual prompts tailored to the current state; subsequently, it generates keyboard-and-mouse actions to efficiently achieve these imaginations,


  <p align="center">