--- license: apache-2.0 language: - en - zh --- # Model Card for *MineDreamer* 🔥 [![arXiv](https://img.shields.io/badge/arXiv%20papr-2403.12037-b31b1b.svg)](https://arxiv.org/abs/2403.12037) [![project page](https://img.shields.io/badge/Play%20with%20MineDreamer%21-MineDreamer%20project%20page-lightblue)](https://sites.google.com/view/minedreamer/main) *MineDreamer* is an instructable embodied agent for simulated control and it is developed on top of recent advances in Multimodal Large Language Models (MLLMs) and diffusion models!

*MineDreamer* can follow instructions steadily by employing a Chain-of-Imagination (CoI) mechanism to envision the step-by-step process of executing instructions and translating imaginations into more precise visual prompts tailored to the current state; subsequently, it generates keyboard-and-mouse actions to efficiently achieve these imaginations,

**This repo is used for hosting *MineDreamer*'s Q-Former checkpoints, which are the training stage 1 checkpoints for Imaginator.** For more details or tutorials see https://github.com/Zhoues/MineDreamer.