File size: 771 Bytes
3157f12
 
93e00a1
 
 
 
 
 
 
 
3157f12
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
---
license: mit
tags:
- world-model
- robotic-manipulation
- video-generation
- video-prediction
- gpt
base_model:
- thuml/ivideogpt-oxe-64-act-free
---

# iVideoGPT (Fine-tuned to BAIR Robot Pushing, 64x64 resolution, action-free)

Fine-tuned model introduced in the paper [iVideoGPT: Interactive VideoGPTs are Scalable World Models](https://arxiv.org/abs/2405.15223).

See https://github.com/thuml/iVideoGPT for examples for using this model.

## Citation

```
@inproceedings{wu2024ivideogpt,
    title={iVideoGPT: Interactive VideoGPTs are Scalable World Models}, 
    author={Jialong Wu and Shaofeng Yin and Ningya Feng and Xu He and Dong Li and Jianye Hao and Mingsheng Long},
    booktitle={Advances in Neural Information Processing Systems},
    year={2024}
}
```