File size: 771 Bytes
3157f12 93e00a1 3157f12 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 |
---
license: mit
tags:
- world-model
- robotic-manipulation
- video-generation
- video-prediction
- gpt
base_model:
- thuml/ivideogpt-oxe-64-act-free
---
# iVideoGPT (Fine-tuned to BAIR Robot Pushing, 64x64 resolution, action-free)
Fine-tuned model introduced in the paper [iVideoGPT: Interactive VideoGPTs are Scalable World Models](https://arxiv.org/abs/2405.15223).
See https://github.com/thuml/iVideoGPT for examples for using this model.
## Citation
```
@inproceedings{wu2024ivideogpt,
title={iVideoGPT: Interactive VideoGPTs are Scalable World Models},
author={Jialong Wu and Shaofeng Yin and Ningya Feng and Xu He and Dong Li and Jianye Hao and Mingsheng Long},
booktitle={Advances in Neural Information Processing Systems},
year={2024}
}
``` |