Instructions to use YuanzeLin/Illumicraft-checkpoints with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Diffusers
How to use YuanzeLin/Illumicraft-checkpoints with Diffusers:
pip install -U diffusers transformers accelerate
import torch from diffusers import DiffusionPipeline # switch to "mps" for apple devices pipe = DiffusionPipeline.from_pretrained("YuanzeLin/Illumicraft-checkpoints", dtype=torch.bfloat16, device_map="cuda") prompt = "Astronaut in a jungle, cold color palette, muted colors, detailed, 8k" image = pipe(prompt).images[0] - Notebooks
- Google Colab
- Kaggle
This repository contains the pretrained checkpoints for:
IllumiCraft: Unified Geometry and Illumination Diffusion for Controllable Video Generation (NeurIPS 2025)
Yuanze Lin, Yi-Wen Chen, Yi-Hsuan Tsai, Ronald Clark, Ming-Hsuan Yang
π Links
- π Paper: https://arxiv.org/abs/2506.03150
- π Project Page: https://yuanze-lin.me/IllumiCraft_page/
- π» GitHub: https://github.com/yuanze-lin/IllumiCraft
- π₯ YouTube: https://youtu.be/qAV58sADEzo
- π€ Dataset: https://huggingface.co/datasets/YuanzeLin/IllumiCraft
β¨ Overview
IllumiCraft is a controllable video generation framework that jointly models scene geometry and illumination dynamics. By leveraging foreground appearance, HDR lighting maps, and 3D point tracks, IllumiCraft enables temporally consistent video generation with controllable motion and relighting effects.
π₯ Download Checkpoints
Download the pretrained checkpoints:
python download_weights.py
The downloaded checkpoints will be organized as:
checkpoints/
βββ Wan2.1-Fun-1.3B-Control/
βββ illumicraft_pretrained_weights/
π Usage
Please refer to the official GitHub repository for installation, dataset preparation, training, and inference instructions:
https://github.com/yuanze-lin/IllumiCraft
π Citation
If you find IllumiCraft useful for your research, please consider citing:
@article{lin2026illumicraft,
title={IllumiCraft: Unified Geometry and Illumination Diffusion for Controllable Video Generation},
author={Lin, Yuanze and Chen, Yi-Wen and Tsai, Yi-Hsuan and Clark, Ronald and Yang, Ming-Hsuan},
journal={Advances in Neural Information Processing Systems},
volume={38},
pages={27798--27829},
year={2026}
}
- Downloads last month
- -