Giga-World-1

Giga-World-1 Main Page

Directory Structure

model/
├── README.md
├── assets/
│   └── main_page.png
├── before_stage1/                       # Base checkpoints before stage-1 training
│   ├── Wan2p1_1p3B-FunContro-GigaRobo-alpha-diffusers/
│   ├── Wan2p1_1p3B-FunControl-diffusers/
│   └── Wan2p2_5B-FunControl-diffusers/
├── stage1/                              # Stage-1 fine-tuned checkpoints
│   ├── nano/                            # small (1.3B) variant
│   └── pro/                             # large (5B)   variant
└── stage2_distill/                      # Stage-2 distilled checkpoints
    └── ...

Training pipeline overview

Giga-World-1 training pipeline

Model Overview

Stage	Name	Path	Notes
Open Source	WAN 2.1 1.3B FunControl	Wan2.1-Fun-1.3B-Control	Open-source model.
Open Source	WAN 2.2 5B FunControl	Wan2.2-Fun-5B-Control	Open-source model.
Before Stage 1	GigaRobo Alpha Diffusers	`before_stage1/Wan2p1_1p3B-FunContro-GigaRobo-alpha-diffusers/`	Pretrained on Giga dataset, then converted to Diffusers.
Before Stage 1	WAN 2.1 1.3B Diffusers	`before_stage1/Wan2p1_1p3B-FunControl-diffusers/`	Vanilla Diffusers-converted checkpoint.
Before Stage 1	WAN 2.2 5B Diffusers	`before_stage1/Wan2p2_5B-FunControl-diffusers/`	Vanilla Diffusers-converted checkpoint.
Stage 1	Nano (1.3B)	`stage1/nano/`	Stage-1 fine-tuned from the 1.3B branches.
Stage 1	Pro (5B)	`stage1/pro/`	Stage-1 fine-tuned from the 5B branch.
Stage 2	Nano Distill	🚧 Coming soon	🚧 Coming soon.
Stage 2	Pro Distill	🚧 Coming soon	🚧 Coming soon.

Stage-1 checkpoint structure

Each Stage-1 variant (nano / pro) contains two released artifacts: a full Diffusers-format checkpoint and a scene LoRA checkpoint.

stage1/{nano,pro}/
├── Giga-World-1-*-stage1_final-diffusers/         # full Diffusers checkpoint
│   ├── model_index.json                           # Diffusers pipeline index
│   ├── transformer/                               # DiT / video transformer weights
│   ├── vae/                                       # VAE weights
│   ├── text_encoder/                              # text encoder weights
│   ├── tokenizer/                                 # tokenizer files
│   ├── scheduler/                                 # scheduler config
│   ├── image_encoder/                             # image encoder weights
│   └── image_processor/                           # image preprocessing config
└── Giga-World-1-*-stage1_scene_lora/              # scene LoRA checkpoint
    ├── pytorch_lora_weights.safetensors           # LoRA weights for inference
    ├── transformer_full/                          # full transformer export
    ├── transformer_partial.pth                    # partial transformer checkpoint
    ├── pytorch_model/                             # training checkpoint shards
    ├── distributed_checkpoint/                    # distributed training checkpoint
    ├── scheduler.bin                              # training scheduler state
    ├── latest                                     # latest checkpoint pointer
    ├── zero_to_fp32.py                            # ZeRO checkpoint conversion script
    └── random_states_*.pkl                        # training random states

Quick Start

Hugging Face Repository

https://huggingface.co/GigaAI-Research/Giga-World-1

SDK Download

# Install Hugging Face Hub
pip install huggingface_hub

# Download the model snapshot via Hugging Face Hub
from huggingface_hub import snapshot_download

model_dir = snapshot_download(repo_id='GigaAI-Research/Giga-World-1')

Git Download

git lfs install
git clone https://huggingface.co/GigaAI-Research/Giga-World-1

Acknowledgements

We sincerely thank the open-source community and the projects that make this work possible.

Thanks also to many other open-source contributors for their tools, models, and community support.

License

This model is released under the Apache License 2.0 unless otherwise specified.

Downloads last month: -

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support