Diffusers
Safetensors

Giga-World-1

Giga-World-1 Main Page

Directory Structure

model/
β”œβ”€β”€ README.md
β”œβ”€β”€ assets/
β”‚   └── main_page.png
β”œβ”€β”€ before_stage1/                       # Base checkpoints before stage-1 training
β”‚   β”œβ”€β”€ Wan2p1_1p3B-FunContro-GigaRobo-alpha-diffusers/
β”‚   β”œβ”€β”€ Wan2p1_1p3B-FunControl-diffusers/
β”‚   └── Wan2p2_5B-FunControl-diffusers/
β”œβ”€β”€ stage1/                              # Stage-1 fine-tuned checkpoints
β”‚   β”œβ”€β”€ nano/                            # small (1.3B) variant
β”‚   └── pro/                             # large (5B)   variant
└── stage2_distill/                      # Stage-2 distilled checkpoints
    └── ...

Training pipeline overview

Giga-World-1 training pipeline

Model Overview

Stage Name Path Notes
Open Source WAN 2.1 1.3B FunControl Wan2.1-Fun-1.3B-Control Hugging Face Open-source model.
Open Source WAN 2.2 5B FunControl Wan2.2-Fun-5B-Control Hugging Face Open-source model.
Before Stage 1 GigaRobo Alpha Diffusers before_stage1/Wan2p1_1p3B-FunContro-GigaRobo-alpha-diffusers/ Pretrained on Giga dataset, then converted to Diffusers.
Before Stage 1 WAN 2.1 1.3B Diffusers before_stage1/Wan2p1_1p3B-FunControl-diffusers/ Vanilla Diffusers-converted checkpoint.
Before Stage 1 WAN 2.2 5B Diffusers before_stage1/Wan2p2_5B-FunControl-diffusers/ Vanilla Diffusers-converted checkpoint.
Stage 1 Nano (1.3B) stage1/nano/ Stage-1 fine-tuned from the 1.3B branches.
Stage 1 Pro (5B) stage1/pro/ Stage-1 fine-tuned from the 5B branch.
Stage 2 Nano Distill 🚧 Coming soon 🚧 Coming soon.
Stage 2 Pro Distill 🚧 Coming soon 🚧 Coming soon.

Stage-1 checkpoint structure

Each Stage-1 variant (nano / pro) contains two released artifacts: a full Diffusers-format checkpoint and a scene LoRA checkpoint.

stage1/{nano,pro}/
β”œβ”€β”€ Giga-World-1-*-stage1_final-diffusers/         # full Diffusers checkpoint
β”‚   β”œβ”€β”€ model_index.json                           # Diffusers pipeline index
β”‚   β”œβ”€β”€ transformer/                               # DiT / video transformer weights
β”‚   β”œβ”€β”€ vae/                                       # VAE weights
β”‚   β”œβ”€β”€ text_encoder/                              # text encoder weights
β”‚   β”œβ”€β”€ tokenizer/                                 # tokenizer files
β”‚   β”œβ”€β”€ scheduler/                                 # scheduler config
β”‚   β”œβ”€β”€ image_encoder/                             # image encoder weights
β”‚   └── image_processor/                           # image preprocessing config
└── Giga-World-1-*-stage1_scene_lora/              # scene LoRA checkpoint
    β”œβ”€β”€ pytorch_lora_weights.safetensors           # LoRA weights for inference
    β”œβ”€β”€ transformer_full/                          # full transformer export
    β”œβ”€β”€ transformer_partial.pth                    # partial transformer checkpoint
    β”œβ”€β”€ pytorch_model/                             # training checkpoint shards
    β”œβ”€β”€ distributed_checkpoint/                    # distributed training checkpoint
    β”œβ”€β”€ scheduler.bin                              # training scheduler state
    β”œβ”€β”€ latest                                     # latest checkpoint pointer
    β”œβ”€β”€ zero_to_fp32.py                            # ZeRO checkpoint conversion script
    └── random_states_*.pkl                        # training random states

Quick Start

Hugging Face Repository

https://huggingface.co/GigaAI-Research/Giga-World-1

SDK Download

# Install Hugging Face Hub
pip install huggingface_hub
# Download the model snapshot via Hugging Face Hub
from huggingface_hub import snapshot_download

model_dir = snapshot_download(repo_id='GigaAI-Research/Giga-World-1')

Git Download

git lfs install
git clone https://huggingface.co/GigaAI-Research/Giga-World-1

Acknowledgements

We sincerely thank the open-source community and the projects that make this work possible.

Diffusers LeRobot Hugging Face ModelScope PyTorch

Thanks also to many other open-source contributors for their tools, models, and community support.

License

This model is released under the Apache License 2.0 unless otherwise specified.

Downloads last month
-
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support