π Introduction
EchoStyle is a high-fidelity video stylization framework built on the Wan2.2 video generation model. It fine-tunes the Wan2.2 14B DiT model using LoRA adapters trained with a novel reverse data synthesis pipeline, enabling faithful style transfer across diverse artistic styles β Ghibli, anime, ink wash, oil painting, ukiyo-e, Disney 3D, and more.
Key Features
- π¨ Multi-style video transfer β supports 6+ artistic styles with a single framework
- π Dual-LoRA architecture β separate LoRA models for high-noise (t=900-1000) and low-noise (t=0-900) timestep ranges, yielding better detail preservation
- β‘ Lightning inference β compatible with LightX2V distilled checkpoints for 10-step generation
- π₯οΈ Scalable training β auto-adapts to multi-node multi-GPU clusters via FSDP + Ulysses sequence parallelism
Inference Providers NEW
This model isn't deployed by any Inference Provider. π Ask for provider support