📖 Introduction

EchoStyle is a high-fidelity video stylization framework built on the Wan2.2 video generation model. It fine-tunes the Wan2.2 14B DiT model with LoRA adapters trained on data from a novel reverse data synthesis pipeline, enabling faithful transfer across diverse artistic styles, including Ghibli, anime, ink wash, oil painting, ukiyo-e, and Disney 3D, among others.

Key Features

🎨 Multi-style video transfer — supports 6+ artistic styles with a single framework
🔄 Dual-LoRA architecture — separate LoRA models for high-noise (t=900-1000) and low-noise (t=0-900) timestep ranges, yielding better detail preservation
⚡ Lightning inference — compatible with LightX2V distilled checkpoints for 10-step generation
🖥️ Scalable training — auto-adapts to multi-node multi-GPU clusters via FSDP + Ulysses sequence parallelism
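
The dual-LoRA split can be pictured as a simple routing rule on the denoising timestep. The sketch below is illustrative only and is not EchoStyle's actual code: the names `select_lora` and `plan_schedule` are hypothetical, the evenly spaced 10-step schedule is an assumption (real distilled schedulers may space timesteps differently), and because the two stated ranges share t=900, assigning that boundary to the high-noise adapter is also an assumption.

```python
from typing import List, Tuple

# Boundary taken from the feature list above. Which adapter owns t=900 itself
# is NOT stated in the source; giving it to the high-noise adapter is an assumption.
BOUNDARY_T = 900.0


def select_lora(timestep: float, boundary: float = BOUNDARY_T) -> str:
    """Return the (hypothetical) adapter name responsible for a timestep."""
    if not 0 <= timestep <= 1000:
        raise ValueError(f"timestep {timestep} is outside [0, 1000]")
    return "high_noise" if timestep >= boundary else "low_noise"


def plan_schedule(num_steps: int = 10, t_max: float = 1000.0) -> List[Tuple[float, str]]:
    """Pair each timestep of an evenly spaced, descending schedule with an adapter.

    The even spacing is an illustrative assumption, not the scheduler
    used by Wan2.2 or LightX2V.
    """
    step = t_max / num_steps
    timesteps = [t_max - i * step for i in range(num_steps)]
    return [(t, select_lora(t)) for t in timesteps]


if __name__ == "__main__":
    for t, adapter in plan_schedule():
        print(f"t={t:6.1f} -> {adapter}")
```

Under these assumptions, only the first few steps of a short schedule fall in the high-noise range, so the low-noise adapter handles most of the denoising trajectory.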
