πŸ“– Introduction

EchoStyle is a high-fidelity video stylization framework built on the Wan2.2 video generation model. It fine-tunes the Wan2.2 14B DiT model using LoRA adapters trained with a novel reverse data synthesis pipeline, enabling faithful style transfer across diverse artistic styles β€” Ghibli, anime, ink wash, oil painting, ukiyo-e, Disney 3D, and more.

Key Features

  • 🎨 Multi-style video transfer β€” supports 6+ artistic styles with a single framework
  • πŸ”„ Dual-LoRA architecture β€” separate LoRA models for high-noise (t=900-1000) and low-noise (t=0-900) timestep ranges, yielding better detail preservation
  • ⚑ Lightning inference β€” compatible with LightX2V distilled checkpoints for 10-step generation
  • πŸ–₯️ Scalable training β€” auto-adapts to multi-node multi-GPU clusters via FSDP + Ulysses sequence parallelism
Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support