Watermark-free ModelScope-based video generation
Generate detailed prompts for Stable Diffusion
Transform video frames using text instructions