cerspense committed
Commit 493a3d9
Parent(s): 97cb3c0

Update README.md

Files changed (1)
  1. README.md +14 -16
README.md CHANGED
@@ -7,33 +7,31 @@ tags:
  # zeroscope_v2 30x448x256

- Modelscope without the watermark, optimized for high quality 16:9 compositions and a smooth output.<br />
- Trained at 30 frames, 448x256 resolution using 9923 clips and 29,769 tagged frames<br />

- This low-res modelscope model is intended to be upscaled with [potat1](https://huggingface.co/camenduru/potat1) using vid2vid in the 1111 text2video extension by [kabachuha](https://github.com/kabachuha) <br />

- [example output](https://i.imgur.com/lj90FYP.mp4) upscaled to 1152 x 640 with potat1<br />

- ### 1111 text2video extension usage

- 1. Rename zeroscope_v2_30x448x256.pth to text2video_pytorch_model.pth<br />
- 2. Rename zeroscope_v2_30x448x256_text.bin to open_clip_pytorch_model.bin<br />
- 3. Replace files in stable-diffusion-webui\models\ModelScope\t2v<br />

-
- ### Upscaling
-
- I recommend upscaling this using vid2vid in the 1111 extension to 1152x640 with a denoise strength between 0.66 and 0.85. Use the same prompt and settings used to create the original clip. <br />

  ### Known issues

- Using a lower resolution or fewer frames will result in a worse output <br />
- Many clips come out with cuts. This will be fixed soon with 2.1 with a much cleaner dataset <br />
- Some clips come out too slow, and might need prompt engineering to be faster in pace <br />
-
 
  # zeroscope_v2 30x448x256

+ A watermark-free Modelscope-based video model optimized for producing high-quality 16:9 compositions and smooth video output.<br />
+ This model was trained using 9,923 clips and 29,769 tagged frames at 30 frames, 448x256 resolution.<br />
+ zeroscope_v2 30x448x256 is specifically designed for upscaling with [Potat1](https://huggingface.co/camenduru/potat1) using vid2vid in the 1111 text2video extension by [kabachuha](https://github.com/kabachuha).<br />

+ Leveraging this model as a preliminary step allows for superior overall compositions at higher resolutions in Potat1, permitting faster exploration at 448x256 before transitioning to a high-resolution render.<br />
+ See an [example output](https://i.imgur.com/lj90FYP.mp4) that has been upscaled to 1152 x 640 using Potat1.<br />

+ ### Using it with the 1111 text2video extension

+ 1. Rename the file 'zeroscope_v2_30x448x256.pth' to 'text2video_pytorch_model.pth'.
+ 2. Rename the file 'zeroscope_v2_30x448x256_text.bin' to 'open_clip_pytorch_model.bin'.
+ 3. Replace the respective files in the 'stable-diffusion-webui\models\ModelScope\t2v' directory; the sketch below shows the same steps as a script.
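The three steps above amount to copying the two downloaded weight files into the extension's model folder under new names. A minimal Python sketch of the same operation, assuming the two files sit in the current directory (the source location is an assumption; adjust it to wherever you saved the checkpoints):

```python
# Steps 1-3 as a script: copy the two checkpoints into the 1111 text2video
# extension's ModelScope folder under the file names it expects.
from pathlib import Path
import shutil

downloads = Path(".")  # assumed download location; adjust as needed
t2v_dir = Path("stable-diffusion-webui/models/ModelScope/t2v")
t2v_dir.mkdir(parents=True, exist_ok=True)

# Steps 1 and 2: rename while copying; step 3: place them in the t2v folder.
shutil.copy(downloads / "zeroscope_v2_30x448x256.pth",
            t2v_dir / "text2video_pytorch_model.pth")
shutil.copy(downloads / "zeroscope_v2_30x448x256_text.bin",
            t2v_dir / "open_clip_pytorch_model.bin")
```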
 
 
+ ### Upscaling recommendations

+ For upscaling, it's recommended to use Potat1 via vid2vid in the 1111 extension. Aim for a resolution of 1152x640 and a denoise strength between 0.66 and 0.85. Remember to use the same prompt and settings that were used to generate the original clip.
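Purely for reference, those recommendations can be captured as a small settings summary; the key names below are illustrative placeholders, not the extension's actual UI field names:

```python
# Illustrative summary of the recommended Potat1 vid2vid upscale pass.
# Key names are placeholders, not the 1111 extension's real field names.
upscale_pass = {
    "model": "potat1",
    "width": 1152,
    "height": 640,
    "denoising_strength": 0.75,  # recommended range: 0.66-0.85
    "prompt": "<same prompt and settings as the original 448x256 clip>",
}
```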
 
 
 
  ### Known issues

+ Lower resolutions or fewer frames could lead to suboptimal output.<br />
+ Certain clips might appear with cuts. This will be fixed in the upcoming 2.1 version, which will incorporate a cleaner dataset.<br />
+ Some clips may play back too slowly, requiring prompt engineering for an increased pace.<br />