Delete README.md
README.md
DELETED
@@ -1,54 +0,0 @@
----
-thumbnail: https://user-images.githubusercontent.com/54370274/243292723-fa703668-a931-41e1-8bcf-19c72203980b.png
-tags:
-- TextTovideo
-- Text2Video
-- text-to-video
----
-
-🐣 Please follow me for new updates https://twitter.com/camenduru <br />
-🔥 Please join our discord server https://discord.gg/k5BwmmvJJU
-
-![00041-3056174990](https://github.com/camenduru/Text-To-Video-Finetuning-colab/assets/54370274/fa703668-a931-41e1-8bcf-19c72203980b)
-
-# Potat 1️⃣
-First Open-Source 1024x576 Text To Video Model 🥳
-
-https://huggingface.co/vdo/potat1-5000/tree/main <br />
-https://huggingface.co/vdo/potat1-10000/tree/main = https://huggingface.co/camenduru/potat1 (you are here) <br />
-https://huggingface.co/vdo/potat1-15000/tree/main <br />
-https://huggingface.co/vdo/potat1-20000/tree/main <br />
-https://huggingface.co/vdo/potat1-25000/tree/main <br />
-https://huggingface.co/vdo/potat1-30000/tree/main <br />
-https://huggingface.co/vdo/potat1-35000/tree/main <br />
-https://huggingface.co/vdo/potat1-40000/tree/main <br />
-https://huggingface.co/vdo/potat1-45000/tree/main <br />
-https://huggingface.co/vdo/potat1-50000/tree/main <br />
-
-### Info
-Prototype Model <br />
-Trained with https://lambdalabs.com ❤ 1xA100 (40GB) <br />
-2197 clips, 68388 tagged frames ( [salesforce/blip2-opt-6.7b-coco](https://huggingface.co/Salesforce/blip2-opt-6.7b-coco) ) <br />
-train_steps: 10000 <br />
-
-### Dataset & Config
-https://huggingface.co/camenduru/potat1_dataset/tree/main
-
-### Finetuning
-https://github.com/Breakthrough/PySceneDetect <br />
-https://github.com/ExponentialML/Video-BLIP2-Preprocessor <br />
-https://github.com/ExponentialML/Text-To-Video-Finetuning <br />
-https://github.com/camenduru/Text-To-Video-Finetuning-colab <br />
-
-### Base Model
-https://huggingface.co/damo-vilab/modelscope-damo-text-to-video-synthesis <br />
-https://www.modelscope.cn/models/damo/text-to-video-synthesis <br />
-
-Thanks to [damo-vilab](https://damo.alibaba.com/) ❤ [ExponentialML](https://github.com/ExponentialML) ❤ [kabachuha](https://github.com/kabachuha) ❤ [@DiffusersLib](https://twitter.com/DiffusersLib) ❤ [@LambdaAPI](https://twitter.com/LambdaAPI) ❤ [@cerspense](https://twitter.com/cerspense) ❤ [@CiaraRowles1](https://twitter.com/CiaraRowles1) ❤ [@p1atdev_art](https://twitter.com/p1atdev_art) ❤ <br />
-
-Please try it 🐣 <br />
-https://github.com/camenduru/text-to-video-synthesis-colab <br />
-
-<video src="https://user-images.githubusercontent.com/54370274/243275155-97282de4-e1df-49a0-851e-cb8b4b040441.mp4" data-canonical-src="https://user-images.githubusercontent.com/54370274/243275155-97282de4-e1df-49a0-851e-cb8b4b040441.mp4" controls="controls" muted="muted" class="d-block rounded-bottom-2 border-top width-fit" style="max-height:640px; min-height: 200px"></video>
-
-Potat 2️⃣ is in the oven ♨ <br />