benjamin-paine/dragnuwa-pruned-safetensors


benjamin-paine committed on Jan 13

Commit e47ae10 (1 parent: 3047418)

Create README.md

Files changed (1)

README.md +51 -0

README.md ADDED

	@@ -0,0 +1,51 @@

+---
+license: mit
+---
+# This Repository
+This repository contains pruned `.safetensors` versions of the [DragNUWA](https://huggingface.co/yinsming/DragNUWA) weights uploaded by [yinsming](https://huggingface.co/yinsming). The text below is a copy of that repository's README at the time of upload.
+# DragNUWA
+**DragNUWA** enables users to manipulate backgrounds or objects within images directly, and the model seamlessly translates these actions into **camera movements** or **object motions**, generating the corresponding video.
+See our paper: [DragNUWA: Fine-grained Control in Video Generation by Integrating Text, Image, and Trajectory](https://arxiv.org/abs/2308.08089)
+GitHub repository: [https://github.com/ProjectNUWA/DragNUWA](https://github.com/ProjectNUWA/DragNUWA)
+<a href="https://huggingface.co/spaces/yinsming/DragNUWA">
+    <img src="https://img.shields.io/badge/%F0%9F%A4%97-Open%20in%20Spaces-blue" alt="Open in Spaces">
+</a>
+<a href="TOBEDONE">
+    <img src="https://colab.research.google.com/assets/colab-badge.svg" alt="Open in Colab">
+</a>
+### DragNUWA 1.5 (Updated on Jan 8, 2024)
+**DragNUWA 1.5** enables Stable Video Diffusion to animate an image along a specified path.
+<p align="center">
+  <img src="assets/DragNUWA1.5/Figure1.gif" width="90%">
+</p>
+<p align="center">
+  <img src="assets/DragNUWA1.5/Figure2.gif" width="90%">
+</p>
+<p align="center">
+  <img src="assets/DragNUWA1.5/Figure3.gif" width="90%">
+</p>
+<p align="center">
+  <img src="assets/DragNUWA1.5/Figure4.gif" width="90%">
+</p>
+### Citation
+```bibtex
+@article{yin2023dragnuwa,
+  title={{DragNUWA}: Fine-grained Control in Video Generation by Integrating Text, Image, and Trajectory},
+  author={Yin, Shengming and Wu, Chenfei and Liang, Jian and Shi, Jie and Li, Houqiang and Ming, Gong and Duan, Nan},
+  journal={arXiv preprint arXiv:2308.08089},
+  year={2023}
+}
+```
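The pruned `.safetensors` files this repository describes can be inspected without loading any weights, because the format starts with an 8-byte little-endian header length followed by a JSON header listing each tensor's dtype, shape, and byte offsets. The following is a minimal sketch using only the Python standard library. The function names, the tensor name `demo.weight`, and the file name `tiny.safetensors` are illustrative assumptions, not names taken from this repository, and the demo writes its own tiny file rather than assuming any real checkpoint file name.

```python
import json
import os
import struct
import tempfile


def read_safetensors_header(path):
    """Return the JSON header of a .safetensors file without reading tensor data."""
    with open(path, "rb") as f:
        # The first 8 bytes hold the header length as a little-endian unsigned 64-bit int.
        (header_len,) = struct.unpack("<Q", f.read(8))
        # The next header_len bytes are a UTF-8 JSON object describing the tensors.
        return json.loads(f.read(header_len).decode("utf-8"))


def summarize(header):
    """Map each tensor name to (dtype, shape), skipping the optional metadata entry."""
    return {
        name: (info["dtype"], tuple(info["shape"]))
        for name, info in header.items()
        if name != "__metadata__"
    }


def write_tiny_example(path):
    """Write a minimal file in the same layout, holding one hypothetical F32 tensor."""
    data = struct.pack("<2f", 1.0, 2.0)  # two float32 values, 8 bytes in total
    header = {
        "demo.weight": {  # hypothetical tensor name, for illustration only
            "dtype": "F32",
            "shape": [2],
            "data_offsets": [0, len(data)],
        }
    }
    header_bytes = json.dumps(header).encode("utf-8")
    with open(path, "wb") as f:
        f.write(struct.pack("<Q", len(header_bytes)))
        f.write(header_bytes)
        f.write(data)


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        example_path = os.path.join(tmp, "tiny.safetensors")
        write_tiny_example(example_path)
        for name, (dtype, shape) in summarize(read_safetensors_header(example_path)).items():
            print(name, dtype, shape)
```

Pointing `read_safetensors_header` at a downloaded checkpoint gives a quick way to compare tensor names and dtypes between an original and a pruned file. For actually loading weights, the `safetensors` library is the more usual route.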