Upload 11 files

Browse files

Files changed (12) hide show

.gitattributes +7 -0
README.md +51 -3
ani.ckpt +3 -0
config.json +5 -0
examples/1.gif +3 -0
examples/2.gif +3 -0
examples/3.gif +3 -0
examples/4.gif +3 -0
examples/5.gif +3 -0
examples/6.gif +3 -0
figs/pipeline_figure.pdf +3 -0
real.ckpt +3 -0

.gitattributes CHANGED Viewed

@@ -33,3 +33,10 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
 *.zip filter=lfs diff=lfs merge=lfs -text
 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text

 *.zip filter=lfs diff=lfs merge=lfs -text
 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text
+examples/1.gif filter=lfs diff=lfs merge=lfs -text
+examples/2.gif filter=lfs diff=lfs merge=lfs -text
+examples/3.gif filter=lfs diff=lfs merge=lfs -text
+examples/4.gif filter=lfs diff=lfs merge=lfs -text
+examples/5.gif filter=lfs diff=lfs merge=lfs -text
+examples/6.gif filter=lfs diff=lfs merge=lfs -text
+figs/pipeline_figure.pdf filter=lfs diff=lfs merge=lfs -text

README.md CHANGED Viewed

@@ -1,3 +1,51 @@
----
-license: apache-2.0
----

+# MoG: Motion-Aware Generative Frame Interpolation
+<p style="display: flex; flex-direction: column; justify-content: center; align-items: center;">
+  <div style="width: 100%; text-align: center; margin-bottom: 4px;">
+    <img src="examples/1.gif" style="zoom:32%;">
+    <img src="examples/2.gif" style="zoom:32%;">
+    <img src="examples/3.gif" style="zoom:32%;">
+  </div>
+  <div style="width: 100%; text-align: center;">
+    <img src="examples/4.gif" style="zoom:32%;">
+    <img src="examples/5.gif" style="zoom:32%;">
+    <img src="examples/6.gif" style="zoom:32%;">
+  </div>
+</p>
+MoG is a generative video frame interpolation (VFI) model, designed to synthesize intermediate frames between two input frames.
+MoG marks the first explicit incorporation of motion guidance between input frames to enhance the motion awareness of generative models. We demonstrate that the intermediate flow derived from flow-based VFI methods can effectively serve as motion guidance, and we propose a simple yet efficient approach to integrate this prior into the network. As a result, MoG achieves significant improvements over existing open-source generative VFI methods, excelling in both real-world and animated scenarios.
+Source code is available at [https://github.com/MCG-NJU/MoG-VFI](https://github.com/MCG-NJU/MoG-VFI).
+## Network Arichitecture
+![pipeline_figure](figs/pipeline_figure.pdf)
+## Model Description
+- **Developed by:** Nanjing University, Tencent PCG
+- **Model type:** Generative video frame interploation model, takes two still video frames as input.
+- **Arxiv paper**: [https://arxiv.org/pdf/2501.03699](https://arxiv.org/pdf/2501.03699)
+- **Project page:** [https://mcg-nju.github.io/MoG_Web/](https://mcg-nju.github.io/MoG_Web/)
+- **Repository**: [https://github.com/MCG-NJU/MoG-VFI](https://github.com/MCG-NJU/MoG-VFI)
+- **License:** Apache 2.0 license.
+# Usage
+We develop MoG based on [DynamiCrafter](https://github.com/Doubiiu/DynamiCrafter) for real-world scenes and [ToonCrafter](https://github.com/Doubiiu/ToonCrafter) for animation scenes. Both checkpoints are available, and you can select the desired option by specifying the model parameter. Feel free to use it under the Apache 2.0 license.
+## Citation
+If you find our code useful or our work relevant, please consider citing:
+> @misc{zhang2024vfimambavideoframeinterpolation,
+>       title={VFIMamba: Video Frame Interpolation with State Space Models},
+>       author={Guozhen Zhang and Chunxu Liu and Yutao Cui and Xiaotong Zhao and Kai Ma and Limin Wang},
+>       year={2024},
+>       eprint={2407.02315},
+>       archivePrefix={arXiv},
+>       primaryClass={cs.CV},
+>       url={https://arxiv.org/abs/2407.02315},
+> }