MoG
ZhuYuhan commited on
Commit
cd2d420
·
verified ·
1 Parent(s): 31a990e

Upload 11 files

Browse files
.gitattributes CHANGED
@@ -33,3 +33,10 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
 
 
 
 
 
 
 
 
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+ examples/1.gif filter=lfs diff=lfs merge=lfs -text
37
+ examples/2.gif filter=lfs diff=lfs merge=lfs -text
38
+ examples/3.gif filter=lfs diff=lfs merge=lfs -text
39
+ examples/4.gif filter=lfs diff=lfs merge=lfs -text
40
+ examples/5.gif filter=lfs diff=lfs merge=lfs -text
41
+ examples/6.gif filter=lfs diff=lfs merge=lfs -text
42
+ figs/pipeline_figure.pdf filter=lfs diff=lfs merge=lfs -text
README.md CHANGED
@@ -1,3 +1,51 @@
1
- ---
2
- license: apache-2.0
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # MoG: Motion-Aware Generative Frame Interpolation
2
+
3
+ <p style="display: flex; flex-direction: column; justify-content: center; align-items: center;">
4
+ <div style="width: 100%; text-align: center; margin-bottom: 4px;">
5
+ <img src="examples/1.gif" style="zoom:32%;">
6
+ <img src="examples/2.gif" style="zoom:32%;">
7
+ <img src="examples/3.gif" style="zoom:32%;">
8
+ </div>
9
+ <div style="width: 100%; text-align: center;">
10
+ <img src="examples/4.gif" style="zoom:32%;">
11
+ <img src="examples/5.gif" style="zoom:32%;">
12
+ <img src="examples/6.gif" style="zoom:32%;">
13
+ </div>
14
+ </p>
15
+
16
+ MoG is a generative video frame interpolation (VFI) model, designed to synthesize intermediate frames between two input frames.
17
+
18
+ MoG marks the first explicit incorporation of motion guidance between input frames to enhance the motion awareness of generative models. We demonstrate that the intermediate flow derived from flow-based VFI methods can effectively serve as motion guidance, and we propose a simple yet efficient approach to integrate this prior into the network. As a result, MoG achieves significant improvements over existing open-source generative VFI methods, excelling in both real-world and animated scenarios.
19
+
20
+ Source code is available at [https://github.com/MCG-NJU/MoG-VFI](https://github.com/MCG-NJU/MoG-VFI).
21
+
22
+ ## Network Arichitecture
23
+
24
+ ![pipeline_figure](figs/pipeline_figure.pdf)
25
+
26
+ ## Model Description
27
+
28
+ - **Developed by:** Nanjing University, Tencent PCG
29
+ - **Model type:** Generative video frame interploation model, takes two still video frames as input.
30
+ - **Arxiv paper**: [https://arxiv.org/pdf/2501.03699](https://arxiv.org/pdf/2501.03699)
31
+ - **Project page:** [https://mcg-nju.github.io/MoG_Web/](https://mcg-nju.github.io/MoG_Web/)
32
+ - **Repository**: [https://github.com/MCG-NJU/MoG-VFI](https://github.com/MCG-NJU/MoG-VFI)
33
+ - **License:** Apache 2.0 license.
34
+
35
+ # Usage
36
+
37
+ We develop MoG based on [DynamiCrafter](https://github.com/Doubiiu/DynamiCrafter) for real-world scenes and [ToonCrafter](https://github.com/Doubiiu/ToonCrafter) for animation scenes. Both checkpoints are available, and you can select the desired option by specifying the model parameter. Feel free to use it under the Apache 2.0 license.
38
+
39
+ ## Citation
40
+
41
+ If you find our code useful or our work relevant, please consider citing:
42
+
43
+ > @misc{zhang2024vfimambavideoframeinterpolation,
44
+ > title={VFIMamba: Video Frame Interpolation with State Space Models},
45
+ > author={Guozhen Zhang and Chunxu Liu and Yutao Cui and Xiaotong Zhao and Kai Ma and Limin Wang},
46
+ > year={2024},
47
+ > eprint={2407.02315},
48
+ > archivePrefix={arXiv},
49
+ > primaryClass={cs.CV},
50
+ > url={https://arxiv.org/abs/2407.02315},
51
+ > }
ani.ckpt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b29b622fc18557b383a414f90acebae7da02a0abcaa077333540c952daae8d55
3
+ size 10502364004
config.json ADDED
@@ -0,0 +1,5 @@
 
 
 
 
 
 
1
+ {
2
+ "LOG_NAME": "MoG",
3
+ "model_type": "MoG",
4
+ "transformers_version": "4.25.1"
5
+ }
examples/1.gif ADDED

Git LFS Details

  • SHA256: 56a5f816d48d227f9b8ee0d86ae2622aa77975f3c677a24ea28f334f0f02200e
  • Pointer size: 132 Bytes
  • Size of remote file: 3.84 MB
examples/2.gif ADDED

Git LFS Details

  • SHA256: 7df75706ff3ab230147f97952ebe64e85c969bfd2f46e912cb8d6ca5a55ada32
  • Pointer size: 132 Bytes
  • Size of remote file: 3.67 MB
examples/3.gif ADDED

Git LFS Details

  • SHA256: 625893e676d118ff4705f1e0037650559bec71de72d69267a65eec45f4e856b7
  • Pointer size: 132 Bytes
  • Size of remote file: 3.67 MB
examples/4.gif ADDED

Git LFS Details

  • SHA256: dc54fa9b73c46e11d5fd61c098456bfae3ace32fe93fce8e9e59fbd81ec26623
  • Pointer size: 132 Bytes
  • Size of remote file: 4.46 MB
examples/5.gif ADDED

Git LFS Details

  • SHA256: b67e5fbcc589b99ab1ef924a06037a6160d7ef48f05228bb7a2f7fef747c8b1c
  • Pointer size: 132 Bytes
  • Size of remote file: 2.99 MB
examples/6.gif ADDED

Git LFS Details

  • SHA256: d095ad0ea54a22f734259b2d93af8ff5ea61b7ffc161f0b63299b6a089708963
  • Pointer size: 132 Bytes
  • Size of remote file: 2.78 MB
figs/pipeline_figure.pdf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:7f4c7f30fa3cf90e28ba42fe8a50edf5d3f418e8b1137ec183785169633f2ba3
3
+ size 448248
real.ckpt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:0763cfaa2fc808574a7f9a9a4a4688377a013ae6af35a0b494733ee4ccdd055b
3
+ size 10437459961