Video-Foley
Official model checkpoint of "Video-Foley: Two-Stage Video-To-Sound Generation via Temporal Event Condition For Foley Sound".
(refer to the paper for further details)
Video2RMS
- files: checkpoint_000500_Video2RMS.pt (weight), opts.yml (config)
- training data: Greatest Hits (audio-visual, ~6hr)
RMS2Sound (including RMS-ControlNet)
- file: ControlNetstep300000.pth (weight)
- training data: FreeSound (audio only, ~6khr, in Wavcaps)
Citation
@article{video-foley,
title={Video-Foley: Two-Stage Video-To-Sound Generation via Temporal Event Condition For Foley Sound},
author={Lee, Junwon and Im, Jaekwon and Kim, Dabin and Nam, Juhan},
journal={arXiv preprint arXiv:2408.11915},
year={2024}
}
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
๐
Ask for provider support
HF Inference deployability: The model has no library tag.