Post
2411
Doing some testing with HunyuanVideo on the Hugging Face Inference Endpoints π€
prompt: "a Shiba Inu is acting as a DJ, he wears sunglasses and is mixing and scratching with vinyl discs at a Ibiza sunny sand beach party"
1280x720, 22 steps, 121 frames
There are still some things to iron out regarding speed and memory usage, right now it takes 20min on an A100 (see attached charts)
but you can check it out here:
https://huggingface.co/jbilcke-hf/HunyuanVideo-for-InferenceEndpoints
There are various things I want to try like the 100% diffusers version and other models (LTX-Video..)
prompt: "a Shiba Inu is acting as a DJ, he wears sunglasses and is mixing and scratching with vinyl discs at a Ibiza sunny sand beach party"
1280x720, 22 steps, 121 frames
There are still some things to iron out regarding speed and memory usage, right now it takes 20min on an A100 (see attached charts)
but you can check it out here:
https://huggingface.co/jbilcke-hf/HunyuanVideo-for-InferenceEndpoints
There are various things I want to try like the 100% diffusers version and other models (LTX-Video..)