This is just a copied checkpoint from TuneAVideo. The only difference is in the model_index.json where UNet3DCondition model is loaded from diffusers. Made this for the port I'm working on.