Spaces:
Runtime error
Runtime error
# ComfyUI CogVideoX-Fun | |
Easily use CogVideoX-Fun inside ComfyUI! | |
- [Installation](#1-installation) | |
- [Node types](#node-types) | |
- [Example workflows](#example-workflows) | |
## 1. Installation | |
### Option 1: Install via ComfyUI Manager | |
TBD | |
### Option 2: Install manually | |
The CogVideoX-Fun repository needs to be placed at `ComfyUI/custom_nodes/CogVideoX-Fun/`. | |
``` | |
cd ComfyUI/custom_nodes/ | |
# Git clone the cogvideox_fun itself | |
git clone https://github.com/aigc-apps/CogVideoX-Fun.git | |
# Git clone the video outout node | |
git clone https://github.com/Kosinkadink/ComfyUI-VideoHelperSuite.git | |
cd CogVideoX-Fun/ | |
python install.py | |
``` | |
### 2. Download models into `ComfyUI/models/CogVideoX_Fun/` | |
V1.1: | |
| 名称 | 存储空间 | Hugging Face | Model Scope | 描述 | | |
|--|--|--|--|--| | |
| CogVideoX-Fun-V1.1-2b-InP.tar.gz | Before extraction:9.7 GB \/ After extraction: 13.0 GB | [🤗Link](https://huggingface.co/alibaba-pai/CogVideoX-Fun-V1.1-2b-InP) | [😄Link](https://modelscope.cn/models/PAI/CogVideoX-Fun-V1.1-2b-InP) | Our official graph-generated video model is capable of predicting videos at multiple resolutions (512, 768, 1024, 1280) and has been trained on 49 frames at a rate of 8 frames per second. Noise has been added to the reference image, and the amplitude of motion is greater compared to V1.0. | | |
| CogVideoX-Fun-V1.1-5b-InP.tar.gz | Before extraction:16.0 GB \/ After extraction: 20.0 GB | [🤗Link](https://huggingface.co/alibaba-pai/CogVideoX-Fun-V1.1-5b-InP) | [😄Link](https://modelscope.cn/models/PAI/CogVideoX-Fun-V1.1-5b-InP) | Our official graph-generated video model is capable of predicting videos at multiple resolutions (512, 768, 1024, 1280) and has been trained on 49 frames at a rate of 8 frames per second. Noise has been added to the reference image, and the amplitude of motion is greater compared to V1.0. | | |
| CogVideoX-Fun-V1.1-2b-Pose.tar.gz | Before extraction:9.7 GB \/ After extraction: 13.0 GB | [🤗Link](https://huggingface.co/alibaba-pai/CogVideoX-Fun-V1.1-2b-Pose) | [😄Link](https://modelscope.cn/models/PAI/CogVideoX-Fun-V1.1-2b-Pose) | Our official pose-control video model is capable of predicting videos at multiple resolutions (512, 768, 1024, 1280) and has been trained on 49 frames at a rate of 8 frames per second.| | |
| CogVideoX-Fun-V1.1-5b-Pose.tar.gz | Before extraction:16.0 GB \/ After extraction: 20.0 GB | [🤗Link](https://huggingface.co/alibaba-pai/CogVideoX-Fun-V1.1-5b-Pose) | [😄Link](https://modelscope.cn/models/PAI/CogVideoX-Fun-V1.1-5b-Pose) | Our official pose-control video model is capable of predicting videos at multiple resolutions (512, 768, 1024, 1280) and has been trained on 49 frames at a rate of 8 frames per second.| | |
V1.0: | |
| Name | Storage Space | Hugging Face | Model Scope | Description | | |
|--|--|--|--|--| | |
| CogVideoX-Fun-2b-InP.tar.gz | Before extraction:9.7 GB \/ After extraction: 13.0 GB | [🤗Link](https://huggingface.co/alibaba-pai/CogVideoX-Fun-2b-InP) | [😄Link](https://modelscope.cn/models/PAI/CogVideoX-Fun-2b-InP) | Our official graph-generated video model is capable of predicting videos at multiple resolutions (512, 768, 1024, 1280) and has been trained on 49 frames at a rate of 8 frames per second. | | |
| CogVideoX-Fun-5b-InP.tar.gz | Before extraction:16.0 GB \/ After extraction: 20.0 GB | [🤗Link](https://huggingface.co/alibaba-pai/CogVideoX-Fun-5b-InP)| [😄Link](https://modelscope.cn/models/PAI/CogVideoX-Fun-5b-InP)| Our official graph-generated video model is capable of predicting videos at multiple resolutions (512, 768, 1024, 1280) and has been trained on 49 frames at a rate of 8 frames per second. | | |
## Node types | |
- **LoadCogVideoX_Fun_Model** | |
- Loads the CogVideoX-Fun model | |
- **CogVideoX_FUN_TextBox** | |
- Write the prompt for CogVideoX-Fun model | |
- **CogVideoX_Fun_I2VSampler** | |
- CogVideoX-Fun Sampler for Image to Video | |
- **CogVideoX_Fun_T2VSampler** | |
- CogVideoX-Fun Sampler for Text to Video | |
- **CogVideoX_Fun_V2VSampler** | |
- CogVideoX-Fun Sampler for Video to Video | |
## Example workflows | |
### Video to video generation | |
Our ui is shown as follow, this is the [download link](https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/cogvideox_fun/asset/v1.1/cogvideoxfunv1.1_workflow_v2v.json) of the json: | |
data:image/s3,"s3://crabby-images/26d29/26d295d6d8b7f65ddba40a212dc99a716300aa10" alt="workflow graph" | |
You can run the demo using following video: | |
[demo video](https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/cogvideox_fun/asset/v1/play_guitar.mp4) | |
### Control video generation | |
Our ui is shown as follow, this is the [download link](https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/cogvideox_fun/asset/v1.1/cogvideoxfunv1.1_workflow_v2v_control.json) of the json: | |
data:image/s3,"s3://crabby-images/f8bf7/f8bf769698f0dbe37cc7d478b477f06920ad9b03" alt="workflow graph" | |
You can run the demo using following video: | |
[demo video](https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/cogvideox_fun/asset/v1.1/pose.mp4) | |
### Image to video generation | |
Our ui is shown as follow, this is the [download link](https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/cogvideox_fun/asset/v1.1/cogvideoxfunv1.1_workflow_i2v.json) of the json: | |
data:image/s3,"s3://crabby-images/4bd01/4bd0161f38d6a9d6b4fbd4f717e0f6f4dcc6a714" alt="workflow graph" | |
You can run the demo using following photo: | |
data:image/s3,"s3://crabby-images/263b8/263b8c8050611bffbd0ceaf51777397528196166" alt="demo image" | |
### Text to video generation | |
Our ui is shown as follow, this is the [download link](https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/cogvideox_fun/asset/v1.1/cogvideoxfunv1.1_workflow_t2v.json) of the json: | |
data:image/s3,"s3://crabby-images/c2d31/c2d317aa346064c3e6517311095889ede3aab4e8" alt="workflow graph" |