PhyscalX committed on
Commit
1e66007
1 Parent(s): 9487909

Update model type

  1. README.md +2 -2
README.md CHANGED
@@ -10,11 +10,11 @@ tags:
  ## Model Details
  - **Developed by:** BAAI
- - **Model type:** Masked Autoregressive Text-to-Video Generation Model
+ - **Model type:** Non-quantized Autoregressive Text-to-Video Generation Model
  - **Model size:** 645M
  - **Model precision:** torch.float16 (FP16)
  - **Model resolution:** 768x480
- - **Model Description:** This is a model that can be used to generate and modify videos based on text prompts. It is a [Masked Autoregressive (MAR)](https://arxiv.org/abs/2406.11838) diffusion model that uses a pretrained text encoder ([Phi-2](https://huggingface.co/microsoft/phi-2)) and one VAE video tokenizer ([OpenSoraPlanV1.2-VAE](https://huggingface.co/LanguageBind/Open-Sora-Plan-v1.2.0)).
+ - **Model Description:** This is a model that can be used to generate and modify videos based on text prompts. It is a [Non-quantized Video Autoregressive (NOVA)](https://arxiv.org/abs/2412.14169) diffusion model that uses a pretrained text encoder ([Phi-2](https://huggingface.co/microsoft/phi-2)) and one VAE video tokenizer ([OpenSoraPlanV1.2-VAE](https://huggingface.co/LanguageBind/Open-Sora-Plan-v1.2.0)).
  - **Model License:** [Apache 2.0 License](LICENSE)
  - **Resources for more information:** [GitHub Repository](https://github.com/baaivision/NOVA).