Binarybardakshat
commited on
Update README.md
Browse files
README.md
CHANGED
@@ -1,3 +1,39 @@
|
|
1 |
-
---
|
2 |
-
license: apache-2.0
|
3 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
license: apache-2.0
|
3 |
+
language:
|
4 |
+
- en
|
5 |
+
pipeline_tag: text-to-video
|
6 |
+
tags:
|
7 |
+
- art
|
8 |
+
- code
|
9 |
+
---
|
10 |
+
# RCNA MINI
|
11 |
+
|
12 |
+
**RCNA MINI** is a compact **LoRA** (Low-Rank Adaptation) model designed for generating high-quality, 4-step text-to-video outputs. It can create video clips ranging from 4 to 16 seconds long, making it ideal for generating short animations with rich details and smooth transitions.
|
13 |
+
|
14 |
+
## Key Features:
|
15 |
+
- **4-step Text-to-Video**: Generates videos from a text prompt in just 4 steps.
|
16 |
+
- **Video Length**: Can generate videos from 4 seconds to 16 seconds long.
|
17 |
+
- **High Quality**: Supports high-resolution and detailed outputs (up to 8K).
|
18 |
+
- **Fast Sampling**: Leveraging decoupled consistency learning, the model is optimized for speed while maintaining quality.
|
19 |
+
|
20 |
+
## Example Outputs:
|
21 |
+
|
22 |
+
- **Prompt**: "Astronaut in a jungle, cold color palette, muted colors, detailed, 8K"
|
23 |
+
- Generates a high-quality video with rich details and smooth motion.
|
24 |
+
|
25 |
+
## How it Works:
|
26 |
+
RCNA MINI is based on the LoRA architecture, which fine-tunes diffusion models using low-rank adaptations. This results in faster generation and less computational overhead compared to full model retraining.
|
27 |
+
|
28 |
+
## Applications:
|
29 |
+
- Short-form animations for social media content
|
30 |
+
- Video generation for creative projects
|
31 |
+
- Artistic video generation based on textual descriptions
|
32 |
+
|
33 |
+
## Model Details:
|
34 |
+
- **Architecture**: LoRA applied to diffusion models
|
35 |
+
- **Inference Steps**: 4-step generation
|
36 |
+
- **Output Length**: 4 to 16 seconds
|
37 |
+
|
38 |
+
## License:
|
39 |
+
This model is licensed under the [MIT License](LICENSE).
|