Update README.md
Browse files
README.md
CHANGED
@@ -34,11 +34,15 @@ allows for end-to-end fine-tuning with novel loss functions such as the CLAP sco
|
|
34 |
## Model Details
|
35 |
|
36 |
We share three model checkpoints:
|
37 |
-
- ConsistencyTTA directly distilled from a diffusion model
|
38 |
-
-
|
39 |
-
-
|
|
|
|
|
|
|
|
|
|
|
40 |
|
41 |
-
These model checkpoints can be found on our [Huggingface page](https://huggingface.co/Bai-YT/ConsistencyTTA).
|
42 |
After downloading and unzipping the files, place them in the `saved` directory.
|
43 |
|
44 |
Please refer to our [GitHub page](https://github.com/Bai-YT/ConsistencyTTA) for usage details.
|
|
|
34 |
## Model Details
|
35 |
|
36 |
We share three model checkpoints:
|
37 |
+
- [ConsistencyTTA directly distilled from a diffusion model](
|
38 |
+
https://huggingface.co/Bai-YT/ConsistencyTTA/blob/main/ConsistencyTTA.zip);
|
39 |
+
- [ConsistencyTTA fine-tuned by optimizing the CLAP score](
|
40 |
+
https://huggingface.co/Bai-YT/ConsistencyTTA/blob/main/ConsistencyTTA_CLAPFT.zip);
|
41 |
+
- [The diffusion teacher model from which ConsistencyTTA is distilled](
|
42 |
+
https://huggingface.co/Bai-YT/ConsistencyTTA/blob/main/LightweightLDM.zip).
|
43 |
+
|
44 |
+
The first two models are capable of high-quality single-step text-to-audio generation. Generations are 10 seconds long.
|
45 |
|
|
|
46 |
After downloading and unzipping the files, place them in the `saved` directory.
|
47 |
|
48 |
Please refer to our [GitHub page](https://github.com/Bai-YT/ConsistencyTTA) for usage details.
|