Bai-YT
/

ConsistencyTTA

Model card Files Files and versions Community

Bai-YT commited on Feb 8

Commit

6ed754f

•

1 Parent(s): 5a0dcf3

Update README.md

Files changed (1) hide show

README.md +8 -4

README.md CHANGED Viewed

@@ -34,11 +34,15 @@ allows for end-to-end fine-tuning with novel loss functions such as the CLAP sco
 ## Model Details
 We share three model checkpoints:
-- ConsistencyTTA directly distilled from a diffusion model;
-- The above ConsistencyTTA model fine-tuned by optimizing the CLAP score;
-- The diffusion teacher model from which ConsistencyTTA is distilled.
-These model checkpoints can be found on our [Huggingface page](https://huggingface.co/Bai-YT/ConsistencyTTA).
 After downloading and unzipping the files, place them in the `saved` directory.
 Please refer to our [GitHub page](https://github.com/Bai-YT/ConsistencyTTA) for usage details.

 ## Model Details
 We share three model checkpoints:
+- [ConsistencyTTA directly distilled from a diffusion model](
+  https://huggingface.co/Bai-YT/ConsistencyTTA/blob/main/ConsistencyTTA.zip);
+- [ConsistencyTTA fine-tuned by optimizing the CLAP score](
+  https://huggingface.co/Bai-YT/ConsistencyTTA/blob/main/ConsistencyTTA_CLAPFT.zip);
+- [The diffusion teacher model from which ConsistencyTTA is distilled](
+  https://huggingface.co/Bai-YT/ConsistencyTTA/blob/main/LightweightLDM.zip).
+The first two models are capable of high-quality single-step text-to-audio generation. Generations are 10 seconds long.
 After downloading and unzipping the files, place them in the `saved` directory.
 Please refer to our [GitHub page](https://github.com/Bai-YT/ConsistencyTTA) for usage details.