happpylittlecat
commited on
Commit
•
a75ebf2
1
Parent(s):
b3aa8aa
first commit
Browse files
README.md
CHANGED
@@ -90,4 +90,17 @@ IPython.display.Audio(data=audio, rate=16000)
|
|
90 |
|
91 |
The auffusion model will be automatically downloaded from huggingface and saved in cache. Subsequent runs will load the model directly from cache.
|
92 |
|
93 |
-
Other audio manipulation examples can be seen in [https://github.com/happylittlecat2333/Auffusion/notebooks](https://github.com/happylittlecat2333/Auffusion/notebooks). We only show the default text-to-audio example here.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
90 |
|
91 |
The auffusion model will be automatically downloaded from huggingface and saved in cache. Subsequent runs will load the model directly from cache.
|
92 |
|
93 |
+
Other audio manipulation examples can be seen in [https://github.com/happylittlecat2333/Auffusion/notebooks](https://github.com/happylittlecat2333/Auffusion/notebooks). We only show the default text-to-audio example here.
|
94 |
+
|
95 |
+
## Citation
|
96 |
+
|
97 |
+
Please consider citing the following article if you found our work useful:
|
98 |
+
|
99 |
+
```bibtex
|
100 |
+
@article{xue2024auffusion,
|
101 |
+
title={Auffusion: Leveraging the Power of Diffusion and Large Language Models for Text-to-Audio Generation},
|
102 |
+
author={Jinlong Xue and Yayue Deng and Yingming Gao and Ya Li},
|
103 |
+
journal={arXiv preprint arXiv:2401.01044},
|
104 |
+
year={2024}
|
105 |
+
}
|
106 |
+
```
|