Amphion

https://openhlt.github.io/amphion/

AI & ML interests

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

Papers

FlexiSLM: A Dynamic and Controllable Frame Rate Spoken Language Model

Organization Card

Amphion is an open-source audio, music, and speech generation toolkit developed by a team led by Prof. Zhizheng Wu at the Chinese University of Hong Kong, Shenzhen. The toolkit is developed in collaboration with OpenMMLab.

The north-star objective of Amphion is to offer a platform for studying the conversion of any input into audio. It aims to support reproducible research and to help junior researchers and engineers get started in audio, music, and speech generation research and development. Amphion also offers a distinctive feature: visualizations of classic models and architectures, which we believe help junior researchers and engineers better understand those models.

Technical Report: https://huggingface.co/papers/2312.09911

Discord: https://discord.com/invite/ZxxREr3Y