tada-codec / README.md
sharath25's picture
Update README.md
173c2d9 verified
metadata
license: mit
language:
  - en
tags:
  - tts
  - text-to-speech
  - speech-language-model
arxiv: 2602.23068

TADA: A Generative Framework for Speech Modeling via Text-Acoustic Dual Alignment

Paper Demo Collection PyPI Blog License

image


A unified speech-language model that synchronizes speech and text into a single, cohesive stream via 1:1 alignment.


Text-Acoustic Dual-Alignment Large Language Model

TADA is a unified speech-language model that synchronizes speech and text into a single, cohesive stream via 1:1 alignment. By leveraging a novel tokenizer and architectural design, TADA achieves high-fidelity synthesis and generation with a fraction of the computational overhead required by traditional models.

⭐️ arxiv: https://arxiv.org/abs/2602.23068
⭐️ demo: https://huggingface.co/spaces/HumeAI/tada
⭐️ github: https://github.com/HumeAI/tada
⭐️ blog post: https://www.hume.ai/blog/opensource-tada