Darío Muñoz Prudant's picture

Darío Muñoz Prudant PRO

prudant

AI & ML interests

Tech enthusiast, avid AI learner, and perpetual seeker of new knowledge.

Recent Activity

Articles

Organizations

AIModels.org's profile picture Comunidad Latinoamericana - Entendimiento (y procesamiento) del lenguaje natural's profile picture Dolfs AI's profile picture

prudant's activity

reacted to reach-vb's post with 👀 29 days ago
view post
Post
1587
Smol TTS models are here! OuteTTS-0.1-350M - Zero shot voice cloning, built on LLaMa architecture, CC-BY license! 🔥

> Pure language modeling approach to TTS
> Zero-shot voice cloning
> LLaMa architecture w/ Audio tokens (WavTokenizer)
> BONUS: Works on-device w/ llama.cpp ⚡

Three-step approach to TTS:

> Audio tokenization using WavTokenizer (75 tok per second)
> CTC forced alignment for word-to-audio token mapping
> Structured prompt creation w/ transcription, duration, audio tokens

The model is extremely impressive for 350M parameters! Kudos to the
OuteAI team on such a brilliant feat - I'd love to see this be applied on larger data and smarter backbones like SmolLM 🤗

Check out the models here: OuteAI/outetts-6728aa71a53a076e4ba4817c
New activity in OpenGVLab/InternVL2-8B-MPO about 1 month ago

awq quant

#1 opened about 1 month ago by
prudant
New activity in meta-llama/Llama-3.2-11B-Vision about 2 months ago
New activity in InfiniFlow/deepdoc 2 months ago

Documentation?

1
#1 opened 9 months ago by
ThewindMom
upvoted an article 2 months ago
view article
Article

¡Lanzamiento de la Comunidad Latinoamericana de NLP en Hugging Face! 🌟

By prudant
7
published an article 2 months ago
view article
Article

¡Lanzamiento de la Comunidad Latinoamericana de NLP en Hugging Face! 🌟

By prudant
7
updated a Space 3 months ago
New activity in yujunhuinlp/LayoutReader-only-layout-large 3 months ago

where is the model?

3
#1 opened 5 months ago by
moyanxinxu
New activity in MaziyarPanahi/calme-2.4-rys-78b 3 months ago

Collaboration?

10
#10 opened 3 months ago by
dnhkng
New activity in fishaudio/fish-speech-1.4 3 months ago

License

3
#4 opened 3 months ago by
jameshuntercarter
New activity in stepfun-ai/GOT-OCR2_0 3 months ago

License?

4
#1 opened 3 months ago by
nbroad
New activity in coqui/XTTS-v2 5 months ago
New activity in super-cinnamon/fewshot-followup-multi-e5 5 months ago

dataset

8
#1 opened 6 months ago by
prudant
New activity in Qwen/Qwen2-7B 5 months ago
New activity in Qwen/Qwen2-72B-Instruct-AWQ 6 months ago

Error AutoAWQ tensor 4 vllm

4
#3 opened 6 months ago by
fersebas
New activity in Alibaba-NLP/gte-Qwen2-1.5B-instruct 6 months ago

sequence classification

1
#3 opened 6 months ago by
prudant
New activity in Alibaba-NLP/gte-Qwen2-7B-instruct 6 months ago

question about quants

3
#12 opened 6 months ago by
prudant