arxiv:2407.14329
Xuenan Xu
wsntxxn
AI & ML interests
Text to Speech Synthesis
Text to Music Synthesis
Singing Voice Synthesis
Organizations
None yet
Papers
10
models
7
wsntxxn/cnn14rnn-tempgru-audiocaps-captioning
Feature Extraction
•
Updated
•
1
•
1
wsntxxn/effb2-trm-audiocaps-captioning
Feature Extraction
•
Updated
•
20
•
1
wsntxxn/effb2-trm-clotho-captioning
Feature Extraction
•
Updated
•
21
•
1
wsntxxn/cnn8rnn-w2vmean-audiocaps-grounding
Audio Classification
•
Updated
•
72
•
2
wsntxxn/cnn8rnn-audioset-sed
Audio Classification
•
Updated
•
454
•
1
wsntxxn/audiocaps-simple-tokenizer
Updated
wsntxxn/clotho-simple-tokenizer
Updated
datasets
None public yet