tsinghua-ee/SALMONN
Automatic Speech Recognition, PyTorch, English, automatic-audio-captioning, automatic-speech-translation, music-captioning, audio-based-storytelling, speech-audio-coreasoning, auditory understanding
arxiv: 2310.13289
arxiv: 2406.15704
License: apache-2.0
eaf3298
SALMONN/resource/response_demo
5 contributors
History: 1 commit
Changli: Upload 20 files (144d332, 10 months ago)
asr.png                  45.4 kB   Upload 20 files   10 months ago
asr_en2de.png            42 kB     Upload 20 files   10 months ago
audio_story_telling.png  280 kB    Upload 20 files   10 months ago
audiocaption.png         135 kB    Upload 20 files   10 months ago
emotion.png              33 kB     Upload 20 files   10 months ago
keywords.png             32.1 kB   Upload 20 files   10 months ago
music.png                45.3 kB   Upload 20 files   10 months ago
spoken_audio_query.png   234 kB    Upload 20 files   10 months ago
spoken_query.png         336 kB    Upload 20 files   10 months ago