Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
tsinghua-ee
/
SALMONN
like
38
Follow
Electronic Engineering @Tsinghua University
8
Automatic Speech Recognition
PyTorch
English
automatic-audio-captioning
automatic-speech-translation
music-captioning
audio-based-storytelling
speech-audio-coreasoning
auditory understanding
arxiv:
2310.13289
arxiv:
2406.15704
License:
apache-2.0
Model card
Files
Files and versions
Community
2
96f3cb5
SALMONN
/
resource
/
response_demo
5 contributors
History:
1 commit
Changli
Upload 20 files
144d332
about 1 year ago
asr.png
Safe
45.4 kB
Upload 20 files
about 1 year ago
asr_en2de.png
Safe
42 kB
Upload 20 files
about 1 year ago
audio_story_telling.png
Safe
280 kB
Upload 20 files
about 1 year ago
audiocaption.png
Safe
135 kB
Upload 20 files
about 1 year ago
emotion.png
Safe
33 kB
Upload 20 files
about 1 year ago
keywords.png
Safe
32.1 kB
Upload 20 files
about 1 year ago
music.png
Safe
45.3 kB
Upload 20 files
about 1 year ago
spoken_audio_query.png
Safe
234 kB
Upload 20 files
about 1 year ago
spoken_query.png
Safe
336 kB
Upload 20 files
about 1 year ago