Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
tsinghua-ee
/
SALMONN
like
30
Automatic Speech Recognition
PyTorch
English
automatic-audio-captioning
automatic-speech-translation
music-captioning
audio-based-storytelling
speech-audio-coreasoning
auditory understanding
arxiv:
2310.13289
arxiv:
2406.15704
License:
apache-2.0
Model card
Files
Files and versions
Community
2
eaf3298
SALMONN
/
resource
/
audio_demo
5 contributors
History:
1 commit
Changli
Upload 20 files
144d332
10 months ago
asr.wav
176 kB
Upload 20 files
10 months ago
asr_en2de.wav
176 kB
Upload 20 files
10 months ago
audio_story_telling.wav
640 kB
Upload 20 files
10 months ago
audiocaption.wav
640 kB
Upload 20 files
10 months ago
emotion.wav
106 kB
Upload 20 files
10 months ago
keywords.flac
256 kB
Upload 20 files
10 months ago
music.wav
960 kB
Upload 20 files
10 months ago
spoken_audio_query.wav
320 kB
Upload 20 files
10 months ago
spoken_query.wav
58.4 kB
Upload 20 files
10 months ago