Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
tsinghua-ee
/
SALMONN
like
38
Follow
Electronic Engineering @Tsinghua University
8
Automatic Speech Recognition
PyTorch
English
automatic-audio-captioning
automatic-speech-translation
music-captioning
audio-based-storytelling
speech-audio-coreasoning
auditory understanding
arxiv:
2310.13289
arxiv:
2406.15704
License:
apache-2.0
Model card
Files
Files and versions
Community
2
96f3cb5
SALMONN
/
resource
/
audio_demo
5 contributors
History:
1 commit
Changli
Upload 20 files
144d332
about 1 year ago
asr.wav
Safe
176 kB
Upload 20 files
about 1 year ago
asr_en2de.wav
Safe
176 kB
Upload 20 files
about 1 year ago
audio_story_telling.wav
Safe
640 kB
Upload 20 files
about 1 year ago
audiocaption.wav
Safe
640 kB
Upload 20 files
about 1 year ago
emotion.wav
Safe
106 kB
Upload 20 files
about 1 year ago
keywords.flac
Safe
256 kB
Upload 20 files
about 1 year ago
music.wav
Safe
960 kB
Upload 20 files
about 1 year ago
spoken_audio_query.wav
Safe
320 kB
Upload 20 files
about 1 year ago
spoken_query.wav
Safe
58.4 kB
Upload 20 files
about 1 year ago