Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
tsinghua-ee
/
SALMONN
like
38
Follow
Electronic Engineering @Tsinghua University
8
Automatic Speech Recognition
PyTorch
English
automatic-audio-captioning
automatic-speech-translation
music-captioning
audio-based-storytelling
speech-audio-coreasoning
auditory understanding
arxiv:
2310.13289
arxiv:
2406.15704
License:
apache-2.0
Model card
Files
Files and versions
Community
2
main
SALMONN
/
resource
/
response_demo
5 contributors
History:
3 commits
Changli
chore: release v1
0bf5005
about 1 year ago
aac.png
Safe
13 kB
chore: release v1
about 1 year ago
aed.png
Safe
18.6 kB
chore: release v1
about 1 year ago
asr.png
Safe
13.8 kB
chore: release v1
about 1 year ago
emo.png
Safe
11.4 kB
chore: release v1
about 1 year ago
jsac.png
Safe
21 kB
chore: release v1
about 1 year ago
lyrics.png
Safe
40.7 kB
chore: release v1
about 1 year ago
mc.png
Safe
28.8 kB
chore: release v1
about 1 year ago
memo.png
Safe
32.3 kB
chore: release v1
about 1 year ago
pr.png
Safe
14.8 kB
chore: release v1
about 1 year ago
sac.png
Safe
29.1 kB
chore: release v1
about 1 year ago
sq.png
Safe
22.5 kB
chore: release v1
about 1 year ago
sr.png
Safe
15.9 kB
chore: release v1
about 1 year ago
story.png
Safe
71.1 kB
chore: release v1
about 1 year ago
title.png
Safe
27.3 kB
chore: release v1
about 1 year ago