Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
tsinghua-ee
/
SALMONN
like
26
Automatic Speech Recognition
PyTorch
English
automatic-audio-captioning
automatic-speech-translation
music-captioning
audio-based-storytelling
speech-audio-coreasoning
auditory understanding
arxiv:
2310.13289
License:
apache-2.0
Model card
Files
Files and versions
Community
1
main
SALMONN
/
resource
/
response_demo
3 contributors
History:
3 commits
Changli
chore: release v1
0bf5005
9 months ago
aac.png
13 kB
chore: release v1
9 months ago
aed.png
18.6 kB
chore: release v1
9 months ago
asr.png
13.8 kB
chore: release v1
9 months ago
emo.png
11.4 kB
chore: release v1
9 months ago
jsac.png
21 kB
chore: release v1
9 months ago
lyrics.png
40.7 kB
chore: release v1
9 months ago
mc.png
28.8 kB
chore: release v1
9 months ago
memo.png
32.3 kB
chore: release v1
9 months ago
pr.png
14.8 kB
chore: release v1
9 months ago
sac.png
29.1 kB
chore: release v1
9 months ago
sq.png
22.5 kB
chore: release v1
9 months ago
sr.png
15.9 kB
chore: release v1
9 months ago
story.png
71.1 kB
chore: release v1
9 months ago
title.png
27.3 kB
chore: release v1
9 months ago