Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
tsinghua-ee
/
SALMONN
like
38
Follow
Electronic Engineering @Tsinghua University
8
Automatic Speech Recognition
PyTorch
English
automatic-audio-captioning
automatic-speech-translation
music-captioning
audio-based-storytelling
speech-audio-coreasoning
auditory understanding
arxiv:
2310.13289
arxiv:
2406.15704
License:
apache-2.0
Model card
Files
Files and versions
Community
2
aa4521e
SALMONN
/
resource
/
audio_demo
/
duck.wav
Changli
chore: release v1
0bf5005
about 1 year ago
download
Copy download link
history
640 kB
This file contains binary data. It cannot be displayed, but you can still
download
it.