Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
tsinghua-ee
/
SALMONN
like
34
Automatic Speech Recognition
PyTorch
English
automatic-audio-captioning
automatic-speech-translation
music-captioning
audio-based-storytelling
speech-audio-coreasoning
auditory understanding
arxiv:
2310.13289
arxiv:
2406.15704
License:
apache-2.0
Model card
Files
Files and versions
Community
2
d0647f1
SALMONN
/
resource
/
audio_demo
/
keywords.flac
Changli
Upload 20 files
144d332
11 months ago
download
Copy download link
history
No virus
256 kB
This file contains binary data. It cannot be displayed, but you can still
download
it.