Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
tsinghua-ee
/
SALMONN
like
30
Automatic Speech Recognition
PyTorch
English
automatic-audio-captioning
automatic-speech-translation
music-captioning
audio-based-storytelling
speech-audio-coreasoning
auditory understanding
arxiv:
2310.13289
arxiv:
2406.15704
License:
apache-2.0
Model card
Files
Files and versions
Community
2
144d332
SALMONN
/
resource
/
audio_demo
/
spoken_query.wav
Changli
Upload 20 files
144d332
10 months ago
download
Copy download link
history
No virus
58.4 kB
This file contains binary data. It cannot be displayed, but you can still
download
it.