Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
tsinghua-ee
/
SALMONN-7B
like
22
Automatic Speech Recognition
English
automatic-audio-captioning
automatic-speech-translation
music-captioning
audio-based-storytelling
speech-audio-coreasoning
auditory understanding
arxiv:
2310.13289
License:
apache-2.0
Model card
Files
Files and versions
Community
main
SALMONN-7B
/
resource
/
audio_demo
2 contributors
History:
1 commit
tangchangli
chore: init repo
7cf7820
7 months ago
duck.wav
640 kB
chore: init repo
7 months ago
excitement.wav
40.4 kB
chore: init repo
7 months ago
gunshots.wav
320 kB
chore: init repo
7 months ago
mountain.wav
79.1 kB
chore: init repo
7 months ago
music.wav
639 kB
chore: init repo
7 months ago