Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
tsinghua-ee
/
SALMONN
like
38
Follow
Electronic Engineering @Tsinghua University
8
Automatic Speech Recognition
PyTorch
English
automatic-audio-captioning
automatic-speech-translation
music-captioning
audio-based-storytelling
speech-audio-coreasoning
auditory understanding
arxiv:
2310.13289
arxiv:
2406.15704
License:
apache-2.0
Model card
Files
Files and versions
Community
2
96bc929
SALMONN
/
requirements.txt
Changli
Create requirements.txt
aa63220
about 1 year ago
raw
Copy download link
history
blame
Safe
160 Bytes
torch==2.0.1
torchaudio==2.0.2
peft==0.3.0
soundfile
librosa
transformers==4.28.0
sentencepiece==0.1.97
accelerate==0.20.3
bitsandbytes==0.35.0
gradio==3.23.0