Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
tsinghua-ee
/
SALMONN
like
38
Follow
Electronic Engineering @Tsinghua University
8
Automatic Speech Recognition
PyTorch
English
automatic-audio-captioning
automatic-speech-translation
music-captioning
audio-based-storytelling
speech-audio-coreasoning
auditory understanding
arxiv:
2310.13289
arxiv:
2406.15704
License:
apache-2.0
Model card
Files
Files and versions
Community
2
59f41c2
SALMONN
5 contributors
History:
19 commits
Changli
Update README.md
59f41c2
about 1 year ago
beats
chore: release v1
about 1 year ago
other_third-party_licenses
chore: release v1
about 1 year ago
qformer
chore: release v1
about 1 year ago
resource
chore: release v1
about 1 year ago
.gitattributes
Safe
56 Bytes
chore: release v1
about 1 year ago
.gitignore
Safe
3.1 kB
chore: release v1
about 1 year ago
LICENSE
Safe
11.3 kB
chore: release v1
about 1 year ago
README.md
Safe
5.08 kB
Update README.md
about 1 year ago
cli_inference.py
Safe
1.88 kB
chore: release v1
about 1 year ago
index.html
Safe
540 Bytes
chore: release v1
about 1 year ago
model.py
Safe
9.79 kB
chore: release v1
about 1 year ago
salmonn_v1.pth
Safe
pickle
Detected Pickle imports (4)
"collections.OrderedDict"
,
"torch.LongStorage"
,
"torch._utils._rebuild_tensor_v2"
,
"torch.FloatStorage"
What is a pickle import?
400 MB
LFS
Upload salmonn_v1.pth
about 1 year ago
web_demo.py
Safe
7.2 kB
chore: release v1
about 1 year ago