Update README.md
README.md
CHANGED
@@ -2,7 +2,7 @@
 license: apache-2.0
 ---
 # SALMONN: Speech Audio Language Music Open Neural Network
-<div align=center><img src="
+<div align=center><img src="https://cdn-uploads.huggingface.co/production/uploads/63770389cdcc1bf630870758/sr9ABG_rv6P-VgesTMhOC.png" height="256px" width="256px"/></div>
 
 Welcome to the repo of **SALMONN**!
 
@@ -19,7 +19,7 @@ We will open source the code and the model checkpoint soon. Stay tuned!
 
 SALMONN adopts a speech & audio encoder to encode generic audio representation, then uses an audio-text aligner to map the audio feature into textual space. Finally, the large language model answers based on the textual prompt and the auditory tokens.
 
-<div align=center><img src="
+<div align=center><img src="https://cdn-uploads.huggingface.co/production/uploads/63770389cdcc1bf630870758/TEZzr54VZ5yc34LeixFbi.png" height="75%" width="75%"/></div>
 
 ## Demos
 