Clarify AS-20M standalone audio-only card
Browse files
README.md
CHANGED
|
@@ -78,15 +78,17 @@ triembed/checkpoints/mn20_native_lora_aistmix_audioheavy100k175k175k_continue_fr
|
|
| 78 |
|
| 79 |
## Local Gate Metrics
|
| 80 |
|
| 81 |
-
The checkpoint-local heldout gate reported:
|
| 82 |
|
| 83 |
| Metric | Score |
|
| 84 |
|---|---:|
|
| 85 |
| audio cosine | 0.8108 |
|
| 86 |
| embedding Pearson | 0.7953 |
|
| 87 |
| similarity Pearson | 0.8853 |
|
| 88 |
-
|
| 89 |
-
|
|
|
|
|
|
|
| 90 |
|
| 91 |
## MAEB Audio-Only Comparison
|
| 92 |
|
|
@@ -139,7 +141,7 @@ Artifacts:
|
|
| 139 |
|
| 140 |
## Limitations
|
| 141 |
|
| 142 |
-
`AS-20M` is an embedding model only. It does not transcribe speech,
|
| 143 |
-
classify audio events directly, or
|
| 144 |
-
|
| 145 |
-
|
|
|
|
| 78 |
|
| 79 |
## Local Gate Metrics
|
| 80 |
|
| 81 |
+
The checkpoint-local heldout gate reported audio-side consistency metrics:
|
| 82 |
|
| 83 |
| Metric | Score |
|
| 84 |
|---|---:|
|
| 85 |
| audio cosine | 0.8108 |
|
| 86 |
| embedding Pearson | 0.7953 |
|
| 87 |
| similarity Pearson | 0.8853 |
|
| 88 |
+
|
| 89 |
+
Internal training runs also tracked text-audio retrieval against a companion
|
| 90 |
+
text embedding space. Those numbers are not reported here as standalone model
|
| 91 |
+
capabilities because this release artifact does not include a text encoder.
|
| 92 |
|
| 93 |
## MAEB Audio-Only Comparison
|
| 94 |
|
|
|
|
| 141 |
|
| 142 |
## Limitations
|
| 143 |
|
| 144 |
+
`AS-20M` is an audio embedding model only. It does not transcribe speech,
|
| 145 |
+
classify audio events directly, or embed text. Text-audio retrieval requires
|
| 146 |
+
a separate compatible text encoder/head that is not included in this release
|
| 147 |
+
artifact.
|