gcoderw commited on
Commit
72ddab5
·
verified ·
1 Parent(s): 7a716db

Clarify AS-20M standalone audio-only card

Browse files
Files changed (1) hide show
  1. README.md +9 -7
README.md CHANGED
@@ -78,15 +78,17 @@ triembed/checkpoints/mn20_native_lora_aistmix_audioheavy100k175k175k_continue_fr
78
 
79
  ## Local Gate Metrics
80
 
81
- The checkpoint-local heldout gate reported:
82
 
83
  | Metric | Score |
84
  |---|---:|
85
  | audio cosine | 0.8108 |
86
  | embedding Pearson | 0.7953 |
87
  | similarity Pearson | 0.8853 |
88
- | audio-to-text R@1, 1280d | 0.3216 |
89
- | text-to-audio R@1, 1280d | 0.3028 |
 
 
90
 
91
  ## MAEB Audio-Only Comparison
92
 
@@ -139,7 +141,7 @@ Artifacts:
139
 
140
  ## Limitations
141
 
142
- `AS-20M` is an embedding model only. It does not transcribe speech,
143
- classify audio events directly, or include a text encoder in this standalone
144
- release artifact. Text-audio retrieval evaluations use a separate compatible
145
- text encoder/head to score cross-modal alignment.
 
78
 
79
  ## Local Gate Metrics
80
 
81
+ The checkpoint-local heldout gate reported audio-side consistency metrics:
82
 
83
  | Metric | Score |
84
  |---|---:|
85
  | audio cosine | 0.8108 |
86
  | embedding Pearson | 0.7953 |
87
  | similarity Pearson | 0.8853 |
88
+
89
+ Internal training runs also tracked text-audio retrieval against a companion
90
+ text embedding space. Those numbers are not reported here as standalone model
91
+ capabilities because this release artifact does not include a text encoder.
92
 
93
  ## MAEB Audio-Only Comparison
94
 
 
141
 
142
  ## Limitations
143
 
144
+ `AS-20M` is an audio embedding model only. It does not transcribe speech,
145
+ classify audio events directly, or embed text. Text-audio retrieval requires
146
+ a separate compatible text encoder/head that is not included in this release
147
+ artifact.