senstella
/

csm-expressiva-1b

senstella commited on Apr 11

Commit

28e29f6

verified ·

1 Parent(s): 5c9d887

added note about intention behind the model

Files changed (1) hide show

README.md CHANGED Viewed

@@ -6,6 +6,7 @@ language:
 - en
 base_model:
 - sesame/csm-1b
 ---
 ## csm-experssiva
@@ -69,4 +70,12 @@ audiofile.write("./audio.wav", np.asarray(audio), 24000)
 The future plan is to implement KTO on `csm-mlx` and further mitigate model failure cases using that approach.
-Licence follows Expresso dataset's `cc-by-nc-4.0`!

 - en
 base_model:
 - sesame/csm-1b
+pipeline_tag: text-to-audio
 ---
 ## csm-experssiva
 The future plan is to implement KTO on `csm-mlx` and further mitigate model failure cases using that approach.
+**Note**
+This model was fine-tuned to investigate whether the CSM-1b model exhibits emergent capacity to effectively compress and reconstruct whisper-style vocal features - something that traditional TTS models do not usually demonstrate.
+It also serves as a preliminary verification of the csm-mlx training setup and the correctness of its loss function.
+I want to make it clear that I do **not endorse or encourage** any inappropriate use of this model. Any unintended associations or interpretations do not reflect the intent behind this model.
+**License**
+Licence follows Expresso dataset's `cc-by-nc-4.0`, since it's trained from it!