shangeth/SpeechLLM

Feature Extraction

shangeth committed on May 28, 2024

Commit 68d52d8 · verified · 1 Parent(s): ee1b905

Update README.md

Files changed (1)

README.md +30 -4

@@ -22,6 +22,10 @@ tags:
 ## Model Details
 ### Model Description
@@ -34,9 +38,9 @@ This is the model card of a 🤗 transformers model that has been pushed on the
 - **Funded by [optional]:** [More Information Needed]
 - **Shared by [optional]:** [More Information Needed]
 - **Model type:** [More Information Needed]
-- **Language(s) (NLP):** [More Information Needed]
-- **License:** [More Information Needed]
-- **Finetuned from model [optional]:** [More Information Needed]
+- **Language(s) (NLP):** English
+- **License:** Apache 2.0
+- **Finetuned from model [optional]:** HubertX and TinyLlama
 ### Model Sources [optional]
@@ -84,7 +88,29 @@ Users (both direct and downstream) should be made aware of the risks, biases and
 Use the code below to get started with the model.
-[More Information Needed]
+```python
+# Load the model directly from the Hugging Face Hub
+from transformers import AutoModel
+model = AutoModel.from_pretrained("shangeth/SpeechLLM", trust_remote_code=True)
+model.generate_meta(
+    audio_path="path-to-audio.wav",
+    instruction="Give me the following information about the audio [SpeechActivity, Transcript, Gender, Emotion, Age, Accent]",
+    max_new_tokens=500,
+    return_special_tokens=False
+)
+# Example model generation
+'''
+{ "SpeechActivity" : "True",
+  "Transcript": "Yes, I got it. I'll make the payment now.",
+  "Gender": "Female",
+  "Emotion": "Neutral",
+  "Age": "Young",
+  "Accent" : "America",
+  }
+'''
+```
 ## Training Details
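
The example generation in the README snippet is JSON-like but ends with a trailing comma, so a strict `json.loads` call would reject it. The sketch below shows one way to turn such text into a dictionary. It assumes `generate_meta` returns the generation as a string, which the README does not confirm, and `parse_meta_output` is a hypothetical helper, not part of the model's API.

```python
import json
import re


def parse_meta_output(raw: str) -> dict:
    """Parse JSON-like text, tolerating trailing commas before } or ].

    Hypothetical helper for illustration; not part of SpeechLLM.
    """
    # Remove any comma that is followed only by whitespace and a closing bracket.
    # Caveat: this would also alter a string value that itself contains ",}" or ",]".
    cleaned = re.sub(r",\s*([}\]])", r"\1", raw)
    return json.loads(cleaned)


# Sample text copied from the README's example generation.
sample = '''
{ "SpeechActivity" : "True",
  "Transcript": "Yes, I got it. I'll make the payment now.",
  "Gender": "Female",
  "Emotion": "Neutral",
  "Age": "Young",
  "Accent" : "America",
  }
'''

meta = parse_meta_output(sample)
print(meta["Transcript"])
```

All values arrive as strings, including `"SpeechActivity"`, so any conversion to booleans or enums is left to the caller.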