Feature Extraction
Transformers
Safetensors
English
custom_model
multi-modal
conversational
speechllm
speech2text
custom_code
Commit 68d52d8 by shangeth
1 Parent(s): ee1b905

Update README.md

Files changed (1)
  1. README.md +30 -4
README.md CHANGED
@@ -22,6 +22,10 @@ tags:
 
 
 
+
+
+
+
 ## Model Details
 
 ### Model Description
@@ -34,9 +38,9 @@ This is the model card of a 🤗 transformers model that has been pushed on the
 - **Funded by [optional]:** [More Information Needed]
 - **Shared by [optional]:** [More Information Needed]
 - **Model type:** [More Information Needed]
-- **Language(s) (NLP):** [More Information Needed]
-- **License:** [More Information Needed]
-- **Finetuned from model [optional]:** [More Information Needed]
+- **Language(s) (NLP):** English
+- **License:** Apache 2.0
+- **Finetuned from model [optional]:** HubertX and TinyLlama
 
 ### Model Sources [optional]
 
@@ -84,7 +88,29 @@ Users (both direct and downstream) should be made aware of the risks, biases and
 
 Use the code below to get started with the model.
 
-[More Information Needed]
+```python
+# Load model directly from huggingface
+from transformers import AutoModel
+model = AutoModel.from_pretrained("shangeth/SpeechLLM", trust_remote_code=True)
+
+model.generate_meta(
+    audio_path="path-to-audio.wav",
+    instruction="Give me the following information about the audio [SpeechActivity, Transcript, Gender, Emotion, Age, Accent]",
+    max_new_tokens=500,
+    return_special_tokens=False
+)
+
+# Model Generation
+'''
+{ "SpeechActivity" : "True",
+"Transcript": "Yes, I got it. I'll make the payment now.",
+"Gender": "Female",
+"Emotion": "Neutral",
+"Age": "Young",
+"Accent" : "America",
+}
+'''
+```
 
 ## Training Details
 
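The sample generation in the README snippet is a JSON-like string with a trailing comma before the closing brace, so a strict `json.loads` call would reject it as shown. The sketch below is one possible way to turn such a string into a dictionary. It assumes `generate_meta` returns text shaped like that sample, which the diff does not confirm, and `parse_meta` is a hypothetical helper, not part of SpeechLLM or `transformers`.

```python
import json
import re


def parse_meta(raw: str) -> dict:
    """Parse a JSON-like string, tolerating a trailing comma before } or ].

    Hypothetical helper for illustration; not part of SpeechLLM.
    """
    # Drop any comma that is followed only by whitespace and a closing
    # brace or bracket. Caveat: this would also rewrite such a sequence
    # if it occurred inside a quoted value.
    cleaned = re.sub(r",\s*([}\]])", r"\1", raw)
    return json.loads(cleaned)


# Sample text copied from the README snippet; real output may differ.
sample = '''
{ "SpeechActivity" : "True",
"Transcript": "Yes, I got it. I'll make the payment now.",
"Gender": "Female",
"Emotion": "Neutral",
"Age": "Young",
"Accent" : "America",
}
'''

meta = parse_meta(sample)
print(meta["Transcript"])
```

Note that every value in the sample is a string, including `"SpeechActivity": "True"`, so downstream code would need to convert it explicitly if a boolean is wanted.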