Update README.md
Browse files
README.md
CHANGED
@@ -5,7 +5,7 @@ datasets:
|
|
5 |
language:
|
6 |
- en
|
7 |
---
|
8 |
-
# Model Card for
|
9 |
|
10 |
This is a finetuned model of Cerebras 1.3B model using DataBricksLabs Dolly Framework
|
11 |
|
@@ -179,45 +179,20 @@ Carbon emissions can be estimated using the [Machine Learning Impact calculator]
|
|
179 |
|
180 |
### Model Architecture and Objective
|
181 |
|
182 |
-
|
183 |
-
|
184 |
-
|
185 |
-
|
186 |
-
[More Information Needed]
|
187 |
|
188 |
#### Hardware
|
189 |
|
190 |
-
|
191 |
|
192 |
#### Software
|
193 |
|
194 |
-
|
195 |
-
|
196 |
-
## Citation [optional]
|
197 |
-
|
198 |
-
<!-- If there is a paper or blog post introducing the model, the APA and Bibtex information for that should go in this section. -->
|
199 |
-
|
200 |
-
**BibTeX:**
|
201 |
-
|
202 |
-
[More Information Needed]
|
203 |
-
|
204 |
-
**APA:**
|
205 |
-
|
206 |
-
[More Information Needed]
|
207 |
-
|
208 |
-
## Glossary [optional]
|
209 |
-
|
210 |
-
<!-- If relevant, include terms and calculations in this section that can help readers understand the model or model card. -->
|
211 |
|
212 |
-
[More Information Needed]
|
213 |
|
214 |
-
## More Information [optional]
|
215 |
-
|
216 |
-
[More Information Needed]
|
217 |
-
|
218 |
-
## Model Card Authors [optional]
|
219 |
-
|
220 |
-
[More Information Needed]
|
221 |
|
222 |
## Model Card Contact
|
223 |
|
|
|
5 |
language:
|
6 |
- en
|
7 |
---
|
8 |
+
# Model Card for Cerebras 1.3b Dollyfied.
|
9 |
|
10 |
This is a finetuned model of Cerebras 1.3B model using DataBricksLabs Dolly Framework
|
11 |
|
|
|
179 |
|
180 |
### Model Architecture and Objective
|
181 |
|
182 |
+
GPT2 Cerebras-GPT 1.3B
|
183 |
+
Layers 24
|
184 |
+
n_embd 2048
|
185 |
+
Heads 16
|
|
|
186 |
|
187 |
#### Hardware
|
188 |
|
189 |
+
8xA100s
|
190 |
|
191 |
#### Software
|
192 |
|
193 |
+
https://github.com/databrickslabs/dolly
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
194 |
|
|
|
195 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
196 |
|
197 |
## Model Card Contact
|
198 |
|