Commit
·
16023fc
1
Parent(s):
98b4d91
Update README.md
Browse files
README.md
CHANGED
@@ -1,3 +1,25 @@
|
|
1 |
---
|
2 |
license: mit
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
3 |
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
---
|
2 |
license: mit
|
3 |
+
datasets:
|
4 |
+
- DarwinAnim8or/grug
|
5 |
+
language:
|
6 |
+
- en
|
7 |
+
pipeline_tag: text-generation
|
8 |
+
tags:
|
9 |
+
- grug
|
10 |
+
- caveman
|
11 |
+
- fun
|
12 |
---
|
13 |
+
|
14 |
+
# GPT-Grug-125m
|
15 |
+
A finetuned version of [GPT-Neo-125M](https://huggingface.co/EleutherAI/gpt-neo-125M) on the 'grug' dataset.
|
16 |
+
|
17 |
+
# Training Procedure
|
18 |
+
This was trained on the 'grug' dataset, using the "HappyTransformers" library on Google Colab.
|
19 |
+
This model was trained for 4 epochs with learning rate 1e-2.
|
20 |
+
|
21 |
+
# Biases & Limitations
|
22 |
+
This likely contains the same biases and limitations as the original GPT-Neo-125M that it is based on, and additionally heavy biases from the grug datasets.
|
23 |
+
|
24 |
+
# Intended Use
|
25 |
+
This model is meant for fun, please do not take anything this caveman says seriously.
|