Commit b36e2f7
Parent(s): 1d58b15

add model

Files changed:
- README.md +3 -3
- config.json +1 -1
- tf_model.h5 +2 -2
README.md
CHANGED
@@ -14,7 +14,7 @@ probably proofread and complete it, then remove this comment. -->
 
 This model is a fine-tuned version of [distilgpt2](https://huggingface.co/distilgpt2) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Train Loss:
+- Train Loss: 5.5751
 - Epoch: 1
 
 ## Model description
@@ -41,8 +41,8 @@ The following hyperparameters were used during training:
 
 | Train Loss | Epoch |
 |:----------:|:-----:|
-|
-|
+| 6.0979 | 0 |
+| 5.5751 | 1 |
 
 
 ### Framework versions
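The rows added to the README's training-results table can be read programmatically to compare epochs. The following is a minimal sketch, not part of the commit: the table text is copied from the added lines above, and the helper name `parse_results_table` is hypothetical.

```python
# Sketch: parse the README's training-results table and compare epochs.
# TABLE is copied from the added lines of this diff; parse_results_table
# is a hypothetical helper, not something the repository provides.

TABLE = """\
| Train Loss | Epoch |
|:----------:|:-----:|
| 6.0979     | 0     |
| 5.5751     | 1     |
"""


def parse_results_table(text):
    """Return a list of (train_loss, epoch) tuples from a markdown table."""
    rows = []
    for line in text.strip().splitlines()[2:]:  # skip header and alignment rows
        cells = [cell.strip() for cell in line.strip().strip("|").split("|")]
        rows.append((float(cells[0]), int(cells[1])))
    return rows


rows = parse_results_table(TABLE)
first_loss, last_loss = rows[0][0], rows[-1][0]
# Round to the four decimals the README reports, to hide float noise.
loss_drop = round(first_loss - last_loss, 4)
print(f"epoch {rows[0][1]} -> {rows[-1][1]}: train loss fell by {loss_drop}")
```

The final-epoch value in the table is the same figure the README summary line reports as the train loss.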
config.json
CHANGED
@@ -41,5 +41,5 @@
   },
   "transformers_version": "4.17.0",
   "use_cache": false,
-  "vocab_size":
+  "vocab_size": 50257
 }
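A quick sanity check on the changed key might look like the sketch below. It assumes only the three keys visible in this hunk; the real config.json contains more fields that the diff does not show, and the helper name `check_vocab_size` is hypothetical. The value 50257 is the one on the added line, which is also the standard GPT-2 vocabulary size.

```python
import json

# Fragment mirroring the three keys visible in this diff; the full
# config.json has additional fields that are not shown here.
CONFIG_FRAGMENT = """
{
  "transformers_version": "4.17.0",
  "use_cache": false,
  "vocab_size": 50257
}
"""

EXPECTED_VOCAB_SIZE = 50257  # value taken from the added line of the diff


def check_vocab_size(config_text, expected):
    """Parse JSON config text and report whether vocab_size matches."""
    config = json.loads(config_text)
    return config.get("vocab_size") == expected


config_ok = check_vocab_size(CONFIG_FRAGMENT, EXPECTED_VOCAB_SIZE)
print("vocab_size matches:", config_ok)
```

To check a downloaded copy, the same function can be given the text of the local file instead of the inline fragment.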
tf_model.h5
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
-size
+oid sha256:aedebfa3bfb10faca2e81c7e205208ab122a761d79024cc8b52e755236057d83
+size 327745496
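The tf_model.h5 entry is a Git LFS pointer, so the diff records only the weight file's SHA-256 digest and byte size, not the weights themselves. A minimal sketch of how a downloaded file could be checked against the new pointer follows. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical, and the local path in the usage note is an assumption.

```python
import hashlib

# Pointer text copied from the added lines of this diff.
POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:aedebfa3bfb10faca2e81c7e205208ab122a761d79024cc8b52e755236057d83
size 327745496
"""


def parse_lfs_pointer(text):
    """Split each 'key value' line of a Git LFS pointer into a dict."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer, chunk_size=1 << 20):
    """Return True if the file at `path` has the pointer's size and SHA-256."""
    hasher = hashlib.sha256()
    total = 0
    with open(path, "rb") as handle:
        # Read in chunks so a file of several hundred MB is never fully in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
            total += len(chunk)
    return total == pointer["size"] and hasher.hexdigest() == pointer["digest"]


pointer = parse_lfs_pointer(POINTER)
print(pointer["algo"], pointer["size"])
```

Assuming the weights were saved locally as `tf_model.h5`, calling `matches_pointer("tf_model.h5", pointer)` would indicate whether the download corresponds to this commit.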