jed351 committed on
Commit
006bf74
1 Parent(s): 86a5699

Update README.md

Files changed (1)
  1. README.md +10 -11
README.md CHANGED
@@ -25,24 +25,23 @@ should probably proofread and complete it, then remove this comment. -->

  # gpt2-shikoto

- This model is a fine-tuned version of [jed351/gpt2-tiny-zh-hk](https://huggingface.co/jed351/gpt2-tiny-zh-hk) on the jed351/shikoto_zh_hk dataset.
- It achieves the following results on the evaluation set:
- - Loss: 3.2965
- - Accuracy: 0.3738

- ## Model description

- More information needed

- ## Intended uses & limitations

- More information needed

- ## Training and evaluation data

- More information needed

- ## Training procedure

  ### Training hyperparameters
 

  # gpt2-shikoto

+ This model was trained on a dataset I obtained from an online novel site.
+ **Please be aware that the stories might contain inappropriate content.**

+ The base model can be found [here](https://huggingface.co/jed351/gpt2-tiny-zh-hk); it was obtained by
+ patching a GPT2 Chinese model and its tokenizer with Cantonese characters.

+ ## Training procedure

+ Please refer to the [script](https://github.com/huggingface/transformers/tree/main/examples/pytorch/language-modeling)
+ provided by Hugging Face.
+
+
+ The model was trained for 400,000 steps on two NVIDIA Quadro RTX 6000 GPUs for around 15 hours.

  ### Training hyperparameters
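
The new README text says the base model was obtained by patching a GPT2 Chinese model and its tokenizer with Cantonese characters, but it does not show how. The following is a minimal, library-free sketch of the general idea, not the author's actual procedure: characters missing from the vocabulary get new ids, and the embedding table grows by one row per new id. The function `patch_vocab`, the toy vocabulary, the embedding size, and the example characters are all hypothetical. With the `transformers` library, the corresponding steps would normally be `tokenizer.add_tokens(...)` followed by `model.resize_token_embeddings(len(tokenizer))`.

```python
import random


def patch_vocab(vocab, embeddings, new_chars, dim, seed=0):
    """Add characters missing from `vocab` and grow `embeddings` to match.

    vocab:      dict mapping token -> row index into `embeddings`
    embeddings: list of rows, each a list of `dim` floats
    Returns the list of tokens that were actually added, in order.
    """
    rng = random.Random(seed)
    added = []
    for ch in new_chars:
        if ch in vocab:
            continue  # already covered by the base vocabulary
        vocab[ch] = len(embeddings)  # next free row index
        # New rows start as small random values; they only become
        # meaningful after further training on Cantonese text.
        embeddings.append([rng.gauss(0.0, 0.02) for _ in range(dim)])
        added.append(ch)
    return added


# Toy base vocabulary (hypothetical) with 4-dimensional embeddings.
dim = 4
vocab = {"[UNK]": 0, "我": 1, "你": 2, "的": 3}
embeddings = [[0.0] * dim for _ in vocab]

# A few characters common in written Cantonese; "你" is already present.
added = patch_vocab(vocab, embeddings, ["嘅", "咗", "你", "喺"], dim)
print(added, len(vocab), len(embeddings))
```

The existing rows are left untouched, so the pretrained Chinese embeddings are preserved and only the added characters have to be learned during fine-tuning.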