austindavis committed on
Commit
bb72387
1 Parent(s): 74a6b58

End of training

Files changed (3)
  1. README.md +12 -10
  2. generation_config.json +6 -3
  3. model.safetensors +1 -1
README.md CHANGED
@@ -1,24 +1,20 @@
  ---
+ base_model: austindavis/gpt2-lichess-uci-201601
  tags:
  - generated_from_trainer
- widget:
- - text: e2e4
-   example_title: King's pawn
- - text: d2d4
-   example_title: Queen's pawn
  model-index:
- - name: austindavis/gpt2-pretrained-lichess-uci
+ - name: gpt2-lichess-uci-2016-01_11
    results: []
  ---
 
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
  should probably proofread and complete it, then remove this comment. -->
 
- # gpt2-pretrained-lichess-uci-finetuned-lichess-uci
+ # gpt2-lichess-uci-2016-01_11
 
- This model is a Pretrained GPT-2 trained on an the Lichess UCI dataset from Feb 2013.
+ This model is a fine-tuned version of [austindavis/gpt2-lichess-uci-201601](https://huggingface.co/austindavis/gpt2-lichess-uci-201601) on an unknown dataset.
  It achieves the following results on the evaluation set:
- - Loss: 1.3084
+ - Loss: 1.0379
 
  ## Model description
 
@@ -37,7 +33,7 @@ More information needed
  ### Training hyperparameters
 
  The following hyperparameters were used during training:
- - learning_rate: 0.0002
+ - learning_rate: 0.0001715755714441261
  - train_batch_size: 16
  - eval_batch_size: 8
  - seed: 42
@@ -45,6 +41,12 @@ The following hyperparameters were used during training:
  - lr_scheduler_type: cosine
  - num_epochs: 1
 
+ ### Training results
+
+ | Training Loss | Epoch | Step | Validation Loss |
+ |:-------------:|:-----:|:------:|:---------------:|
+ | 1.0634 | 1.0 | 266171 | 1.0379 |
+
 
  ### Framework versions
 
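The evaluation loss in the README diff moves from 1.3084 to 1.0379. As a rough aid to reading those numbers, the sketch below converts a loss to perplexity. It assumes the reported loss is a mean per-token cross-entropy in nats, which is the usual convention for causal language models but is not stated in this commit. The function name `perplexity` is illustrative. The old and new cards name different base models and datasets, so the two values may not be directly comparable.

```python
import math


def perplexity(cross_entropy_nats: float) -> float:
    """Convert a mean per-token cross-entropy (assumed to be in nats) to perplexity."""
    return math.exp(cross_entropy_nats)


old_loss = 1.3084  # evaluation loss in the README before this commit
new_loss = 1.0379  # evaluation loss in the README after this commit

print(f"old perplexity: {perplexity(old_loss):.3f}")
print(f"new perplexity: {perplexity(new_loss):.3f}")
```

Under that assumption, a lower perplexity means the model spreads its probability over fewer candidate next tokens on average.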
generation_config.json CHANGED
@@ -1,6 +1,9 @@
  {
- "_from_model_config": true,
- "bos_token_id": 1,
+ "do_sample": true,
  "eos_token_id": 2,
+ "max_length": 128,
+ "max_new_tokens": 128,
+ "pad_token_id": 0,
+ "temperature": 0.0001,
  "transformers_version": "4.40.1"
- }
+ }
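The new generation config enables sampling (`do_sample: true`) but sets `temperature` to 0.0001. The sketch below illustrates why that combination behaves almost like greedy decoding: dividing logits by a very small temperature before the softmax pushes nearly all probability onto the largest logit. The logit values and the helper name `softmax_with_temperature` are hypothetical, chosen only for illustration. This is not the library's implementation.

```python
import math


def softmax_with_temperature(logits, temperature):
    """Temperature-scaled softmax, shifted by the max for numerical stability."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)
    # After subtracting the peak every exponent is <= 0, so exp() cannot overflow.
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


logits = [2.0, 1.5, 0.1]  # hypothetical next-token logits

probs_default = softmax_with_temperature(logits, 1.0)
probs_cold = softmax_with_temperature(logits, 0.0001)  # value from this commit

print("temperature 1.0:   ", [round(p, 3) for p in probs_default])
print("temperature 0.0001:", [round(p, 3) for p in probs_cold])
```

The config also sets both `max_length` and `max_new_tokens` to 128. In transformers, `max_new_tokens` generally takes precedence when both are present, so the explicit `max_length` is likely redundant here.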
model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:d4f7485b1a913fd8cfbf70278c5bfe5ae943d596f79e937950229b97ded75469
+ oid sha256:9f610df8ff91108cf4949c4c4057ebebc783fedb8857c07f4ab5b5528d88f681
  size 102086376
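The `model.safetensors` entry is a Git LFS pointer: only the `oid` changes, while the size stays at 102086376 bytes. That is consistent with updated weights of the same shape. A minimal sketch of how a downloaded file could be checked against such a pointer follows. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical, and the check reads the whole file into memory for simplicity.

```python
import hashlib


def parse_lfs_pointer(text):
    """Parse the 'key value' lines of a Git LFS pointer file into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(data: bytes, pointer) -> bool:
    """Return True if the bytes have the size and hash recorded in the pointer."""
    if len(data) != pointer["size"]:
        return False
    return hashlib.new(pointer["algo"], data).hexdigest() == pointer["digest"]


# Pointer contents after this commit, copied from the diff above.
pointer_text = """version https://git-lfs.github.com/spec/v1
oid sha256:9f610df8ff91108cf4949c4c4057ebebc783fedb8857c07f4ab5b5528d88f681
size 102086376
"""

pointer = parse_lfs_pointer(pointer_text)
print(pointer["algo"], pointer["size"])
```

To use it, read the downloaded file as bytes (for example with `open(path, "rb").read()`, where `path` is wherever the file was saved) and pass the result to `matches_pointer`.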