vishnun25 committed on
Commit a1f580c · 1 Parent(s): c5a6b62
README.md CHANGED
@@ -19,7 +19,7 @@ should probably proofread and complete it, then remove this comment. -->
 
 This model is a fine-tuned version of [distilgpt2](https://huggingface.co/distilgpt2) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 4.8013
+- Loss: 4.4572
 
 ## Model description
 
@@ -44,15 +44,17 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 3.0
+- num_epochs: 5
 
 ### Training results
 
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| No log | 1.0 | 226 | 5.0612 |
-| No log | 2.0 | 452 | 4.8513 |
-| 5.3272 | 3.0 | 678 | 4.8013 |
+| 5.6438 | 1.0 | 907 | 4.8026 |
+| 4.774 | 2.0 | 1814 | 4.5953 |
+| 4.5745 | 3.0 | 2721 | 4.5070 |
+| 4.4677 | 4.0 | 3628 | 4.4688 |
+| 4.4294 | 5.0 | 4535 | 4.4572 |
 
 
 ### Framework versions
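
To put the README change in perspective, the evaluation losses can be converted to perplexity. This is a minimal sketch, assuming the reported loss is a mean per-token cross-entropy in nats (the usual convention for causal language model fine-tuning, but not stated in the README). The constant and function names are illustrative and do not come from the repository.

```python
import math

# Final validation losses copied from the README diff.
OLD_EVAL_LOSS = 4.8013  # previous run, num_epochs: 3.0
NEW_EVAL_LOSS = 4.4572  # this commit, num_epochs: 5


def perplexity(mean_cross_entropy_nats: float) -> float:
    """Return exp(loss); valid only if the loss is a mean cross-entropy in nats."""
    return math.exp(mean_cross_entropy_nats)


old_ppl = perplexity(OLD_EVAL_LOSS)
new_ppl = perplexity(NEW_EVAL_LOSS)

print(f"old perplexity: {old_ppl:.2f}")
print(f"new perplexity: {new_ppl:.2f}")
print(f"relative reduction: {1 - new_ppl / old_ppl:.1%}")
```

The steps per epoch also changed between the two runs (226 versus 907), which suggests the training data or batch size differed, so the two losses may not be directly comparable.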
pytorch_model.bin CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:2c17c7bac3979d533e51bc9ce2b487c17959573f91fcd6719b3a91901878b6cc
+oid sha256:49f654694a392507d48db1dda8fdea68bc8a0c427464733a3fe1011ec8f4ba66
 size 333972957
runs/Aug14_05-01-45_11545100a714/1628917342.3469803/events.out.tfevents.1628917342.11545100a714.63.1 ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:3cbe1c684ca1a2d6bca0717d187eb82d19c55853f82f584b2a81ec2378c3d6ef
+size 4199
runs/Aug14_05-01-45_11545100a714/events.out.tfevents.1628917342.11545100a714.63.0 ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:12f5cc384f18d7b54c6c9e8f4f208a3c08ebf2237eef509d520594b44c4d3fac
+size 6380
runs/Aug14_05-01-45_11545100a714/events.out.tfevents.1628917791.11545100a714.63.2 ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:2ae86af5a84c65faad9ca63d5d461e2c3d1fad857d7a455082e848416da0481c
+size 311
training_args.bin CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:358f821f4a3bf829e8a87ea2600263fd7eaf0d054f9e2bd7e0a668aff4ef689e
+oid sha256:18f9d74409c2fbb747802e36cdc7dca06edd0d2ad99cc7fcc22806e61eee66bc
 size 2671
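
The binary files in this commit appear only as Git LFS pointer files (a `version` line, an `oid` line, and a `size` line). As a rough sketch of how such pointers can be read and compared, the snippet below parses the old and new pointers for `pytorch_model.bin` copied from the diff above. The helper `parse_lfs_pointer` is hypothetical and is not part of Git LFS or any library.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file into a dict."""
    fields = {}
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid has the form "<hash-algorithm>:<hex-digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


# Pointer contents copied from the pytorch_model.bin diff.
OLD_POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:2c17c7bac3979d533e51bc9ce2b487c17959573f91fcd6719b3a91901878b6cc
size 333972957
"""
NEW_POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:49f654694a392507d48db1dda8fdea68bc8a0c427464733a3fe1011ec8f4ba66
size 333972957
"""

old = parse_lfs_pointer(OLD_POINTER)
new = parse_lfs_pointer(NEW_POINTER)

# Same byte size but a different digest: the weights changed, the file layout did not.
print("size unchanged:", old["size"] == new["size"])
print("content changed:", old["digest"] != new["digest"])
```

An identical size with a different SHA-256 digest is what one would expect when the same architecture is retrained: the tensor shapes are unchanged while the weight values differ.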