raygx commited on
Commit
61c6a82
1 Parent(s): 31554f5

Finetuned raygx/GPT2-Nepali-Casual-LM model for generating Covid-News; 10 Epochs

Browse files
Files changed (2) hide show
  1. README.md +13 -8
  2. tf_model.h5 +1 -1
README.md CHANGED
@@ -13,9 +13,9 @@ probably proofread and complete it, then remove this comment. -->
13
 
14
  This model was trained from scratch on an unknown dataset.
15
  It achieves the following results on the evaluation set:
16
- - Train Loss: 6.2871
17
- - Validation Loss: 6.6830
18
- - Epoch: 4
19
 
20
  ## Model description
21
 
@@ -41,11 +41,16 @@ The following hyperparameters were used during training:
41
 
42
  | Train Loss | Validation Loss | Epoch |
43
  |:----------:|:---------------:|:-----:|
44
- | 6.6684 | 6.9298 | 0 |
45
- | 6.5617 | 6.8545 | 1 |
46
- | 6.4641 | 6.7858 | 2 |
47
- | 6.3725 | 6.7307 | 3 |
48
- | 6.2871 | 6.6830 | 4 |
 
 
 
 
 
49
 
50
 
51
  ### Framework versions
 
13
 
14
  This model was trained from scratch on an unknown dataset.
15
  It achieves the following results on the evaluation set:
16
+ - Train Loss: 5.6414
17
+ - Validation Loss: 6.3015
18
+ - Epoch: 9
19
 
20
  ## Model description
21
 
 
41
 
42
  | Train Loss | Validation Loss | Epoch |
43
  |:----------:|:---------------:|:-----:|
44
+ | 6.2059 | 6.6394 | 0 |
45
+ | 6.1308 | 6.6034 | 1 |
46
+ | 6.0590 | 6.5447 | 2 |
47
+ | 5.9910 | 6.5061 | 3 |
48
+ | 5.9264 | 6.4637 | 4 |
49
+ | 5.8640 | 6.4168 | 5 |
50
+ | 5.8058 | 6.3805 | 6 |
51
+ | 5.7492 | 6.3604 | 7 |
52
+ | 5.6948 | 6.3189 | 8 |
53
+ | 5.6414 | 6.3015 | 9 |
54
 
55
 
56
  ### Framework versions
tf_model.h5 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:8c3f75b590baa62b8099ee67c4c4c637089385611ccb93759010b0848a320924
3
  size 357679600
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:16708d7488c0291f4f34a9e6055f2b805e41c5cba5ce9428c7a7e808ecd64b32
3
  size 357679600