Vlasta committed on
Commit 3702828
1 Parent(s): 32b52e0

update model card README.md

Files changed (1)
  1. README.md +39 -44
README.md CHANGED
@@ -2,18 +2,18 @@
  tags:
  - generated_from_trainer
  model-index:
- - name: DNADebertaSentencepiece10k
  results: []
  ---

  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
  should probably proofread and complete it, then remove this comment. -->

- # DNADebertaSentencepiece10k

  This model is a fine-tuned version of [](https://huggingface.co/) on the None dataset.
  It achieves the following results on the evaluation set:
- - Loss: 5.5666

  ## Model description
 
@@ -47,47 +47,42 @@ The following hyperparameters were used during training:
  | Training Loss | Epoch | Step | Validation Loss |
  |:-------------:|:-----:|:------:|:---------------:|
- | 7.1504 | 0.36 | 5000 | 7.0604 |
- | 7.0431 | 0.72 | 10000 | 7.0307 |
- | 7.0219 | 1.08 | 15000 | 7.0186 |
- | 7.0099 | 1.45 | 20000 | 7.0058 |
- | 6.9686 | 1.81 | 25000 | 6.8723 |
- | 6.8449 | 2.17 | 30000 | 6.7980 |
- | 6.7654 | 2.53 | 35000 | 6.7057 |
- | 6.6418 | 2.89 | 40000 | 6.5286 |
- | 6.4225 | 3.25 | 45000 | 6.2286 |
- | 6.1859 | 3.61 | 50000 | 6.0729 |
- | 6.0727 | 3.97 | 55000 | 5.9866 |
- | 5.998 | 4.34 | 60000 | 5.9212 |
- | 5.945 | 4.7 | 65000 | 5.8824 |
- | 5.904 | 5.06 | 70000 | 5.8446 |
- | 5.8689 | 5.42 | 75000 | 5.8139 |
- | 5.8431 | 5.78 | 80000 | 5.7862 |
- | 5.8186 | 6.14 | 85000 | 5.7655 |
- | 5.7957 | 6.5 | 90000 | 5.7447 |
- | 5.7803 | 6.86 | 95000 | 5.7249 |
- | 5.765 | 7.23 | 100000 | 5.7107 |
- | 5.747 | 7.59 | 105000 | 5.7004 |
- | 5.7345 | 7.95 | 110000 | 5.6835 |
- | 5.7221 | 8.31 | 115000 | 5.6728 |
- | 5.7106 | 8.67 | 120000 | 5.6622 |
- | 5.7018 | 9.03 | 125000 | 5.6516 |
- | 5.692 | 9.39 | 130000 | 5.6390 |
- | 5.6791 | 9.75 | 135000 | 5.6313 |
- | 5.6751 | 10.12 | 140000 | 5.6250 |
- | 5.6649 | 10.48 | 145000 | 5.6182 |
- | 5.6601 | 10.84 | 150000 | 5.6103 |
- | 5.6542 | 11.2 | 155000 | 5.6059 |
- | 5.6468 | 11.56 | 160000 | 5.5957 |
- | 5.6393 | 11.92 | 165000 | 5.5915 |
- | 5.6362 | 12.28 | 170000 | 5.5880 |
- | 5.6328 | 12.64 | 175000 | 5.5835 |
- | 5.6261 | 13.01 | 180000 | 5.5775 |
- | 5.6218 | 13.37 | 185000 | 5.5753 |
- | 5.6215 | 13.73 | 190000 | 5.5701 |
- | 5.6163 | 14.09 | 195000 | 5.5697 |
- | 5.6151 | 14.45 | 200000 | 5.5667 |
- | 5.6129 | 14.81 | 205000 | 5.5651 |


  ### Framework versions
 
  tags:
  - generated_from_trainer
  model-index:
+ - name: DNADebertaSentencepiece30k
  results: []
  ---

  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
  should probably proofread and complete it, then remove this comment. -->

+ # DNADebertaSentencepiece30k

  This model is a fine-tuned version of [](https://huggingface.co/) on the None dataset.
  It achieves the following results on the evaluation set:
+ - Loss: 6.3257

  ## Model description
 
 
  | Training Loss | Epoch | Step | Validation Loss |
  |:-------------:|:-----:|:------:|:---------------:|
+ | 7.9373 | 0.41 | 5000 | 7.8263 |
+ | 7.8005 | 0.81 | 10000 | 7.7871 |
+ | 7.7704 | 1.22 | 15000 | 7.7630 |
+ | 7.7477 | 1.62 | 20000 | 7.6857 |
+ | 7.6058 | 2.03 | 25000 | 7.5543 |
+ | 7.5281 | 2.44 | 30000 | 7.4839 |
+ | 7.4487 | 2.84 | 35000 | 7.3801 |
+ | 7.3368 | 3.25 | 40000 | 7.2603 |
+ | 7.1923 | 3.66 | 45000 | 7.0365 |
+ | 6.9858 | 4.06 | 50000 | 6.8793 |
+ | 6.8639 | 4.47 | 55000 | 6.7839 |
+ | 6.7877 | 4.87 | 60000 | 6.7176 |
+ | 6.728 | 5.28 | 65000 | 6.6680 |
+ | 6.6826 | 5.69 | 70000 | 6.6258 |
+ | 6.6414 | 6.09 | 75000 | 6.5847 |
+ | 6.6057 | 6.5 | 80000 | 6.5571 |
+ | 6.5794 | 6.91 | 85000 | 6.5279 |
+ | 6.5525 | 7.31 | 90000 | 6.5059 |
+ | 6.5354 | 7.72 | 95000 | 6.4816 |
+ | 6.5125 | 8.12 | 100000 | 6.4674 |
+ | 6.4958 | 8.53 | 105000 | 6.4486 |
+ | 6.4817 | 8.94 | 110000 | 6.4317 |
+ | 6.4674 | 9.34 | 115000 | 6.4195 |
+ | 6.4549 | 9.75 | 120000 | 6.4072 |
+ | 6.4409 | 10.16 | 125000 | 6.3945 |
+ | 6.4302 | 10.56 | 130000 | 6.3861 |
+ | 6.4214 | 10.97 | 135000 | 6.3755 |
+ | 6.4118 | 11.37 | 140000 | 6.3659 |
+ | 6.4058 | 11.78 | 145000 | 6.3604 |
+ | 6.3985 | 12.19 | 150000 | 6.3560 |
+ | 6.3899 | 12.59 | 155000 | 6.3473 |
+ | 6.3837 | 13.0 | 160000 | 6.3417 |
+ | 6.3782 | 13.41 | 165000 | 6.3361 |
+ | 6.3753 | 13.81 | 170000 | 6.3309 |
+ | 6.3733 | 14.22 | 175000 | 6.3285 |
+ | 6.3706 | 14.62 | 180000 | 6.3277 |


  ### Framework versions