jonaskoenig committed on
Commit
abe5e66
1 Parent(s): cdfa79c
Files changed (2)
  1. README.md +16 -2
  2. tf_model.h5 +1 -1
README.md CHANGED
@@ -14,7 +14,9 @@ probably proofread and complete it, then remove this comment. -->
 
 This model is a fine-tuned version of [microsoft/xtremedistil-l6-h256-uncased](https://huggingface.co/microsoft/xtremedistil-l6-h256-uncased) on an unknown dataset.
 It achieves the following results on the evaluation set:
-
+- Train Loss: 0.0279
+- Train Binary Crossentropy: 0.4809
+- Epoch: 9
 
 ## Model description
 
@@ -33,11 +35,23 @@ More information needed
 ### Training hyperparameters
 
 The following hyperparameters were used during training:
-- optimizer: {'name': 'Adam', 'learning_rate': 4e-05, 'decay': 0.0, 'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-07, 'amsgrad': False}
+- optimizer: {'name': 'Adam', 'learning_rate': 3e-05, 'decay': 0.0, 'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-07, 'amsgrad': False}
 - training_precision: float32
 
 ### Training results
 
+| Train Loss | Train Binary Crossentropy | Epoch |
+|:----------:|:-------------------------:|:-----:|
+| 0.0487 | 0.6401 | 0 |
+| 0.0348 | 0.5925 | 1 |
+| 0.0319 | 0.5393 | 2 |
+| 0.0306 | 0.5168 | 3 |
+| 0.0298 | 0.5045 | 4 |
+| 0.0292 | 0.4970 | 5 |
+| 0.0288 | 0.4916 | 6 |
+| 0.0284 | 0.4878 | 7 |
+| 0.0282 | 0.4836 | 8 |
+| 0.0279 | 0.4809 | 9 |
 
 
 ### Framework versions
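
The optimizer entry in the README diff is a Python dict literal as logged by the training run. As a rough illustration of how that line might be turned back into optimizer arguments, here is a minimal sketch. The helper names `adam_kwargs` and `build_optimizer` are hypothetical and not part of the commit, and dropping the zero-valued `decay` key is an assumption made because newer Keras releases no longer accept that legacy argument.

```python
import ast

# The optimizer line from the new side of the README diff, copied verbatim.
LOGGED_OPTIMIZER = (
    "{'name': 'Adam', 'learning_rate': 3e-05, 'decay': 0.0, 'beta_1': 0.9, "
    "'beta_2': 0.999, 'epsilon': 1e-07, 'amsgrad': False}"
)


def adam_kwargs(logged: str) -> dict:
    """Parse the logged dict and return keyword arguments for an Adam optimizer.

    Hypothetical helper, not taken from the commit.
    """
    config = ast.literal_eval(logged)  # safe parsing of a Python literal
    name = config.pop("name")
    if name != "Adam":
        raise ValueError(f"expected an Adam config, got {name!r}")
    # Assumption: a decay of 0.0 is a no-op, so the legacy key can be dropped.
    if config.get("decay") == 0.0:
        config.pop("decay")
    return config


def build_optimizer(logged: str):
    """Build a Keras Adam optimizer; TensorFlow is imported only when called."""
    import tensorflow as tf

    return tf.keras.optimizers.Adam(**adam_kwargs(logged))


if __name__ == "__main__":
    print(adam_kwargs(LOGGED_OPTIMIZER))
```

Calling `build_optimizer(LOGGED_OPTIMIZER)` requires TensorFlow to be installed; `adam_kwargs` alone runs with the standard library only.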
tf_model.h5 CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:9d352b498685b35b17374cafd396f6b33a9b6a4800f5b82db3c0a4a8eb7dba14
+oid sha256:6322962679080bb6e5600e84e6491586ec04b768df3475ad5bdb462c0be0a87d
 size 51177784
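
The `tf_model.h5` diff only shows a Git LFS pointer file, which records a version line, an `oid` (hash algorithm plus digest), and the size in bytes. The following sketch shows one way to check a downloaded file against such a pointer. The function names `parse_lfs_pointer` and `matches_pointer` are hypothetical, and the sketch assumes the pointer holds exactly the three `key value` lines shown in the diff.

```python
import hashlib
from pathlib import Path

# Pointer contents from the new side of the tf_model.h5 diff.
NEW_POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:6322962679080bb6e5600e84e6491586ec04b768df3475ad5bdb462c0be0a87d
size 51177784
"""


def parse_lfs_pointer(text: str) -> dict:
    """Split a pointer file into its version, hash algorithm, digest, and size."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer: dict) -> bool:
    """Return True if the file at `path` has the size and digest in `pointer`."""
    path = Path(path)
    if path.stat().st_size != pointer["size"]:
        return False  # cheap check first, avoids hashing a wrong-sized file
    hasher = hashlib.new(pointer["algo"])
    with path.open("rb") as handle:
        # Read in 1 MiB chunks so large model files are not loaded at once.
        for chunk in iter(lambda: handle.read(1 << 20), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


if __name__ == "__main__":
    print(parse_lfs_pointer(NEW_POINTER))
```

With a local copy of the weights, `matches_pointer("tf_model.h5", parse_lfs_pointer(NEW_POINTER))` would report whether the file corresponds to this commit. In the diff the size is the same on both sides (51177784 bytes) and only the digest changes.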