pankaj10034 commited on
Commit
41e36c3
1 Parent(s): 61bf006

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +47 -24
README.md CHANGED
@@ -1,23 +1,51 @@
1
  ---
 
 
2
  license: mit
3
  base_model: deepset/gbert-base
4
- tags:
5
- - generated_from_keras_callback
 
 
 
 
 
6
  model-index:
7
- - name: pankaj10034/gbert-base-germaner
8
- results: []
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
9
  ---
10
 
11
- <!-- This model card has been generated automatically according to the information Keras had access to. You should
12
- probably proofread and complete it, then remove this comment. -->
13
 
14
- # pankaj10034/gbert-base-germaner
15
 
16
- This model is a fine-tuned version of [deepset/gbert-base](https://huggingface.co/deepset/gbert-base) on an unknown dataset.
17
  It achieves the following results on the evaluation set:
18
- - Train Loss: 0.0201
19
- - Validation Loss: 0.1012
20
- - Epoch: 3
 
21
 
22
  ## Model description
23
 
@@ -36,22 +64,17 @@ More information needed
36
  ### Training hyperparameters
37
 
38
  The following hyperparameters were used during training:
39
- - optimizer: {'inner_optimizer': {'class_name': 'AdamWeightDecay', 'config': {'name': 'AdamWeightDecay', 'learning_rate': {'class_name': 'PolynomialDecay', 'config': {'initial_learning_rate': 2e-05, 'decay_steps': 6960, 'end_learning_rate': 0.0, 'power': 1.0, 'cycle': False, 'name': None}}, 'decay': 0.0, 'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-08, 'amsgrad': False, 'weight_decay_rate': 0.01}}, 'dynamic': True, 'initial_scale': 32768.0, 'dynamic_growth_steps': 2000}
40
- - training_precision: mixed_float16
41
-
42
- ### Training results
43
-
44
- | Train Loss | Validation Loss | Epoch |
45
- |:----------:|:---------------:|:-----:|
46
- | 0.1379 | 0.0909 | 0 |
47
- | 0.0558 | 0.0848 | 1 |
48
- | 0.0328 | 0.0899 | 2 |
49
- | 0.0201 | 0.1012 | 3 |
50
-
51
 
52
  ### Framework versions
53
 
54
  - Transformers 4.31.0
55
- - TensorFlow 2.12.0
56
  - Datasets 2.14.4
57
  - Tokenizers 0.13.3
 
1
  ---
2
+ language:
3
+ - de
4
  license: mit
5
  base_model: deepset/gbert-base
6
+ datasets:
7
+ - germaner
8
+ metrics:
9
+ - precision
10
+ - recall
11
+ - f1
12
+ - accuracy
13
  model-index:
14
+ - name: gbert-base-germaner
15
+ results:
16
+ - task:
17
+ name: Token Classification
18
+ type: token-classification
19
+ dataset:
20
+ name: germaner
21
+ type: germaner
22
+ args: default
23
+ metrics:
24
+ - name: precision
25
+ type: precision
26
+ value: 0.8403996101364523
27
+ - name: recall
28
+ type: recall
29
+ value: 0.8674547283702213
30
+ - name: f1
31
+ type: f1
32
+ value: 0.8537128712871287
33
+ - name: accuracy
34
+ type: accuracy
35
+ value: 0.9760785008915815
36
  ---
37
 
38
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
39
+ should probably proofread and complete it, then remove this comment. -->
40
 
41
+ # gbert-base-germaner
42
 
43
+ This model is a fine-tuned version of [deepset/gbert-base](https://huggingface.co/deepset/gbert-base) on the germaner dataset.
44
  It achieves the following results on the evaluation set:
45
+ - precision: 0.8404
46
+ - recall: 0.8675
47
+ - f1: 0.8537
48
+ - accuracy: 0.9761
49
 
50
  ## Model description
51
 
 
64
  ### Training hyperparameters
65
 
66
  The following hyperparameters were used during training:
67
+ - num_train_epochs: 5
68
+ - train_batch_size: 16
69
+ - eval_batch_size: 32
70
+ - learning_rate: 2e-06
71
+ - weight_decay_rate: 0.01
72
+ - num_warmup_steps: 0
73
+ - fp16: True
 
 
 
 
 
74
 
75
  ### Framework versions
76
 
77
  - Transformers 4.31.0
78
+ - Pytorch 2.0.1+cu118
79
  - Datasets 2.14.4
80
  - Tokenizers 0.13.3