yidi-huang committed on
Commit 1cdecbb
1 Parent(s): bf6336a

Training in progress epoch 0

Files changed (3)
  1. README.md +4 -6
  2. config.json +8 -8
  3. tf_model.h5 +2 -2
README.md CHANGED
@@ -15,8 +15,8 @@ probably proofread and complete it, then remove this comment. -->

  This model is a fine-tuned version of [dbmdz/bert-base-french-europeana-cased](https://huggingface.co/dbmdz/bert-base-french-europeana-cased) on an unknown dataset.
  It achieves the following results on the evaluation set:
- - Train Loss: 0.0693
- - Epoch: 2
+ - Train Loss: 0.2420
+ - Epoch: 0

  ## Model description

@@ -35,16 +35,14 @@ More information needed
  ### Training hyperparameters

  The following hyperparameters were used during training:
- - optimizer: {'name': 'AdamWeightDecay', 'learning_rate': {'module': 'keras.optimizers.schedules', 'class_name': 'PolynomialDecay', 'config': {'initial_learning_rate': 2e-05, 'decay_steps': 2151, 'end_learning_rate': 0.0, 'power': 1.0, 'cycle': False, 'name': None}, 'registered_name': None}, 'decay': 0.0, 'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-08, 'amsgrad': False, 'weight_decay_rate': 0.01}
+ - optimizer: {'name': 'AdamWeightDecay', 'learning_rate': {'module': 'keras.optimizers.schedules', 'class_name': 'PolynomialDecay', 'config': {'initial_learning_rate': 2e-05, 'decay_steps': 7170, 'end_learning_rate': 0.0, 'power': 1.0, 'cycle': False, 'name': None}, 'registered_name': None}, 'decay': 0.0, 'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-08, 'amsgrad': False, 'weight_decay_rate': 0.01}
  - training_precision: float32

  ### Training results

  | Train Loss | Epoch |
  |:----------:|:-----:|
- | 0.2548 | 0 |
- | 0.1102 | 1 |
- | 0.0693 | 2 |
+ | 0.2420 | 0 |


  ### Framework versions
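The only hyperparameter this commit changes is `decay_steps` in the `PolynomialDecay` schedule (2151 to 7170). The sketch below reproduces the non-cycling polynomial decay formula in plain Python so the effect of that change can be checked without TensorFlow. The function name `polynomial_decay` is hypothetical. The 717-step epoch used in the demo loop is an assumption: it is simply 2151 divided by the three epochs listed in the old results table, and it would make 7170 correspond to ten epochs, which the commit itself does not state.

```python
def polynomial_decay(step, initial_lr=2e-05, decay_steps=7170,
                     end_lr=0.0, power=1.0):
    """Learning rate at `step` for a polynomial decay with cycle=False.

    Defaults mirror the values in the new README; power=1.0 makes the
    decay linear.
    """
    # Without cycling, the step is clamped so the rate stays at end_lr
    # once decay_steps has been reached.
    step = min(step, decay_steps)
    remaining = 1.0 - step / decay_steps
    return (initial_lr - end_lr) * remaining ** power + end_lr


if __name__ == "__main__":
    # 717 is an assumed steps-per-epoch value (2151 / 3), not taken from the commit.
    for step in (0, 717, 3585, 7170):
        old = polynomial_decay(step, decay_steps=2151)
        new = polynomial_decay(step, decay_steps=7170)
        print(f"step {step:>5}: old schedule {old:.3e}, new schedule {new:.3e}")
```

Under the old setting the rate reaches zero at step 2151; under the new one it is still above zero at that point and only reaches zero at step 7170.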
config.json CHANGED
@@ -11,18 +11,18 @@
  "hidden_size": 768,
  "id2label": {
    "0": "O",
-   "1": "B-lobj",
-   "2": "I-lobj",
-   "3": "B-lloc",
-   "4": "I-lloc"
+   "1": "B-lloc",
+   "2": "I-lloc",
+   "3": "B-lobj",
+   "4": "I-lobj"
  },
  "initializer_range": 0.02,
  "intermediate_size": 3072,
  "label2id": {
-   "B-lloc": 3,
-   "B-lobj": 1,
-   "I-lloc": 4,
-   "I-lobj": 2,
+   "B-lloc": 1,
+   "B-lobj": 3,
+   "I-lloc": 2,
+   "I-lobj": 4,
    "O": 0
  },
  "layer_norm_eps": 1e-12,
tf_model.h5 CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:88becd5370422fd98ea69b3c828e1b9002f855e1bf51b07362b6ee253f07100a
- size 440408028
+ oid sha256:7ec3776e2a1ab44b753ce70254d42f306ce0d5633e403df8424b62662cf472c9
+ size 440395740
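The `tf_model.h5` entry is a Git LFS pointer: the repository stores only the SHA-256 digest and byte size of the weights file. As a minimal sketch, a downloaded file can be checked against the pointer as shown below. The function names `parse_lfs_pointer` and `matches_pointer` are hypothetical, and the local path in the usage note is an assumption.

```python
import hashlib
import os

# New pointer contents, copied from the diff.
NEW_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:7ec3776e2a1ab44b753ce70254d42f306ce0d5633e403df8424b62662cf472c9
size 440395740
"""


def parse_lfs_pointer(text):
    """Parse the 'key value' lines of a Git LFS pointer file into a dict."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "oid": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer, chunk_size=1 << 20):
    """True if the file at `path` has the size and digest named by the pointer."""
    if os.path.getsize(path) != pointer["size"]:
        return False
    digest = hashlib.new(pointer["algo"])
    with open(path, "rb") as handle:
        # Read in chunks so a ~440 MB file is never held in memory at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest() == pointer["oid"]
```

For example, `matches_pointer("tf_model.h5", parse_lfs_pointer(NEW_POINTER))` would confirm that a local copy corresponds to this commit rather than its parent.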