iguerrasevillano committed on
Commit
6d3f748
1 Parent(s): 82aac7d

Training in progress epoch 0

README.md CHANGED
@@ -15,14 +15,14 @@ probably proofread and complete it, then remove this comment. -->

  This model is a fine-tuned version of [facebook/bart-base](https://huggingface.co/facebook/bart-base) on an unknown dataset.
  It achieves the following results on the evaluation set:
- - Train Loss: 2.5428
- - Validation Loss: 3.5532
- - Train Rouge1: 26.3837
- - Train Rouge2: 9.2192
- - Train Rougel: 21.8441
- - Train Rougelsum: 21.7351
- - Train Gen Len: 20.0
- - Epoch: 5
+ - Train Loss: 3.9877
+ - Validation Loss: 3.5287
+ - Train Rouge1: 32.1820
+ - Train Rouge2: 9.3543
+ - Train Rougel: 22.3531
+ - Train Rougelsum: 25.9513
+ - Train Gen Len: 52.0625
+ - Epoch: 0

  ## Model description

@@ -48,12 +48,7 @@ The following hyperparameters were used during training:

  | Train Loss | Validation Loss | Train Rouge1 | Train Rouge2 | Train Rougel | Train Rougelsum | Train Gen Len | Epoch |
  |:----------:|:---------------:|:------------:|:------------:|:------------:|:---------------:|:-------------:|:-----:|
- | 3.9333 | 3.5188 | 26.6086 | 6.7680 | 20.7580 | 20.5472 | 20.0 | 0 |
- | 3.3063 | 3.5640 | 27.2760 | 6.7716 | 21.4322 | 20.8327 | 20.0 | 1 |
- | 3.0764 | 3.5048 | 27.4389 | 7.4962 | 21.7549 | 21.5366 | 20.0 | 2 |
- | 2.8770 | 3.5305 | 26.4536 | 8.0238 | 21.3939 | 21.0272 | 20.0 | 3 |
- | 2.6977 | 3.5499 | 27.1909 | 9.8630 | 22.4824 | 22.2501 | 20.0 | 4 |
- | 2.5428 | 3.5532 | 26.3837 | 9.2192 | 21.8441 | 21.7351 | 20.0 | 5 |
+ | 3.9877 | 3.5287 | 32.1820 | 9.3543 | 22.3531 | 25.9513 | 52.0625 | 0 |


  ### Framework versions
checkpoint/extra_data.pickle CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:905eed494bc22129f765ac78b4be18e82fc28c1f5982a6c9e9485acb0bb875d2
+ oid sha256:e7d83bb1e267833d00dd05d467facf7bb0eab45df7c74dc8de85cdb7a72066d2
  size 1115383533
checkpoint/weights.h5 CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:0aa1d48069c62c85699ceffa40b09d959fef96068f07d641f2717ccf90ab6b17
+ oid sha256:0af9614bc39d0d557c4e0e2d75b2f8c5a21dec1dd3137983cd8ba273c6f98981
  size 558172300
config.json CHANGED
@@ -7,7 +7,7 @@
  "architectures": [
    "BartForConditionalGeneration"
  ],
- "attention_dropout": 0.1,
+ "attention_dropout": 0.3,
  "bos_token_id": 0,
  "classif_dropout": 0.1,
  "classifier_dropout": 0.0,
generation_config.json CHANGED
@@ -2,12 +2,16 @@
  "_from_model_config": true,
  "bos_token_id": 0,
  "decoder_start_token_id": 2,
+ "do_sample": true,
  "early_stopping": true,
  "eos_token_id": 2,
  "forced_bos_token_id": 0,
  "forced_eos_token_id": 2,
+ "length_penalty": 3,
+ "max_new_tokens": 60,
  "no_repeat_ngram_size": 3,
  "num_beams": 4,
  "pad_token_id": 1,
+ "temperature": 0.5,
  "transformers_version": "4.35.2"
  }
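
The keys this hunk adds to generation_config.json can be checked mechanically. The following is a minimal sketch, assuming both versions of the file are available as plain dicts; the values are copied from the hunk above, while the `config_diff` helper and the variable names are illustrative and not part of the repository.

```python
# Old generation settings, as shown on the "-" side of the hunk.
old_cfg = {
    "_from_model_config": True,
    "bos_token_id": 0,
    "decoder_start_token_id": 2,
    "early_stopping": True,
    "eos_token_id": 2,
    "forced_bos_token_id": 0,
    "forced_eos_token_id": 2,
    "no_repeat_ngram_size": 3,
    "num_beams": 4,
    "pad_token_id": 1,
    "transformers_version": "4.35.2",
}

# New settings: the old ones plus the four "+" lines of the hunk.
new_cfg = {
    **old_cfg,
    "do_sample": True,
    "length_penalty": 3,
    "max_new_tokens": 60,
    "temperature": 0.5,
}


def config_diff(old: dict, new: dict):
    """Return (added, removed, changed) between two flat config dicts."""
    added = {k: new[k] for k in new if k not in old}
    removed = {k: old[k] for k in old if k not in new}
    changed = {k: (old[k], new[k]) for k in old if k in new and old[k] != new[k]}
    return added, removed, changed


added, removed, changed = config_diff(old_cfg, new_cfg)
print("added:", sorted(added))
print("removed:", sorted(removed))
print("changed:", sorted(changed))
```

Under these assumptions the commit only adds keys; no existing generation setting is removed or altered. The new `max_new_tokens` of 60 is at least consistent with the README's Train Gen Len moving from 20.0 to 52.0625, though the diff itself does not state the cause.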
tf_model.h5 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:0aa1d48069c62c85699ceffa40b09d959fef96068f07d641f2717ccf90ab6b17
3
  size 558172300
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:0af9614bc39d0d557c4e0e2d75b2f8c5a21dec1dd3137983cd8ba273c6f98981
3
  size 558172300
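
The three binary files in this commit are stored as Git LFS pointers (a `version` line, an `oid` line, and a `size` line). As a minimal sketch, assuming the pointer text is available as a string, the pointers can be parsed and compared to confirm that checkpoint/weights.h5 and tf_model.h5 reference the same blob after this commit. The `parse_lfs_pointer` helper is hypothetical; the pointer contents are copied from the hunks above.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse the key/value lines of a Git LFS pointer file."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    # The oid line has the form "<hash-algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


# New ("+") side of the checkpoint/weights.h5 hunk.
WEIGHTS_H5 = """\
version https://git-lfs.github.com/spec/v1
oid sha256:0af9614bc39d0d557c4e0e2d75b2f8c5a21dec1dd3137983cd8ba273c6f98981
size 558172300
"""

# New ("+") side of the tf_model.h5 hunk.
TF_MODEL_H5 = """\
version https://git-lfs.github.com/spec/v1
oid sha256:0af9614bc39d0d557c4e0e2d75b2f8c5a21dec1dd3137983cd8ba273c6f98981
size 558172300
"""

weights = parse_lfs_pointer(WEIGHTS_H5)
tf_model = parse_lfs_pointer(TF_MODEL_H5)

# Identical digest and size mean both paths point at the same stored object.
same_blob = (
    weights["digest"] == tf_model["digest"]
    and weights["size"] == tf_model["size"]
)
print(same_blob)
```

Comparing pointers this way only inspects metadata; it does not download or hash the 558172300-byte weight file itself.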