Vexemous commited on
Commit
79f1429
1 Parent(s): fa48e9d

End of training

Browse files
README.md ADDED
@@ -0,0 +1,69 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ base_model: facebook/bart-base
4
+ tags:
5
+ - generated_from_trainer
6
+ metrics:
7
+ - rouge
8
+ model-index:
9
+ - name: bart-base-finetuned-samsum
10
+ results: []
11
+ ---
12
+
13
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
14
+ should probably proofread and complete it, then remove this comment. -->
15
+
16
+ # bart-base-finetuned-samsum
17
+
18
+ This model is a fine-tuned version of [facebook/bart-base](https://huggingface.co/facebook/bart-base) on an unknown dataset.
19
+ It achieves the following results on the evaluation set:
20
+ - Loss: 1.5273
21
+ - Rouge1: 46.8865
22
+ - Rouge2: 23.8976
23
+ - Rougel: 39.8604
24
+ - Rougelsum: 43.0185
25
+ - Gen Len: 18.0659
26
+
27
+ ## Model description
28
+
29
+ More information needed
30
+
31
+ ## Intended uses & limitations
32
+
33
+ More information needed
34
+
35
+ ## Training and evaluation data
36
+
37
+ More information needed
38
+
39
+ ## Training procedure
40
+
41
+ ### Training hyperparameters
42
+
43
+ The following hyperparameters were used during training:
44
+ - learning_rate: 2e-05
45
+ - train_batch_size: 16
46
+ - eval_batch_size: 16
47
+ - seed: 42
48
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
49
+ - lr_scheduler_type: linear
50
+ - num_epochs: 5
51
+ - mixed_precision_training: Native AMP
52
+
53
+ ### Training results
54
+
55
+ | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
56
+ |:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|:-------:|
57
+ | 2.0008 | 1.0 | 921 | 1.6050 | 45.4152 | 21.5898 | 38.2192 | 41.5283 | 18.3272 |
58
+ | 1.6741 | 2.0 | 1842 | 1.5611 | 45.6316 | 22.7331 | 38.6353 | 42.0206 | 17.9963 |
59
+ | 1.547 | 3.0 | 2763 | 1.5362 | 46.4511 | 23.218 | 39.1461 | 42.4645 | 17.9255 |
60
+ | 1.4668 | 4.0 | 3684 | 1.5338 | 46.8899 | 23.7554 | 39.7789 | 43.0769 | 18.3553 |
61
+ | 1.4218 | 5.0 | 4605 | 1.5273 | 46.8865 | 23.8976 | 39.8604 | 43.0185 | 18.0659 |
62
+
63
+
64
+ ### Framework versions
65
+
66
+ - Transformers 4.40.1
67
+ - Pytorch 1.13.1+cu117
68
+ - Datasets 2.19.0
69
+ - Tokenizers 0.19.1
generation_config.json ADDED
@@ -0,0 +1,12 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "bos_token_id": 0,
3
+ "decoder_start_token_id": 2,
4
+ "early_stopping": true,
5
+ "eos_token_id": 2,
6
+ "forced_bos_token_id": 0,
7
+ "forced_eos_token_id": 2,
8
+ "no_repeat_ngram_size": 3,
9
+ "num_beams": 4,
10
+ "pad_token_id": 1,
11
+ "transformers_version": "4.40.1"
12
+ }
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:4df4fffb58de23283cb46177f59b57d15affdc93a2a819d7b7476d48144d91fa
3
  size 557912620
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e45f725a55cb0fab0311b97abe3c514c151827b18f830345e827a66b84ef62b0
3
  size 557912620
runs/Apr26_09-04-09_instance-20240426-075425/events.out.tfevents.1714122258.instance-20240426-075425 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:cbb6b5c832d4a3ba85e80afb5cecbb65ed272ed0d9743580cac42939ced7571e
3
- size 9690
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:29e38f23190654fc04b5ed2655629e4683a1197cb9c1a3c2a0169be3514fc771
3
+ size 10780