Commit b293e76 by etav22 (1 parent: a1946d7)

training completed[prod]: 512 128

README.md CHANGED
@@ -1,13 +1,9 @@
 ---
 tags:
 - generated_from_trainer
-metrics:
-- rouge
 model-index:
 - name: pegasus-legalease
   results: []
-datasets:
-- hheiden/us-congress-117-bills
 ---
 
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -17,12 +13,7 @@ should probably proofread and complete it, then remove this comment. -->
 
 This model was trained from scratch on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 4.2373
-- Rouge1: 0.4847
-- Rouge2: 0.3225
-- Rougel: 0.4194
-- Rougelsum: 0.4177
-- Gen Len: 43.02
+- Loss: 1.1142
 
 ## Model description
 
@@ -42,28 +33,34 @@ More information needed
 
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
-- train_batch_size: 8
-- eval_batch_size: 8
+- train_batch_size: 4
+- eval_batch_size: 4
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 5
+- num_epochs: 1
 - mixed_precision_training: Native AMP
 
 ### Training results
 
-| Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
-|:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:---------:|:-------:|
-| 5.2738 | 1.0 | 125 | 4.6004 | 0.4363 | 0.2769 | 0.375 | 0.3743 | 37.24 |
-| 4.8164 | 2.0 | 250 | 4.4350 | 0.464 | 0.3085 | 0.405 | 0.4038 | 40.3 |
-| 4.8494 | 3.0 | 375 | 4.3372 | 0.473 | 0.3153 | 0.412 | 0.41 | 41.2 |
-| 4.6062 | 4.0 | 500 | 4.2669 | 0.4791 | 0.3196 | 0.4159 | 0.4141 | 43.03 |
-| 4.5682 | 5.0 | 625 | 4.2373 | 0.4847 | 0.3225 | 0.4194 | 0.4177 | 43.02 |
+| Training Loss | Epoch | Step | Validation Loss |
+|:-------------:|:-----:|:----:|:---------------:|
+| No log | 0.09 | 250 | 4.5607 |
+| 4.8769 | 0.18 | 500 | 4.2187 |
+| 4.8769 | 0.27 | 750 | 2.2905 |
+| 2.9804 | 0.35 | 1000 | 1.1894 |
+| 2.9804 | 0.44 | 1250 | 1.1604 |
+| 1.3716 | 0.53 | 1500 | 1.1433 |
+| 1.3716 | 0.62 | 1750 | 1.1318 |
+| 1.2964 | 0.71 | 2000 | 1.1244 |
+| 1.2964 | 0.8 | 2250 | 1.1188 |
+| 1.248 | 0.89 | 2500 | 1.1152 |
+| 1.248 | 0.98 | 2750 | 1.1142 |
 
 
 ### Framework versions
 
-- Transformers 4.38.1
+- Transformers 4.38.2
 - Pytorch 2.1.0+cu121
 - Datasets 2.18.0
-- Tokenizers 0.15.2
+- Tokenizers 0.15.2
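
The step and epoch columns in the two training tables suggest the runs used training sets of different sizes. Here is a minimal sketch of that arithmetic. It assumes a single device and no gradient accumulation, neither of which the card states, and the helper names are hypothetical. The epoch values in the card are rounded, so the second figure is only an estimate.

```python
def steps_per_epoch(step: int, epoch: float) -> float:
    """Estimate optimizer steps per epoch from one logged (step, epoch) pair."""
    return step / epoch


def approx_examples(step: int, epoch: float, batch_size: int) -> int:
    """Estimate the number of training examples seen per epoch.

    Assumption (not stated in the card): one device, no gradient
    accumulation, so one optimizer step consumes exactly one batch.
    """
    return round(steps_per_epoch(step, epoch) * batch_size)


# Previous run: step 125 at epoch 1.0 with train_batch_size 8.
old_examples = approx_examples(125, 1.0, 8)

# New run: step 2750 at epoch 0.98 with train_batch_size 4.
# The epoch is rounded to two decimals, so this is approximate.
new_examples = approx_examples(2750, 0.98, 4)

print(old_examples)
print(new_examples)
```

Under those assumptions the previous run saw about 1,000 examples per epoch and the new run about 11,000. That would be consistent with the much lower validation loss, but the card does not confirm the dataset sizes.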
generation_config.json CHANGED
@@ -1,4 +1,5 @@
 {
+  "_from_model_config": true,
   "bos_token_id": 0,
   "decoder_start_token_id": 0,
   "eos_token_id": 1,
@@ -7,5 +8,5 @@
   "max_length": 64,
   "num_beams": 8,
   "pad_token_id": 0,
-  "transformers_version": "4.38.1"
+  "transformers_version": "4.38.2"
 }
runs/Mar08_22-44-37_e6ae31d0efb2/events.out.tfevents.1709937880.e6ae31d0efb2.13111.0 CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:2e50a974647b7bf85e0330d5d24d20be4a2b8c63078539448d5883af01721976
-size 8586
+oid sha256:f6ca1c453ac5133258515bc45fde7291daf87afb37b3124a8248b96a6008aa6c
+size 9964
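
The event file is stored as a Git LFS pointer, so the diff shows only the pointer's `oid` and `size` fields, not the TensorBoard log itself. Here is a minimal sketch of reading the two pointer versions and comparing their sizes. The `parse_lfs_pointer` helper is hypothetical and handles only the three keys shown in this diff.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse the space-separated key/value lines of a Git LFS pointer file."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    # The oid is written as "<hash-method>:<hex digest>".
    method, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_method": method,
        "digest": digest,
        "size": int(fields["size"]),  # size of the real file, in bytes
    }


old = parse_lfs_pointer("""
version https://git-lfs.github.com/spec/v1
oid sha256:2e50a974647b7bf85e0330d5d24d20be4a2b8c63078539448d5883af01721976
size 8586
""")
new = parse_lfs_pointer("""
version https://git-lfs.github.com/spec/v1
oid sha256:f6ca1c453ac5133258515bc45fde7291daf87afb37b3124a8248b96a6008aa6c
size 9964
""")

growth = new["size"] - old["size"]
print(growth)  # 1378 bytes added to the event log
```

The changed digest and larger size show that the file was rewritten with more logged events. The pointer alone does not reveal what those events contain.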