smrynrz20 commited on
Commit
6b5e34b
1 Parent(s): d325f51

End of training

Browse files
README.md CHANGED
@@ -17,9 +17,9 @@ should probably proofread and complete it, then remove this comment. -->
17
 
18
  This model is a fine-tuned version of [facebook/bart-base](https://huggingface.co/facebook/bart-base) on an unknown dataset.
19
  It achieves the following results on the evaluation set:
20
- - Loss: 1.2041
21
- - F1: 0.7312
22
- - Exact Match: 0.59
23
 
24
  ## Model description
25
 
@@ -38,10 +38,12 @@ More information needed
38
  ### Training hyperparameters
39
 
40
  The following hyperparameters were used during training:
41
- - learning_rate: 2e-05
42
  - train_batch_size: 16
43
  - eval_batch_size: 16
44
  - seed: 42
 
 
45
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
46
  - lr_scheduler_type: linear
47
  - num_epochs: 3
@@ -50,14 +52,14 @@ The following hyperparameters were used during training:
50
 
51
  | Training Loss | Epoch | Step | Validation Loss | F1 | Exact Match |
52
  |:-------------:|:-----:|:----:|:---------------:|:------:|:-----------:|
53
- | 2.582 | 1.0 | 250 | 1.4052 | 0.6466 | 0.511 |
54
- | 1.2757 | 2.0 | 500 | 1.2138 | 0.7143 | 0.573 |
55
- | 1.0031 | 3.0 | 750 | 1.2041 | 0.7312 | 0.59 |
56
 
57
 
58
  ### Framework versions
59
 
60
- - Transformers 4.36.2
61
- - Pytorch 2.1.0+cu121
62
- - Datasets 2.16.1
63
- - Tokenizers 0.15.0
 
17
 
18
  This model is a fine-tuned version of [facebook/bart-base](https://huggingface.co/facebook/bart-base) on an unknown dataset.
19
  It achieves the following results on the evaluation set:
20
+ - Loss: 1.1988
21
+ - F1: 0.7649
22
+ - Exact Match: 0.634
23
 
24
  ## Model description
25
 
 
38
  ### Training hyperparameters
39
 
40
  The following hyperparameters were used during training:
41
+ - learning_rate: 3.7185140364032e-05
42
  - train_batch_size: 16
43
  - eval_batch_size: 16
44
  - seed: 42
45
+ - gradient_accumulation_steps: 2
46
+ - total_train_batch_size: 32
47
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
48
  - lr_scheduler_type: linear
49
  - num_epochs: 3
 
52
 
53
  | Training Loss | Epoch | Step | Validation Loss | F1 | Exact Match |
54
  |:-------------:|:-----:|:----:|:---------------:|:------:|:-----------:|
55
+ | 0.8014 | 1.0 | 125 | 1.3209 | 0.7185 | 0.579 |
56
+ | 0.9114 | 2.0 | 250 | 1.1532 | 0.7515 | 0.623 |
57
+ | 0.6644 | 3.0 | 375 | 1.1988 | 0.7649 | 0.634 |
58
 
59
 
60
  ### Framework versions
61
 
62
+ - Transformers 4.41.2
63
+ - Pytorch 2.3.0+cu121
64
+ - Datasets 2.20.0
65
+ - Tokenizers 0.19.1
config.json CHANGED
@@ -67,7 +67,7 @@
67
  }
68
  },
69
  "torch_dtype": "float32",
70
- "transformers_version": "4.36.2",
71
  "use_cache": true,
72
  "vocab_size": 50265
73
  }
 
67
  }
68
  },
69
  "torch_dtype": "float32",
70
+ "transformers_version": "4.41.2",
71
  "use_cache": true,
72
  "vocab_size": 50265
73
  }
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:51a34cb8db7ffc240aaf3f634d9d857a0042fc4371f33e760bd3078ca42c34e3
3
  size 557717800
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b4c098311fe8fc9d3cef762b24e6320f838731059a5d19b7b686e02fbaec3ddf
3
  size 557717800
runs/Jun24_07-47-52_9b938ea75028/events.out.tfevents.1719215272.9b938ea75028.1492.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:cbb87231fddac7275bb8d74772129fd1b1dcbc71e393251dc9bfa22a3e6cf9c3
3
+ size 6074
runs/Jun24_07-49-57_9b938ea75028/events.out.tfevents.1719215397.9b938ea75028.1492.1 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d5a73ed811400493c4eb51c95c7f0f9ba820a6e2051395bdb8e75e0e8f89a165
3
+ size 7959
tokenizer.json CHANGED
@@ -97,6 +97,7 @@
97
  "end_of_word_suffix": "",
98
  "fuse_unk": false,
99
  "byte_fallback": false,
 
100
  "vocab": {
101
  "<s>": 0,
102
  "<pad>": 1,
 
97
  "end_of_word_suffix": "",
98
  "fuse_unk": false,
99
  "byte_fallback": false,
100
+ "ignore_merges": false,
101
  "vocab": {
102
  "<s>": 0,
103
  "<pad>": 1,
tokenizer_config.json CHANGED
@@ -48,7 +48,7 @@
48
  "eos_token": "</s>",
49
  "errors": "replace",
50
  "mask_token": "<mask>",
51
- "model_max_length": 1024,
52
  "pad_token": "<pad>",
53
  "sep_token": "</s>",
54
  "tokenizer_class": "BartTokenizer",
 
48
  "eos_token": "</s>",
49
  "errors": "replace",
50
  "mask_token": "<mask>",
51
+ "model_max_length": 1000000000000000019884624838656,
52
  "pad_token": "<pad>",
53
  "sep_token": "</s>",
54
  "tokenizer_class": "BartTokenizer",
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:21f560049ce8ec58e69b5850ec29fa79f2ee2743c69b55c9b21e22055075e404
3
- size 4664
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:384836a68ffb46b3b59e7a350ccfa7ecab30aef4e43f8b4ad82d8724ba1f6919
3
+ size 5112