akrishnan1 commited on
Commit
e7c7701
·
1 Parent(s): 0e7822d

Training in progress epoch 0

Browse files
README.md CHANGED
@@ -1,6 +1,6 @@
1
  ---
2
  license: apache-2.0
3
- base_model: akrishnan1/arxiv_summarization_model
4
  tags:
5
  - generated_from_keras_callback
6
  model-index:
@@ -13,16 +13,16 @@ probably proofread and complete it, then remove this comment. -->
13
 
14
  # akrishnan1/arxiv_summarization_model
15
 
16
- This model is a fine-tuned version of [akrishnan1/arxiv_summarization_model](https://huggingface.co/akrishnan1/arxiv_summarization_model) on an unknown dataset.
17
  It achieves the following results on the evaluation set:
18
- - Train Loss: 2.7649
19
- - Validation Loss: 2.7152
20
- - Train Rouge1: 17.4143
21
- - Train Rouge2: 6.1880
22
- - Train Rougel: 13.5825
23
- - Train Rougelsum: 15.6683
24
  - Train Gen Len: 19.0
25
- - Epoch: 2
26
 
27
  ## Model description
28
 
@@ -48,9 +48,7 @@ The following hyperparameters were used during training:
48
 
49
  | Train Loss | Validation Loss | Train Rouge1 | Train Rouge2 | Train Rougel | Train Rougelsum | Train Gen Len | Epoch |
50
  |:----------:|:---------------:|:------------:|:------------:|:------------:|:---------------:|:-------------:|:-----:|
51
- | 2.8019 | 2.7361 | 17.1985 | 6.1226 | 13.4386 | 15.8710 | 19.0 | 0 |
52
- | 2.7832 | 2.7251 | 17.3981 | 6.0631 | 13.4450 | 15.8012 | 19.0 | 1 |
53
- | 2.7649 | 2.7152 | 17.4143 | 6.1880 | 13.5825 | 15.6683 | 19.0 | 2 |
54
 
55
 
56
  ### Framework versions
 
1
  ---
2
  license: apache-2.0
3
+ base_model: google-t5/t5-small
4
  tags:
5
  - generated_from_keras_callback
6
  model-index:
 
13
 
14
  # akrishnan1/arxiv_summarization_model
15
 
16
+ This model is a fine-tuned version of [google-t5/t5-small](https://huggingface.co/google-t5/t5-small) on an unknown dataset.
17
  It achieves the following results on the evaluation set:
18
+ - Train Loss: 2.6862
19
+ - Validation Loss: 2.4424
20
+ - Train Rouge1: 17.9778
21
+ - Train Rouge2: 6.7295
22
+ - Train Rougel: 14.3327
23
+ - Train Rougelsum: 16.3045
24
  - Train Gen Len: 19.0
25
+ - Epoch: 0
26
 
27
  ## Model description
28
 
 
48
 
49
  | Train Loss | Validation Loss | Train Rouge1 | Train Rouge2 | Train Rougel | Train Rougelsum | Train Gen Len | Epoch |
50
  |:----------:|:---------------:|:------------:|:------------:|:------------:|:---------------:|:-------------:|:-----:|
51
+ | 2.6862 | 2.4424 | 17.9778 | 6.7295 | 14.3327 | 16.3045 | 19.0 | 0 |
 
 
52
 
53
 
54
  ### Framework versions
config.json CHANGED
@@ -1,5 +1,5 @@
1
  {
2
- "_name_or_path": "akrishnan1/arxiv_summarization_model",
3
  "architectures": [
4
  "T5ForConditionalGeneration"
5
  ],
 
1
  {
2
+ "_name_or_path": "google-t5/t5-small",
3
  "architectures": [
4
  "T5ForConditionalGeneration"
5
  ],
logs/train/events.out.tfevents.1714514322.cf05c62b2ebf.1206.0.v2 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:dde5968255b4f83c7437e727901af32deb4be2e8b39e6cbbaf1ec97fcd2f544e
3
+ size 78
tf_model.h5 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:f51da9715ce106dcd2e5f15567f1f1c302a983ad04d7f675a8a1f43d91ac65b6
3
  size 373902664
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:1ce6f767e3f4fee49c37a5065fb8570f04ff6447c2187c321dc7632943c534c6
3
  size 373902664
tokenizer.json CHANGED
@@ -1,6 +1,11 @@
1
  {
2
  "version": "1.0",
3
- "truncation": null,
 
 
 
 
 
4
  "padding": null,
5
  "added_tokens": [
6
  {
 
1
  {
2
  "version": "1.0",
3
+ "truncation": {
4
+ "direction": "Right",
5
+ "max_length": 128,
6
+ "strategy": "LongestFirst",
7
+ "stride": 0
8
+ },
9
  "padding": null,
10
  "added_tokens": [
11
  {
tokenizer_config.json CHANGED
@@ -930,12 +930,8 @@
930
  "clean_up_tokenization_spaces": true,
931
  "eos_token": "</s>",
932
  "extra_ids": 100,
933
- "max_length": 128,
934
  "model_max_length": 512,
935
  "pad_token": "<pad>",
936
- "stride": 0,
937
  "tokenizer_class": "T5Tokenizer",
938
- "truncation_side": "right",
939
- "truncation_strategy": "longest_first",
940
  "unk_token": "<unk>"
941
  }
 
930
  "clean_up_tokenization_spaces": true,
931
  "eos_token": "</s>",
932
  "extra_ids": 100,
 
933
  "model_max_length": 512,
934
  "pad_token": "<pad>",
 
935
  "tokenizer_class": "T5Tokenizer",
 
 
936
  "unk_token": "<unk>"
937
  }