Commit 1f6981c by aish31 · 1 Parent(s): 9b4ea5a

Upload TFBartForConditionalGeneration

Files changed (4)
  1. README.md +13 -12
  2. config.json +1 -1
  3. generation_config.json +1 -1
  4. tf_model.h5 +1 -1
README.md CHANGED
@@ -15,9 +15,9 @@ probably proofread and complete it, then remove this comment. -->

  This model is a fine-tuned version of [facebook/bart-large](https://huggingface.co/facebook/bart-large) on an unknown dataset.
  It achieves the following results on the evaluation set:
- - Train Loss: 0.5390
- - Validation Loss: 0.8504
- - Epoch: 6
+ - Train Loss: 0.4108
+ - Validation Loss: 0.6862
+ - Epoch: 7

  ## Model description

@@ -43,18 +43,19 @@ The following hyperparameters were used during training:

  | Train Loss | Validation Loss | Epoch |
  |:----------:|:---------------:|:-----:|
- | 1.5397 | 1.0943 | 0 |
- | 1.0308 | 0.9304 | 1 |
- | 0.9037 | 0.8744 | 2 |
- | 0.7231 | 0.8223 | 3 |
- | 0.6445 | 0.8153 | 4 |
- | 0.5842 | 0.7929 | 5 |
- | 0.5390 | 0.8504 | 6 |
+ | 1.4774 | 1.0330 | 0 |
+ | 0.9993 | 0.8665 | 1 |
+ | 0.7898 | 0.8213 | 2 |
+ | 0.6723 | 0.7351 | 3 |
+ | 0.5816 | 0.7387 | 4 |
+ | 0.5306 | 0.6879 | 5 |
+ | 0.4547 | 0.6966 | 6 |
+ | 0.4108 | 0.6862 | 7 |


  ### Framework versions

- - Transformers 4.33.0
+ - Transformers 4.33.1
  - TensorFlow 2.12.0
- - Datasets 2.14.4
+ - Datasets 2.14.5
  - Tokenizers 0.13.3
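
The two training tables in the README diff can be compared by looking at where validation loss bottoms out in each run. The following is a minimal sketch, not part of the commit: the numbers are copied from the tables above, while the names `old_run`, `new_run`, and `best_epoch` are hypothetical and introduced only for illustration.

```python
# Rows are (epoch, train_loss, validation_loss), copied from the README diff.
# old_run is the table removed by this commit; new_run is the table it adds.
old_run = [
    (0, 1.5397, 1.0943),
    (1, 1.0308, 0.9304),
    (2, 0.9037, 0.8744),
    (3, 0.7231, 0.8223),
    (4, 0.6445, 0.8153),
    (5, 0.5842, 0.7929),
    (6, 0.5390, 0.8504),
]
new_run = [
    (0, 1.4774, 1.0330),
    (1, 0.9993, 0.8665),
    (2, 0.7898, 0.8213),
    (3, 0.6723, 0.7351),
    (4, 0.5816, 0.7387),
    (5, 0.5306, 0.6879),
    (6, 0.4547, 0.6966),
    (7, 0.4108, 0.6862),
]


def best_epoch(rows):
    """Return the row with the lowest validation loss (hypothetical helper)."""
    return min(rows, key=lambda row: row[2])


for name, rows in (("old", old_run), ("new", new_run)):
    epoch, train_loss, val_loss = best_epoch(rows)
    print(f"{name}: lowest validation loss {val_loss:.4f} at epoch {epoch}")
```

In the old table the lowest validation loss (0.7929) occurs at epoch 5, one epoch before the reported final epoch, whereas in the new table the final epoch 7 is also the lowest (0.6862).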
config.json CHANGED
@@ -67,7 +67,7 @@
  "num_beams": 6
  }
  },
- "transformers_version": "4.33.0",
+ "transformers_version": "4.33.1",
  "use_cache": true,
  "vocab_size": 50265
  }
generation_config.json CHANGED
@@ -9,5 +9,5 @@
  "no_repeat_ngram_size": 3,
  "num_beams": 4,
  "pad_token_id": 1,
- "transformers_version": "4.33.0"
+ "transformers_version": "4.33.1"
  }
tf_model.h5 CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:5c7aaed66589fbe62d4978d68ef63f31300a7bcb823aa7ae2357502a5a1cbed9
+ oid sha256:d4ad4b80352ff701b9a8b2e9c243bb956f79f7346e65ede91216d4140578f4a4
  size 1625925412
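
The tf_model.h5 entry is a Git LFS pointer, so only the `oid` changes while the `size` stays at 1625925412 bytes. As a rough sketch of how a downloaded weights file could be checked against such a pointer, assuming the three-line `version` / `oid` / `size` layout shown in the diff: the helpers `parse_lfs_pointer` and `matches_pointer` are hypothetical names, not part of Git LFS or any library.

```python
import hashlib
from pathlib import Path

# Pointer text as it appears on the "+" side of the tf_model.h5 diff.
NEW_POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:d4ad4b80352ff701b9a8b2e9c243bb956f79f7346e65ede91216d4140578f4a4
size 1625925412
"""


def parse_lfs_pointer(text):
    """Split a pointer file into its fields (hypothetical helper)."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer):
    """Return True if the file at `path` has the pointer's size and digest."""
    path = Path(path)
    if path.stat().st_size != pointer["size"]:
        return False
    hasher = hashlib.new(pointer["algo"])
    with path.open("rb") as handle:
        # Read in 1 MiB chunks so a large weights file is never fully in memory.
        for chunk in iter(lambda: handle.read(1 << 20), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


pointer = parse_lfs_pointer(NEW_POINTER)
print(pointer["algo"], pointer["size"])
```

Calling `matches_pointer("tf_model.h5", pointer)` on a local copy (the path is an assumption) would then report whether that copy corresponds to this commit's weights rather than the parent's.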