bthomas commited on
Commit
67b1b57
1 Parent(s): 9204b85

update model card README.md

Browse files
Files changed (1) hide show
  1. README.md +73 -0
README.md ADDED
@@ -0,0 +1,73 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ tags:
4
+ - mlm
5
+ - generated_from_trainer
6
+ model-index:
7
+ - name: article2keyword2.1b_barthez-orangesum-title_finetuned16_for_mlm
8
+ results: []
9
+ ---
10
+
11
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
12
+ should probably proofread and complete it, then remove this comment. -->
13
+
14
+ # article2keyword2.1b_barthez-orangesum-title_finetuned16_for_mlm
15
+
16
+ This model is a fine-tuned version of [moussaKam/barthez-orangesum-title](https://huggingface.co/moussaKam/barthez-orangesum-title) on an unknown dataset.
17
+ It achieves the following results on the evaluation set:
18
+ - Loss: 0.0525
19
+
20
+ ## Model description
21
+
22
+ More information needed
23
+
24
+ ## Intended uses & limitations
25
+
26
+ More information needed
27
+
28
+ ## Training and evaluation data
29
+
30
+ More information needed
31
+
32
+ ## Training procedure
33
+
34
+ ### Training hyperparameters
35
+
36
+ The following hyperparameters were used during training:
37
+ - learning_rate: 2e-05
38
+ - train_batch_size: 4
39
+ - eval_batch_size: 4
40
+ - seed: 42
41
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
42
+ - lr_scheduler_type: linear
43
+ - num_epochs: 16
44
+ - mixed_precision_training: Native AMP
45
+
46
+ ### Training results
47
+
48
+ | Training Loss | Epoch | Step | Validation Loss |
49
+ |:-------------:|:-----:|:-----:|:---------------:|
50
+ | 0.2976 | 1.0 | 1353 | 0.0543 |
51
+ | 0.0566 | 2.0 | 2706 | 0.0509 |
52
+ | 0.0487 | 3.0 | 4059 | 0.0458 |
53
+ | 0.0433 | 4.0 | 5412 | 0.0456 |
54
+ | 0.04 | 5.0 | 6765 | 0.0460 |
55
+ | 0.0373 | 6.0 | 8118 | 0.0454 |
56
+ | 0.0355 | 7.0 | 9471 | 0.0465 |
57
+ | 0.0328 | 8.0 | 10824 | 0.0474 |
58
+ | 0.0317 | 9.0 | 12177 | 0.0470 |
59
+ | 0.03 | 10.0 | 13530 | 0.0488 |
60
+ | 0.0285 | 11.0 | 14883 | 0.0489 |
61
+ | 0.0272 | 12.0 | 16236 | 0.0500 |
62
+ | 0.0262 | 13.0 | 17589 | 0.0510 |
63
+ | 0.0258 | 14.0 | 18942 | 0.0511 |
64
+ | 0.0245 | 15.0 | 20295 | 0.0522 |
65
+ | 0.0239 | 16.0 | 21648 | 0.0525 |
66
+
67
+
68
+ ### Framework versions
69
+
70
+ - Transformers 4.21.1
71
+ - Pytorch 1.11.0
72
+ - Datasets 2.3.2
73
+ - Tokenizers 0.12.1