Anorak commited on
Commit
29eed85
1 Parent(s): da99698

End of training

Browse files
README.md ADDED
@@ -0,0 +1,76 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ tags:
3
+ - generated_from_trainer
4
+ model-index:
5
+ - name: layoutlm-funsd
6
+ results: []
7
+ ---
8
+
9
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
10
+ should probably proofread and complete it, then remove this comment. -->
11
+
12
+ # layoutlm-funsd
13
+
14
+ This model is a fine-tuned version of [microsoft/layoutlm-base-uncased](https://huggingface.co/microsoft/layoutlm-base-uncased) on an unknown dataset.
15
+ It achieves the following results on the evaluation set:
16
+ - Loss: 0.6857
17
+ - Answer: {'precision': 0.7176981541802389, 'recall': 0.8170580964153276, 'f1': 0.7641618497109827, 'number': 809}
18
+ - Header: {'precision': 0.28368794326241137, 'recall': 0.33613445378151263, 'f1': 0.3076923076923077, 'number': 119}
19
+ - Question: {'precision': 0.7773820124666073, 'recall': 0.819718309859155, 'f1': 0.7979890310786105, 'number': 1065}
20
+ - Overall Precision: 0.7204
21
+ - Overall Recall: 0.7898
22
+ - Overall F1: 0.7535
23
+ - Overall Accuracy: 0.8139
24
+
25
+ ## Model description
26
+
27
+ More information needed
28
+
29
+ ## Intended uses & limitations
30
+
31
+ More information needed
32
+
33
+ ## Training and evaluation data
34
+
35
+ More information needed
36
+
37
+ ## Training procedure
38
+
39
+ ### Training hyperparameters
40
+
41
+ The following hyperparameters were used during training:
42
+ - learning_rate: 3e-05
43
+ - train_batch_size: 16
44
+ - eval_batch_size: 8
45
+ - seed: 42
46
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
47
+ - lr_scheduler_type: linear
48
+ - num_epochs: 15
49
+ - mixed_precision_training: Native AMP
50
+
51
+ ### Training results
52
+
53
+ | Training Loss | Epoch | Step | Validation Loss | Answer | Header | Question | Overall Precision | Overall Recall | Overall F1 | Overall Accuracy |
54
+ |:-------------:|:-----:|:----:|:---------------:|:-------------------------------------------------------------------------------------------------------------:|:-------------------------------------------------------------------------------------------------------------:|:-----------------------------------------------------------------------------------------------------------:|:-----------------:|:--------------:|:----------:|:----------------:|
55
+ | 1.8064 | 1.0 | 10 | 1.6080 | {'precision': 0.020618556701030927, 'recall': 0.012360939431396786, 'f1': 0.01545595054095827, 'number': 809} | {'precision': 0.0, 'recall': 0.0, 'f1': 0.0, 'number': 119} | {'precision': 0.2702127659574468, 'recall': 0.11924882629107982, 'f1': 0.16547231270358306, 'number': 1065} | 0.1435 | 0.0687 | 0.0929 | 0.3378 |
56
+ | 1.4826 | 2.0 | 20 | 1.2520 | {'precision': 0.20166320166320167, 'recall': 0.23980222496909764, 'f1': 0.21908526256352345, 'number': 809} | {'precision': 0.0, 'recall': 0.0, 'f1': 0.0, 'number': 119} | {'precision': 0.4309507286606523, 'recall': 0.5830985915492958, 'f1': 0.49561053471667993, 'number': 1065} | 0.3392 | 0.4089 | 0.3708 | 0.5993 |
57
+ | 1.1438 | 3.0 | 30 | 0.9584 | {'precision': 0.463519313304721, 'recall': 0.5339925834363412, 'f1': 0.49626651349798967, 'number': 809} | {'precision': 0.0, 'recall': 0.0, 'f1': 0.0, 'number': 119} | {'precision': 0.6199664429530202, 'recall': 0.6938967136150235, 'f1': 0.6548515728843598, 'number': 1065} | 0.5492 | 0.5876 | 0.5678 | 0.6897 |
58
+ | 0.8546 | 4.0 | 40 | 0.7900 | {'precision': 0.5885714285714285, 'recall': 0.7639060568603214, 'f1': 0.6648735879505111, 'number': 809} | {'precision': 0.06666666666666667, 'recall': 0.025210084033613446, 'f1': 0.036585365853658534, 'number': 119} | {'precision': 0.6505823627287853, 'recall': 0.7342723004694836, 'f1': 0.6898985443317159, 'number': 1065} | 0.6108 | 0.7040 | 0.6541 | 0.7537 |
59
+ | 0.6765 | 5.0 | 50 | 0.7144 | {'precision': 0.6514047866805411, 'recall': 0.7737948084054388, 'f1': 0.7073446327683616, 'number': 809} | {'precision': 0.09230769230769231, 'recall': 0.05042016806722689, 'f1': 0.06521739130434782, 'number': 119} | {'precision': 0.7019810508182601, 'recall': 0.7652582159624414, 'f1': 0.7322551662174304, 'number': 1065} | 0.6616 | 0.7260 | 0.6923 | 0.7773 |
60
+ | 0.5613 | 6.0 | 60 | 0.6796 | {'precision': 0.6635514018691588, 'recall': 0.7898640296662547, 'f1': 0.7212189616252822, 'number': 809} | {'precision': 0.15306122448979592, 'recall': 0.12605042016806722, 'f1': 0.1382488479262673, 'number': 119} | {'precision': 0.7274320771253286, 'recall': 0.7793427230046949, 'f1': 0.7524932003626473, 'number': 1065} | 0.6739 | 0.7446 | 0.7075 | 0.7927 |
61
+ | 0.4872 | 7.0 | 70 | 0.6554 | {'precision': 0.6592517694641051, 'recall': 0.8059332509270705, 'f1': 0.7252502780867631, 'number': 809} | {'precision': 0.22549019607843138, 'recall': 0.19327731092436976, 'f1': 0.20814479638009048, 'number': 119} | {'precision': 0.7383177570093458, 'recall': 0.815962441314554, 'f1': 0.775200713648528, 'number': 1065} | 0.6808 | 0.7747 | 0.7247 | 0.7997 |
62
+ | 0.4334 | 8.0 | 80 | 0.6526 | {'precision': 0.6941176470588235, 'recall': 0.8022249690976514, 'f1': 0.7442660550458714, 'number': 809} | {'precision': 0.24545454545454545, 'recall': 0.226890756302521, 'f1': 0.23580786026200873, 'number': 119} | {'precision': 0.7493627867459643, 'recall': 0.828169014084507, 'f1': 0.7867975022301517, 'number': 1065} | 0.7012 | 0.7817 | 0.7393 | 0.8035 |
63
+ | 0.3941 | 9.0 | 90 | 0.6694 | {'precision': 0.7048997772828508, 'recall': 0.7824474660074165, 'f1': 0.741652021089631, 'number': 809} | {'precision': 0.22099447513812154, 'recall': 0.33613445378151263, 'f1': 0.26666666666666666, 'number': 119} | {'precision': 0.7218984179850125, 'recall': 0.8140845070422535, 'f1': 0.76522506619594, 'number': 1065} | 0.6754 | 0.7727 | 0.7208 | 0.8007 |
64
+ | 0.3556 | 10.0 | 100 | 0.6607 | {'precision': 0.694006309148265, 'recall': 0.8158220024721878, 'f1': 0.75, 'number': 809} | {'precision': 0.25, 'recall': 0.2773109243697479, 'f1': 0.26294820717131473, 'number': 119} | {'precision': 0.7846153846153846, 'recall': 0.8140845070422535, 'f1': 0.7990783410138248, 'number': 1065} | 0.7130 | 0.7827 | 0.7462 | 0.8068 |
65
+ | 0.3245 | 11.0 | 110 | 0.6728 | {'precision': 0.6990595611285266, 'recall': 0.826946847960445, 'f1': 0.7576443941109853, 'number': 809} | {'precision': 0.2892561983471074, 'recall': 0.29411764705882354, 'f1': 0.2916666666666667, 'number': 119} | {'precision': 0.7817703768624014, 'recall': 0.8375586854460094, 'f1': 0.8087035358114233, 'number': 1065} | 0.7192 | 0.8008 | 0.7578 | 0.8089 |
66
+ | 0.3113 | 12.0 | 120 | 0.6799 | {'precision': 0.71875, 'recall': 0.796044499381953, 'f1': 0.755425219941349, 'number': 809} | {'precision': 0.25903614457831325, 'recall': 0.36134453781512604, 'f1': 0.3017543859649123, 'number': 119} | {'precision': 0.775330396475771, 'recall': 0.8262910798122066, 'f1': 0.8, 'number': 1065} | 0.7132 | 0.7863 | 0.7480 | 0.8106 |
67
+ | 0.2921 | 13.0 | 130 | 0.6836 | {'precision': 0.7070063694267515, 'recall': 0.823238566131026, 'f1': 0.7607081667618503, 'number': 809} | {'precision': 0.32432432432432434, 'recall': 0.3025210084033613, 'f1': 0.31304347826086953, 'number': 119} | {'precision': 0.7976513098464318, 'recall': 0.8291079812206573, 'f1': 0.8130755064456722, 'number': 1065} | 0.7338 | 0.7953 | 0.7633 | 0.8122 |
68
+ | 0.2841 | 14.0 | 140 | 0.6848 | {'precision': 0.7150537634408602, 'recall': 0.8220024721878862, 'f1': 0.7648073605520415, 'number': 809} | {'precision': 0.26666666666666666, 'recall': 0.33613445378151263, 'f1': 0.2973977695167286, 'number': 119} | {'precision': 0.7841726618705036, 'recall': 0.8187793427230047, 'f1': 0.8011024345429489, 'number': 1065} | 0.7194 | 0.7913 | 0.7536 | 0.8127 |
69
+ | 0.2793 | 15.0 | 150 | 0.6857 | {'precision': 0.7176981541802389, 'recall': 0.8170580964153276, 'f1': 0.7641618497109827, 'number': 809} | {'precision': 0.28368794326241137, 'recall': 0.33613445378151263, 'f1': 0.3076923076923077, 'number': 119} | {'precision': 0.7773820124666073, 'recall': 0.819718309859155, 'f1': 0.7979890310786105, 'number': 1065} | 0.7204 | 0.7898 | 0.7535 | 0.8139 |
70
+
71
+
72
+ ### Framework versions
73
+
74
+ - Transformers 4.27.0.dev0
75
+ - Pytorch 1.8.0+cu101
76
+ - Tokenizers 0.13.2
logs/events.out.tfevents.1676015492.1fe1c94ef7fa.4434.0 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:73a6858d754dffee200e60151caef5da0e406f2f7a7227a0658b0c31eba986e9
3
- size 13234
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:3bd50ec7577aaf8e4c00832cc22a36fdc2d6887a976efad1105b10f6f6fee8cd
3
+ size 14249
preprocessor_config.json ADDED
@@ -0,0 +1,14 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "apply_ocr": true,
3
+ "do_resize": true,
4
+ "feature_extractor_type": "LayoutLMv2FeatureExtractor",
5
+ "image_processor_type": "LayoutLMv2ImageProcessor",
6
+ "ocr_lang": null,
7
+ "processor_class": "LayoutLMv2Processor",
8
+ "resample": 2,
9
+ "size": {
10
+ "height": 224,
11
+ "width": 224
12
+ },
13
+ "tesseract_config": ""
14
+ }
pytorch_model.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:9d81cc50b8b2de2cc3d206ff29800bf384c6cccf58fe7525b432597e0d52da8e
3
  size 450610229
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c8905430cf95d93adb36a795974480ab2d76635d90949211ba03aba20f2837ed
3
  size 450610229
special_tokens_map.json ADDED
@@ -0,0 +1,7 @@
 
 
 
 
 
 
 
 
1
+ {
2
+ "cls_token": "[CLS]",
3
+ "mask_token": "[MASK]",
4
+ "pad_token": "[PAD]",
5
+ "sep_token": "[SEP]",
6
+ "unk_token": "[UNK]"
7
+ }
tokenizer.json ADDED
The diff for this file is too large to render. See raw diff
 
tokenizer_config.json ADDED
@@ -0,0 +1,38 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "additional_special_tokens": null,
3
+ "apply_ocr": false,
4
+ "cls_token": "[CLS]",
5
+ "cls_token_box": [
6
+ 0,
7
+ 0,
8
+ 0,
9
+ 0
10
+ ],
11
+ "do_basic_tokenize": true,
12
+ "do_lower_case": true,
13
+ "mask_token": "[MASK]",
14
+ "model_max_length": 512,
15
+ "never_split": null,
16
+ "only_label_first_subword": true,
17
+ "pad_token": "[PAD]",
18
+ "pad_token_box": [
19
+ 0,
20
+ 0,
21
+ 0,
22
+ 0
23
+ ],
24
+ "pad_token_label": -100,
25
+ "processor_class": "LayoutLMv2Processor",
26
+ "sep_token": "[SEP]",
27
+ "sep_token_box": [
28
+ 1000,
29
+ 1000,
30
+ 1000,
31
+ 1000
32
+ ],
33
+ "special_tokens_map_file": null,
34
+ "strip_accents": null,
35
+ "tokenize_chinese_chars": true,
36
+ "tokenizer_class": "LayoutLMv2Tokenizer",
37
+ "unk_token": "[UNK]"
38
+ }
vocab.txt ADDED
The diff for this file is too large to render. See raw diff