przybytniowska commited on
Commit
17fcd1a
1 Parent(s): ac72aaa

Training complete

Browse files
Files changed (4) hide show
  1. README.md +52 -0
  2. pytorch_model.bin +1 -1
  3. test_metrics.json +3 -0
  4. train_losses.csv +33 -0
README.md ADDED
@@ -0,0 +1,52 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ base_model: allenai/longformer-base-4096
4
+ tags:
5
+ - generated_from_trainer
6
+ model-index:
7
+ - name: longformer_base_4096_QA_SQUAD_adamw_torch
8
+ results: []
9
+ ---
10
+
11
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
12
+ should probably proofread and complete it, then remove this comment. -->
13
+
14
+ # longformer_base_4096_QA_SQUAD_adamw_torch
15
+
16
+ This model is a fine-tuned version of [allenai/longformer-base-4096](https://huggingface.co/allenai/longformer-base-4096) on an unknown dataset.
17
+
18
+ ## Model description
19
+
20
+ More information needed
21
+
22
+ ## Intended uses & limitations
23
+
24
+ More information needed
25
+
26
+ ## Training and evaluation data
27
+
28
+ More information needed
29
+
30
+ ## Training procedure
31
+
32
+ ### Training hyperparameters
33
+
34
+ The following hyperparameters were used during training:
35
+ - learning_rate: 2e-05
36
+ - train_batch_size: 8
37
+ - eval_batch_size: 8
38
+ - seed: 42
39
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
40
+ - lr_scheduler_type: linear
41
+ - num_epochs: 5
42
+
43
+ ### Training results
44
+
45
+
46
+
47
+ ### Framework versions
48
+
49
+ - Transformers 4.34.1
50
+ - Pytorch 2.3.0+cu118
51
+ - Datasets 2.19.0
52
+ - Tokenizers 0.14.1
pytorch_model.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:7d876167a6a110fc6def9ff9257390527919e2526739b2d2775c7fa3f197609f
3
  size 594739726
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:fb365be77c434dc44453ec3ec51c71f9d9306ba3b4db3ecc72ff357006a22f3f
3
  size 594739726
test_metrics.json ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ {
2
+ "test_accuracy": 0.95428
3
+ }
train_losses.csv ADDED
@@ -0,0 +1,33 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ loss,epoch
2
+ 0.3612,0.16
3
+ 0.2863,0.32
4
+ 0.297,0.48
5
+ 0.2678,0.64
6
+ 0.2396,0.8
7
+ 0.2529,0.96
8
+ 0.181,1.12
9
+ 0.1898,1.28
10
+ 0.1684,1.44
11
+ 0.1543,1.6
12
+ 0.166,1.76
13
+ 0.1706,1.92
14
+ 0.1293,2.08
15
+ 0.0913,2.24
16
+ 0.0976,2.4
17
+ 0.0956,2.56
18
+ 0.0955,2.72
19
+ 0.11,2.88
20
+ 0.0768,3.04
21
+ 0.0522,3.2
22
+ 0.0478,3.36
23
+ 0.0428,3.52
24
+ 0.0492,3.68
25
+ 0.0421,3.84
26
+ 0.0329,4.0
27
+ 0.0224,4.16
28
+ 0.0199,4.32
29
+ 0.0213,4.48
30
+ 0.023,4.64
31
+ 0.0237,4.8
32
+ 0.0196,4.96
33
+ 0.12260504020690918,5.0