rajevan123 commited on
Commit
1144b43
·
verified ·
1 Parent(s): 9c7a8c9

End of training

Browse files
README.md ADDED
@@ -0,0 +1,91 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: cc-by-4.0
3
+ library_name: peft
4
+ tags:
5
+ - generated_from_trainer
6
+ metrics:
7
+ - accuracy
8
+ base_model: deepset/roberta-base-squad2
9
+ model-index:
10
+ - name: STS-Lora-Fine-Tuning-Capstone-roberta-base-deepset-filtered-120-with-higher-r-mid
11
+ results: []
12
+ ---
13
+
14
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
15
+ should probably proofread and complete it, then remove this comment. -->
16
+
17
+ # STS-Lora-Fine-Tuning-Capstone-roberta-base-deepset-filtered-120-with-higher-r-mid
18
+
19
+ This model is a fine-tuned version of [deepset/roberta-base-squad2](https://huggingface.co/deepset/roberta-base-squad2) on the None dataset.
20
+ It achieves the following results on the evaluation set:
21
+ - Loss: 0.7040
22
+ - Accuracy: 0.6854
23
+
24
+ ## Model description
25
+
26
+ More information needed
27
+
28
+ ## Intended uses & limitations
29
+
30
+ More information needed
31
+
32
+ ## Training and evaluation data
33
+
34
+ More information needed
35
+
36
+ ## Training procedure
37
+
38
+ ### Training hyperparameters
39
+
40
+ The following hyperparameters were used during training:
41
+ - learning_rate: 3e-05
42
+ - train_batch_size: 64
43
+ - eval_batch_size: 64
44
+ - seed: 42
45
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
46
+ - lr_scheduler_type: linear
47
+ - num_epochs: 30
48
+
49
+ ### Training results
50
+
51
+ | Training Loss | Epoch | Step | Validation Loss | Accuracy |
52
+ |:-------------:|:-----:|:----:|:---------------:|:--------:|
53
+ | No log | 1.0 | 57 | 1.0545 | 0.4551 |
54
+ | No log | 2.0 | 114 | 1.0270 | 0.5 |
55
+ | No log | 3.0 | 171 | 0.9823 | 0.5262 |
56
+ | No log | 4.0 | 228 | 0.9416 | 0.5356 |
57
+ | No log | 5.0 | 285 | 0.8917 | 0.5805 |
58
+ | No log | 6.0 | 342 | 0.7931 | 0.6180 |
59
+ | No log | 7.0 | 399 | 0.7627 | 0.6423 |
60
+ | No log | 8.0 | 456 | 0.7679 | 0.6573 |
61
+ | 0.7963 | 9.0 | 513 | 0.7380 | 0.6610 |
62
+ | 0.7963 | 10.0 | 570 | 0.7256 | 0.6760 |
63
+ | 0.7963 | 11.0 | 627 | 0.7223 | 0.6742 |
64
+ | 0.7963 | 12.0 | 684 | 0.7255 | 0.6779 |
65
+ | 0.7963 | 13.0 | 741 | 0.7132 | 0.6779 |
66
+ | 0.7963 | 14.0 | 798 | 0.7097 | 0.6835 |
67
+ | 0.7963 | 15.0 | 855 | 0.7116 | 0.6760 |
68
+ | 0.7963 | 16.0 | 912 | 0.7200 | 0.6760 |
69
+ | 0.7963 | 17.0 | 969 | 0.7176 | 0.6760 |
70
+ | 0.615 | 18.0 | 1026 | 0.7133 | 0.6798 |
71
+ | 0.615 | 19.0 | 1083 | 0.7121 | 0.6798 |
72
+ | 0.615 | 20.0 | 1140 | 0.7117 | 0.6873 |
73
+ | 0.615 | 21.0 | 1197 | 0.7028 | 0.6816 |
74
+ | 0.615 | 22.0 | 1254 | 0.7033 | 0.6854 |
75
+ | 0.615 | 23.0 | 1311 | 0.7054 | 0.6798 |
76
+ | 0.615 | 24.0 | 1368 | 0.7059 | 0.6854 |
77
+ | 0.615 | 25.0 | 1425 | 0.6996 | 0.6835 |
78
+ | 0.615 | 26.0 | 1482 | 0.7045 | 0.6835 |
79
+ | 0.5826 | 27.0 | 1539 | 0.7032 | 0.6854 |
80
+ | 0.5826 | 28.0 | 1596 | 0.7030 | 0.6854 |
81
+ | 0.5826 | 29.0 | 1653 | 0.7046 | 0.6854 |
82
+ | 0.5826 | 30.0 | 1710 | 0.7040 | 0.6854 |
83
+
84
+
85
+ ### Framework versions
86
+
87
+ - PEFT 0.10.0
88
+ - Transformers 4.38.2
89
+ - Pytorch 2.2.1+cu121
90
+ - Datasets 2.18.0
91
+ - Tokenizers 0.15.2
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:4e4ea888147226507834dd28cb0c7a243f1420a1d851033767451d826b5879ff
3
  size 2968756
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:296aadd02e5eb9275a61a8acaf2c5b2c53b1c9a67bec52a9aa0a54dee8340e3a
3
  size 2968756
runs/Mar28_15-22-33_a7e03f7291c9/events.out.tfevents.1711639354.a7e03f7291c9.182.2 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:2ae30a3c78ee03a3d2cd4778c1dca4b8dfdf0a408a9bfb9e59057e48aa689432
3
- size 15082
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:659a76146dbf3dd2063736ec3fd1d3c0ccc89642276b291b9f66ecd67f319e85
3
+ size 15759