makhataei committed on
Commit 4b195f0
1 Parent(s): 5bbf6a9

End of training

README.md CHANGED
@@ -1,9 +1,7 @@
  ---
- base_model: m3hrdadfi/xlmr-large-qa-fa
+ base_model: makhataei/qa-persian-xlmr-large
  tags:
  - generated_from_trainer
- datasets:
- - pquad
  model-index:
  - name: qa-persian-xlmr-large
    results: []
@@ -14,9 +12,9 @@ should probably proofread and complete it, then remove this comment. -->

  # qa-persian-xlmr-large

- This model is a fine-tuned version of [m3hrdadfi/xlmr-large-qa-fa](https://huggingface.co/m3hrdadfi/xlmr-large-qa-fa) on the pquad dataset.
+ This model is a fine-tuned version of [makhataei/qa-persian-xlmr-large](https://huggingface.co/makhataei/qa-persian-xlmr-large) on an unknown dataset.
  It achieves the following results on the evaluation set:
- - Loss: 1.4953
+ - Loss: 5.0366

  ## Model description

@@ -35,24 +33,33 @@ More information needed
  ### Training hyperparameters

  The following hyperparameters were used during training:
- - learning_rate: 1e-05
- - train_batch_size: 1
- - eval_batch_size: 1
+ - learning_rate: 5e-06
+ - train_batch_size: 14
+ - eval_batch_size: 14
  - seed: 42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: linear
- - num_epochs: 0.01
+ - num_epochs: 10

  ### Training results

  | Training Loss | Epoch | Step | Validation Loss |
  |:-------------:|:-----:|:----:|:---------------:|
- | 1.6966 | 0.01 | 640 | 1.4953 |
+ | 2.0175 | 1.0 | 202 | 1.5773 |
+ | 1.17 | 2.0 | 404 | 1.7608 |
+ | 0.6861 | 3.0 | 606 | 2.2780 |
+ | 0.4457 | 4.0 | 808 | 2.8859 |
+ | 0.2626 | 5.0 | 1010 | 3.9207 |
+ | 0.1862 | 6.0 | 1212 | 4.6119 |
+ | 0.1264 | 7.0 | 1414 | 4.8694 |
+ | 0.0786 | 8.0 | 1616 | 4.8824 |
+ | 0.0566 | 9.0 | 1818 | 4.9686 |
+ | 0.0571 | 10.0 | 2020 | 5.0366 |


  ### Framework versions

- - Transformers 4.35.2
- - Pytorch 2.1.0+cu118
- - Datasets 2.15.0
- - Tokenizers 0.15.0
+ - Transformers 4.38.1
+ - Pytorch 2.1.0+cu121
+ - Datasets 2.18.0
+ - Tokenizers 0.15.2
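Reviewer note on the README.md change: in the added "Training results" rows, training loss falls every epoch while validation loss is lowest after epoch 1 and rises afterwards, and the reported evaluation loss (5.0366) matches the last row. The sketch below only re-reads the numbers from the table to make that visible. The names `ROWS`, `best_checkpoint`, and `epochs_where_validation_worsened` are illustrative and are not part of the commit or of any library.

```python
# Added rows of the "Training results" table, copied from the diff:
# (epoch, step, training loss, validation loss).
ROWS = [
    (1.0, 202, 2.0175, 1.5773),
    (2.0, 404, 1.17, 1.7608),
    (3.0, 606, 0.6861, 2.2780),
    (4.0, 808, 0.4457, 2.8859),
    (5.0, 1010, 0.2626, 3.9207),
    (6.0, 1212, 0.1862, 4.6119),
    (7.0, 1414, 0.1264, 4.8694),
    (8.0, 1616, 0.0786, 4.8824),
    (9.0, 1818, 0.0566, 4.9686),
    (10.0, 2020, 0.0571, 5.0366),
]


def best_checkpoint(rows):
    """Return the row with the lowest validation loss."""
    return min(rows, key=lambda row: row[3])


def epochs_where_validation_worsened(rows):
    """Return the epochs whose validation loss exceeds the previous epoch's."""
    return [cur[0] for prev, cur in zip(rows, rows[1:]) if cur[3] > prev[3]]


best = best_checkpoint(ROWS)
print(f"lowest validation loss {best[3]} at epoch {best[0]} (step {best[1]})")
print("validation loss rose at epochs:", epochs_where_validation_worsened(ROWS))
```

If the intent was to keep the checkpoint with the lowest validation loss, this pattern is usually a reason to evaluate per epoch and select on that metric. Whether that was done here is not stated in the commit.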
model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:71ee20493190f2dc7051dba696729ebabb25b10287cfde84027c433e9fcd1fe4
+ oid sha256:412385474279f65c8a72b7d4419c6103555d730b0cae4cd24663a96def553418
  size 2235420048
runs/Mar03_05-19-09_c3bd18d56594/events.out.tfevents.1709443151.c3bd18d56594.3825.1 CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:2e761b28027c9d9ae908f7b1bf49659cac884b854af1c2dee583a49461317479
- size 18989
+ oid sha256:01da4f30ac1568d532cb2413f7f439a7bd8fb6d53f9e5270619c49d68433daf0
+ size 19614
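Reviewer note on the two binary files: both are Git LFS pointer files, so the diff shows only the `oid` and `size` fields rather than the content. A minimal sketch of reading such a pointer follows, assuming the three-line `key value` layout shown in the diff. `parse_lfs_pointer` is a hypothetical helper written for this note, not a Git LFS or Hugging Face API. The values used are the old and new `model.safetensors` pointers from this commit.

```python
def parse_lfs_pointer(text):
    """Parse a Git LFS pointer (lines of 'key value') into a dict."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


# Old and new pointers for model.safetensors, copied from the diff.
old = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:71ee20493190f2dc7051dba696729ebabb25b10287cfde84027c433e9fcd1fe4\n"
    "size 2235420048\n"
)
new = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:412385474279f65c8a72b7d4419c6103555d730b0cae4cd24663a96def553418\n"
    "size 2235420048\n"
)

# Same byte size but a different hash: the file content changed.
print("size unchanged:", old["size"] == new["size"])
print("content changed:", old["digest"] != new["digest"])
```

An unchanged size with a changed hash is what one would expect when weights of the same architecture and dtype are overwritten with new values.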