Akari committed on
Commit 89dfe29 (1 parent: 2ba7f9f)

update model card README.md

  1. README.md +16 -4
README.md CHANGED
@@ -1,4 +1,5 @@
  ---
+ license: apache-2.0
  tags:
  - generated_from_trainer
  datasets:
@@ -13,7 +14,9 @@ should probably proofread and complete it, then remove this comment. -->

  # albert-base-v2-finetuned-squad

- This model was trained from scratch on the squad_v2 dataset.
+ This model is a fine-tuned version of [albert-base-v2](https://huggingface.co/albert-base-v2) on the squad_v2 dataset.
+ It achieves the following results on the evaluation set:
+ - Loss: 0.9492

  ## Model description

@@ -33,16 +36,25 @@ More information needed

  The following hyperparameters were used during training:
  - learning_rate: 2e-05
- - train_batch_size: 8
- - eval_batch_size: 8
+ - train_batch_size: 16
+ - eval_batch_size: 16
  - seed: 42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: linear
  - num_epochs: 3

+ ### Training results
+
+ | Training Loss | Epoch | Step | Validation Loss |
+ |:-------------:|:-----:|:-----:|:---------------:|
+ | 0.8695 | 1.0 | 8248 | 0.8813 |
+ | 0.6333 | 2.0 | 16496 | 0.8042 |
+ | 0.4372 | 3.0 | 24744 | 0.9492 |
+
+
  ### Framework versions

  - Transformers 4.12.3
- - Pytorch 1.5.1+cu101
+ - Pytorch 1.7.1
  - Datasets 1.15.1
  - Tokenizers 0.10.3
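
Review note: the numbers this commit adds can be cross-checked with a few lines of arithmetic. The sketch below is not part of the commit. It copies the rows of the new "Training results" table and the updated `train_batch_size`, checks that the step column advances by a constant amount per epoch, and finds the epoch with the lowest validation loss. The range of implied training features is an assumption: it only holds for a single device, no gradient accumulation, and a retained final partial batch, none of which the model card states.

```python
# Rows copied from the "Training results" table added in this commit:
# (training loss, epoch, cumulative step, validation loss)
rows = [
    (0.8695, 1.0, 8248, 0.8813),
    (0.6333, 2.0, 16496, 0.8042),
    (0.4372, 3.0, 24744, 0.9492),
]
train_batch_size = 16  # value from the updated hyperparameter list

# If the step column is cumulative, step / epoch is the same for every row.
steps_per_epoch = {round(step / epoch) for _, epoch, step, _ in rows}
assert len(steps_per_epoch) == 1
spe = steps_per_epoch.pop()

# ASSUMPTION (not stated in the model card): one device, no gradient
# accumulation, last partial batch kept. Under that assumption the number
# of training features per epoch falls in this inclusive range.
min_features = (spe - 1) * train_batch_size + 1
max_features = spe * train_batch_size

# Epoch with the lowest validation loss.
best = min(rows, key=lambda r: r[3])

print(f"steps per epoch: {spe}")
print(f"implied training features: {min_features} to {max_features}")
print(f"lowest validation loss: {best[3]} at epoch {best[1]}")
```

The headline "Loss: 0.9492" in the new card text matches the epoch 3 row, while the table's lowest validation loss (0.8042) is at epoch 2, so the card appears to report the final checkpoint rather than the best one.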