BAT007 commited on
Commit
ace2bbc
1 Parent(s): e260e0b

End of training

Browse files
README.md ADDED
@@ -0,0 +1,62 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ base_model: distilbert-base-uncased
4
+ tags:
5
+ - generated_from_trainer
6
+ datasets:
7
+ - squad
8
+ model-index:
9
+ - name: distilbert-base-uncased-finetuned-squad
10
+ results: []
11
+ ---
12
+
13
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
14
+ should probably proofread and complete it, then remove this comment. -->
15
+
16
+ # distilbert-base-uncased-finetuned-squad
17
+
18
+ This model is a fine-tuned version of [distilbert-base-uncased](https://huggingface.co/distilbert-base-uncased) on the squad dataset.
19
+ It achieves the following results on the evaluation set:
20
+ - Loss: 1.1985
21
+
22
+ ## Model description
23
+
24
+ More information needed
25
+
26
+ ## Intended uses & limitations
27
+
28
+ More information needed
29
+
30
+ ## Training and evaluation data
31
+
32
+ More information needed
33
+
34
+ ## Training procedure
35
+
36
+ ### Training hyperparameters
37
+
38
+ The following hyperparameters were used during training:
39
+ - learning_rate: 2e-05
40
+ - train_batch_size: 64
41
+ - eval_batch_size: 64
42
+ - seed: 42
43
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
44
+ - lr_scheduler_type: linear
45
+ - num_epochs: 3
46
+ - mixed_precision_training: Native AMP
47
+
48
+ ### Training results
49
+
50
+ | Training Loss | Epoch | Step | Validation Loss |
51
+ |:-------------:|:-----:|:----:|:---------------:|
52
+ | 0.9174 | 1.0 | 1384 | 1.1447 |
53
+ | 0.7865 | 2.0 | 2768 | 1.1475 |
54
+ | 0.6567 | 3.0 | 4152 | 1.1985 |
55
+
56
+
57
+ ### Framework versions
58
+
59
+ - Transformers 4.35.2
60
+ - Pytorch 2.1.0+cu118
61
+ - Datasets 2.15.0
62
+ - Tokenizers 0.15.0
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:d02d3a4302248040a56beec4957afa7928cfbaf5ad24318823d411bb2b81166c
3
  size 265470032
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:288b33bffc7baec369e301ef10ec428ba4cc94e01fe88ef16a9881cf36796f08
3
  size 265470032
runs/Dec03_16-48-51_d345135d6ae0/events.out.tfevents.1701622156.d345135d6ae0.5111.0 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:dd635548527d236c0ee454fbf0336f55cad875777abcae803b6e9c86155c0b33
3
- size 5887
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c5575ff4c7d0cc85800b51d285927d0dfe0695cb8021b479aae6d87ea726bb65
3
+ size 6158
runs/Dec03_17-56-16_d345135d6ae0/events.out.tfevents.1701626187.d345135d6ae0.5111.1 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:2f1a7cc49276d4832c5a862394aaaa90cd9d84a7b07dcdfbeb4ad07e9719a751
3
+ size 6620
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:935df6539d63bda92fbbe91065cec0c5b955f93f1551a2acca1f9cf87159ee86
3
  size 4600
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:5612864f1044442711c70b971dc3dfc8e1a9e8df4c85dd5304e3ab1a5812e07e
3
  size 4600