gokulsrinivasagan commited on
Commit
1299a0d
1 Parent(s): c951b90

Model save

Browse files
README.md ADDED
@@ -0,0 +1,67 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ library_name: transformers
3
+ base_model: gokulsrinivasagan/bert_tiny_lda_20_v1_book
4
+ tags:
5
+ - generated_from_trainer
6
+ metrics:
7
+ - accuracy
8
+ model-index:
9
+ - name: bert_tiny_lda_20_v1_book_qnli
10
+ results: []
11
+ ---
12
+
13
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
14
+ should probably proofread and complete it, then remove this comment. -->
15
+
16
+ # bert_tiny_lda_20_v1_book_qnli
17
+
18
+ This model is a fine-tuned version of [gokulsrinivasagan/bert_tiny_lda_20_v1_book](https://huggingface.co/gokulsrinivasagan/bert_tiny_lda_20_v1_book) on an unknown dataset.
19
+ It achieves the following results on the evaluation set:
20
+ - Loss: 0.6667
21
+ - Accuracy: 0.7957
22
+
23
+ ## Model description
24
+
25
+ More information needed
26
+
27
+ ## Intended uses & limitations
28
+
29
+ More information needed
30
+
31
+ ## Training and evaluation data
32
+
33
+ More information needed
34
+
35
+ ## Training procedure
36
+
37
+ ### Training hyperparameters
38
+
39
+ The following hyperparameters were used during training:
40
+ - learning_rate: 5e-05
41
+ - train_batch_size: 256
42
+ - eval_batch_size: 256
43
+ - seed: 10
44
+ - optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
45
+ - lr_scheduler_type: linear
46
+ - num_epochs: 50
47
+
48
+ ### Training results
49
+
50
+ | Training Loss | Epoch | Step | Validation Loss | Accuracy |
51
+ |:-------------:|:-----:|:----:|:---------------:|:--------:|
52
+ | 0.5552 | 1.0 | 410 | 0.5006 | 0.7705 |
53
+ | 0.4442 | 2.0 | 820 | 0.4149 | 0.8155 |
54
+ | 0.378 | 3.0 | 1230 | 0.4112 | 0.8170 |
55
+ | 0.3178 | 4.0 | 1640 | 0.4808 | 0.8021 |
56
+ | 0.2591 | 5.0 | 2050 | 0.4954 | 0.7992 |
57
+ | 0.2115 | 6.0 | 2460 | 0.5165 | 0.8072 |
58
+ | 0.1671 | 7.0 | 2870 | 0.6289 | 0.7983 |
59
+ | 0.1377 | 8.0 | 3280 | 0.6667 | 0.7957 |
60
+
61
+
62
+ ### Framework versions
63
+
64
+ - Transformers 4.46.3
65
+ - Pytorch 2.2.1+cu118
66
+ - Datasets 2.17.0
67
+ - Tokenizers 0.20.3
logs/events.out.tfevents.1733840627.ki-g0008.683966.22 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:88821269180350630635be32508f3b13bb22a9a242a57cf5752d403d2fafe12c
3
- size 8848
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:bbc7e699823597ec5cf0e7f1b7ec75cdc3052363f44a9add9f9a16de64fa664e
3
+ size 9736
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:77abd07d1f7eccc49ac0d29ecde3ea98616dd4a9504edd0c1ec14eb6fd2030bd
3
  size 131856744
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:cf89b1362da7f71de3a43573f8919e47778721c1aaf2db5cc23595d556cc125f
3
  size 131856744