Saurabh54 committed
Commit 6cfb5c0 · 1 Parent(s): ad8249e

Upload model

Files changed (3)
  1. README.md +3 -63
  2. adapter_config.json +3 -3
  3. adapter_model.bin +2 -2
README.md CHANGED
@@ -1,69 +1,9 @@
  ---
- license: apache-2.0
- base_model: distilbert-base-uncased
- tags:
- - generated_from_trainer
- metrics:
- - accuracy
- model-index:
- - name: distilbert-base-uncased-lora-text-classification
-   results: []
+ library_name: peft
  ---
-
- <!-- This model card has been generated automatically according to the information the Trainer had access to. You
- should probably proofread and complete it, then remove this comment. -->
-
- # distilbert-base-uncased-lora-text-classification
-
- This model is a fine-tuned version of [distilbert-base-uncased](https://huggingface.co/distilbert-base-uncased) on the None dataset.
- It achieves the following results on the evaluation set:
- - Loss: 1.4902
- - Accuracy: {'accuracy': 0.5691906005221932}
-
- ## Model description
-
- More information needed
-
- ## Intended uses & limitations
-
- More information needed
-
- ## Training and evaluation data
-
- More information needed
-
  ## Training procedure

- ### Training hyperparameters
-
- The following hyperparameters were used during training:
- - learning_rate: 0.001
- - train_batch_size: 4
- - eval_batch_size: 4
- - seed: 42
- - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- - lr_scheduler_type: linear
- - num_epochs: 10
-
- ### Training results
-
- | Training Loss | Epoch | Step | Validation Loss | Accuracy |
- |:-------------:|:-----:|:----:|:---------------:|:---------------------------------:|
- | 2.1699 | 1.0 | 767 | 1.9578 | {'accuracy': 0.34595300261096606} |
- | 1.7466 | 2.0 | 1534 | 1.6447 | {'accuracy': 0.43994778067885115} |
- | 1.5928 | 3.0 | 2301 | 1.6027 | {'accuracy': 0.48825065274151436} |
- | 1.46 | 4.0 | 3068 | 1.4771 | {'accuracy': 0.5039164490861618} |
- | 1.3395 | 5.0 | 3835 | 1.4685 | {'accuracy': 0.5234986945169713} |
- | 1.2109 | 6.0 | 4602 | 1.4587 | {'accuracy': 0.5509138381201044} |
- | 1.0814 | 7.0 | 5369 | 1.4672 | {'accuracy': 0.5535248041775457} |
- | 0.9708 | 8.0 | 6136 | 1.4799 | {'accuracy': 0.5626631853785901} |
- | 0.9325 | 9.0 | 6903 | 1.4789 | {'accuracy': 0.5652741514360313} |
- | 0.8056 | 10.0 | 7670 | 1.4902 | {'accuracy': 0.5691906005221932} |
-
-
  ### Framework versions

- - Transformers 4.33.0
- - Pytorch 2.0.0
- - Datasets 2.1.0
- - Tokenizers 0.13.3
+
+ - PEFT 0.5.0
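
Reviewer note: the training results table removed from README.md in this commit is the only record of the earlier run, so it may be worth summarizing before it is lost. The sketch below copies the table values verbatim and reports which epoch had the lowest validation loss and which had the highest accuracy. The names `ROWS` and `best_epoch_by` are illustrative, not part of the repository.

```python
# (epoch, training_loss, validation_loss, accuracy), copied from the removed table.
ROWS = [
    (1, 2.1699, 1.9578, 0.34595300261096606),
    (2, 1.7466, 1.6447, 0.43994778067885115),
    (3, 1.5928, 1.6027, 0.48825065274151436),
    (4, 1.46, 1.4771, 0.5039164490861618),
    (5, 1.3395, 1.4685, 0.5234986945169713),
    (6, 1.2109, 1.4587, 0.5509138381201044),
    (7, 1.0814, 1.4672, 0.5535248041775457),
    (8, 0.9708, 1.4799, 0.5626631853785901),
    (9, 0.9325, 1.4789, 0.5652741514360313),
    (10, 0.8056, 1.4902, 0.5691906005221932),
]


def best_epoch_by(rows, column, lowest):
    """Return the epoch whose value in `column` is lowest (or highest)."""
    pick = min if lowest else max
    return pick(rows, key=lambda row: row[column])[0]


best_loss_epoch = best_epoch_by(ROWS, column=2, lowest=True)
best_acc_epoch = best_epoch_by(ROWS, column=3, lowest=False)

print(f"lowest validation loss at epoch {best_loss_epoch}")
print(f"highest accuracy at epoch {best_acc_epoch}")
```

In the removed table, validation loss stops improving after epoch 6 while training loss keeps falling and accuracy keeps creeping up, which is a pattern often associated with the onset of overfitting.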
 
 
adapter_config.json CHANGED
@@ -7,11 +7,11 @@
  "init_lora_weights": true,
  "layers_pattern": null,
  "layers_to_transform": null,
- "lora_alpha": 32,
- "lora_dropout": 0.01,
+ "lora_alpha": 16,
+ "lora_dropout": 0.05,
  "modules_to_save": null,
  "peft_type": "LORA",
- "r": 16,
+ "r": 64,
  "revision": null,
  "target_modules": [
    "q_lin"
adapter_model.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:9553abd24186020990d8538c41bef8bcfa73331bd8ec03ba7604d515d798db7d
- size 3007265
+ oid sha256:820a0cea54d8c23c21d9c077f25ef670315607c2d0bd9b0e1c80b7da3210a971
+ size 4776737
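
Reviewer note: the adapter_config.json change (`r` 16 to 64, `lora_alpha` 32 to 16) and the adapter_model.bin size change can be cross-checked with a little arithmetic. The sketch below assumes standard LoRA scaling (`lora_alpha / r`), the DistilBERT-base dimensions (hidden size 768, 6 transformer layers, one `q_lin` per layer), and float32 weights. None of these are stated in the diff itself, and the helper names are illustrative.

```python
# Assumptions, not taken from the diff: DistilBERT-base shapes and float32 storage.
HIDDEN_SIZE = 768      # input and output width of each q_lin projection
NUM_LAYERS = 6         # one q_lin target module per transformer layer
BYTES_PER_PARAM = 4    # float32


def lora_scaling(lora_alpha, r):
    """Standard LoRA scaling factor applied to the low-rank update."""
    return lora_alpha / r


def lora_param_count(r, d_in=HIDDEN_SIZE, d_out=HIDDEN_SIZE, modules=NUM_LAYERS):
    """Parameters in the A (r x d_in) and B (d_out x r) matrices, over all modules."""
    return modules * r * (d_in + d_out)


old_scale = lora_scaling(lora_alpha=32, r=16)
new_scale = lora_scaling(lora_alpha=16, r=64)

old_params = lora_param_count(r=16)
new_params = lora_param_count(r=64)

# Predicted growth of the checkpoint versus the sizes in the Git LFS pointer.
predicted_delta = (new_params - old_params) * BYTES_PER_PARAM
observed_delta = 4776737 - 3007265

print(f"scaling: {old_scale} -> {new_scale}")
print(f"LoRA parameters: {old_params} -> {new_params}")
print(f"size delta: predicted {predicted_delta}, observed {observed_delta}")
```

Under these assumptions the predicted growth equals the observed size difference, which suggests the larger file is explained by the rank increase alone. Note that the scaling factor drops by a factor of eight, so the higher-rank update is weighted much less strongly than before.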