lizhanyang commited on
Commit
8978c22
·
verified ·
1 Parent(s): dcc429e

lizhanyang/distilbert-base-uncased-lora-text-classification

Browse files
README.md CHANGED
@@ -14,13 +14,13 @@ model-index:
14
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
15
  should probably proofread and complete it, then remove this comment. -->
16
 
17
- [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/lizhanyang718/huggingface/runs/kxmj82z0)
18
  # distilbert-base-uncased-lora-text-classification
19
 
20
  This model is a fine-tuned version of [distilbert-base-uncased](https://huggingface.co/distilbert-base-uncased) on the None dataset.
21
  It achieves the following results on the evaluation set:
22
- - Loss: 1.3091
23
- - Accuracy: {'accuracy': 0.873}
24
 
25
  ## Model description
26
 
@@ -40,8 +40,8 @@ More information needed
40
 
41
  The following hyperparameters were used during training:
42
  - learning_rate: 0.001
43
- - train_batch_size: 4
44
- - eval_batch_size: 4
45
  - seed: 42
46
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
47
  - lr_scheduler_type: linear
@@ -51,16 +51,16 @@ The following hyperparameters were used during training:
51
 
52
  | Training Loss | Epoch | Step | Validation Loss | Accuracy |
53
  |:-------------:|:-----:|:----:|:---------------:|:-------------------:|
54
- | No log | 1.0 | 250 | 0.5954 | {'accuracy': 0.859} |
55
- | 0.4135 | 2.0 | 500 | 0.6391 | {'accuracy': 0.868} |
56
- | 0.4135 | 3.0 | 750 | 0.9071 | {'accuracy': 0.865} |
57
- | 0.2914 | 4.0 | 1000 | 1.0446 | {'accuracy': 0.846} |
58
- | 0.2914 | 5.0 | 1250 | 1.1057 | {'accuracy': 0.86} |
59
- | 0.1852 | 6.0 | 1500 | 1.0235 | {'accuracy': 0.872} |
60
- | 0.1852 | 7.0 | 1750 | 1.1211 | {'accuracy': 0.874} |
61
- | 0.0728 | 8.0 | 2000 | 1.2524 | {'accuracy': 0.873} |
62
- | 0.0728 | 9.0 | 2250 | 1.2872 | {'accuracy': 0.874} |
63
- | 0.0241 | 10.0 | 2500 | 1.3091 | {'accuracy': 0.873} |
64
 
65
 
66
  ### Framework versions
 
14
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
15
  should probably proofread and complete it, then remove this comment. -->
16
 
17
+ [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/lizhanyang718/huggingface/runs/d9kvxf5d)
18
  # distilbert-base-uncased-lora-text-classification
19
 
20
  This model is a fine-tuned version of [distilbert-base-uncased](https://huggingface.co/distilbert-base-uncased) on the None dataset.
21
  It achieves the following results on the evaluation set:
22
+ - Loss: 0.8192
23
+ - Accuracy: {'accuracy': 0.878}
24
 
25
  ## Model description
26
 
 
40
 
41
  The following hyperparameters were used during training:
42
  - learning_rate: 0.001
43
+ - train_batch_size: 8
44
+ - eval_batch_size: 8
45
  - seed: 42
46
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
47
  - lr_scheduler_type: linear
 
51
 
52
  | Training Loss | Epoch | Step | Validation Loss | Accuracy |
53
  |:-------------:|:-----:|:----:|:---------------:|:-------------------:|
54
+ | No log | 1.0 | 125 | 0.3417 | {'accuracy': 0.867} |
55
+ | No log | 2.0 | 250 | 0.2960 | {'accuracy': 0.878} |
56
+ | No log | 3.0 | 375 | 0.4010 | {'accuracy': 0.875} |
57
+ | 0.2767 | 4.0 | 500 | 0.5766 | {'accuracy': 0.874} |
58
+ | 0.2767 | 5.0 | 625 | 0.6314 | {'accuracy': 0.878} |
59
+ | 0.2767 | 6.0 | 750 | 0.6541 | {'accuracy': 0.883} |
60
+ | 0.2767 | 7.0 | 875 | 0.7353 | {'accuracy': 0.887} |
61
+ | 0.0442 | 8.0 | 1000 | 0.7776 | {'accuracy': 0.883} |
62
+ | 0.0442 | 9.0 | 1125 | 0.8157 | {'accuracy': 0.874} |
63
+ | 0.0442 | 10.0 | 1250 | 0.8192 | {'accuracy': 0.878} |
64
 
65
 
66
  ### Framework versions
runs/Sep20_13-28-15_a-Super-Server/events.out.tfevents.1726810097.a-Super-Server.224208.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:07fb3b493dbae92bcf417ca51c79105ee5c3127ed5ec0720da53accda3a46e76
3
+ size 8499
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:c3a4e264c1cb83636651d8a5b23ad48f654ae5a4ae09d929ca7dbe0fa1ba97b5
3
  size 5240
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b89bff1a2013457c3449095d185a01dbc0394d05c86d8f3c1f6124406da4badb
3
  size 5240