lizhanyang/distilbert-base-uncased-lora-text-classification

Browse files

Files changed (3) hide show

README.md +15 -15
runs/Sep20_13-28-15_a-Super-Server/events.out.tfevents.1726810097.a-Super-Server.224208.0 +3 -0
training_args.bin +1 -1

README.md CHANGED Viewed

@@ -14,13 +14,13 @@ model-index:
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
-[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/lizhanyang718/huggingface/runs/kxmj82z0)
 # distilbert-base-uncased-lora-text-classification
 This model is a fine-tuned version of [distilbert-base-uncased](https://huggingface.co/distilbert-base-uncased) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.3091
-- Accuracy: {'accuracy': 0.873}
 ## Model description
@@ -40,8 +40,8 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 0.001
-- train_batch_size: 4
-- eval_batch_size: 4
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
@@ -51,16 +51,16 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss | Accuracy            |
 |:-------------:|:-----:|:----:|:---------------:|:-------------------:|
-| No log        | 1.0   | 250  | 0.5954          | {'accuracy': 0.859} |
-| 0.4135        | 2.0   | 500  | 0.6391          | {'accuracy': 0.868} |
-| 0.4135        | 3.0   | 750  | 0.9071          | {'accuracy': 0.865} |
-| 0.2914        | 4.0   | 1000 | 1.0446          | {'accuracy': 0.846} |
-| 0.2914        | 5.0   | 1250 | 1.1057          | {'accuracy': 0.86}  |
-| 0.1852        | 6.0   | 1500 | 1.0235          | {'accuracy': 0.872} |
-| 0.1852        | 7.0   | 1750 | 1.1211          | {'accuracy': 0.874} |
-| 0.0728        | 8.0   | 2000 | 1.2524          | {'accuracy': 0.873} |
-| 0.0728        | 9.0   | 2250 | 1.2872          | {'accuracy': 0.874} |
-| 0.0241        | 10.0  | 2500 | 1.3091          | {'accuracy': 0.873} |
 ### Framework versions

 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
+[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/lizhanyang718/huggingface/runs/d9kvxf5d)
 # distilbert-base-uncased-lora-text-classification
 This model is a fine-tuned version of [distilbert-base-uncased](https://huggingface.co/distilbert-base-uncased) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.8192
+- Accuracy: {'accuracy': 0.878}
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 0.001
+- train_batch_size: 8
+- eval_batch_size: 8
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 | Training Loss | Epoch | Step | Validation Loss | Accuracy            |
 |:-------------:|:-----:|:----:|:---------------:|:-------------------:|
+| No log        | 1.0   | 125  | 0.3417          | {'accuracy': 0.867} |
+| No log        | 2.0   | 250  | 0.2960          | {'accuracy': 0.878} |
+| No log        | 3.0   | 375  | 0.4010          | {'accuracy': 0.875} |
+| 0.2767        | 4.0   | 500  | 0.5766          | {'accuracy': 0.874} |
+| 0.2767        | 5.0   | 625  | 0.6314          | {'accuracy': 0.878} |
+| 0.2767        | 6.0   | 750  | 0.6541          | {'accuracy': 0.883} |
+| 0.2767        | 7.0   | 875  | 0.7353          | {'accuracy': 0.887} |
+| 0.0442        | 8.0   | 1000 | 0.7776          | {'accuracy': 0.883} |
+| 0.0442        | 9.0   | 1125 | 0.8157          | {'accuracy': 0.874} |
+| 0.0442        | 10.0  | 1250 | 0.8192          | {'accuracy': 0.878} |
 ### Framework versions

runs/Sep20_13-28-15_a-Super-Server/events.out.tfevents.1726810097.a-Super-Server.224208.0 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:07fb3b493dbae92bcf417ca51c79105ee5c3127ed5ec0720da53accda3a46e76
+size 8499

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:c3a4e264c1cb83636651d8a5b23ad48f654ae5a4ae09d929ca7dbe0fa1ba97b5
 size 5240

 version https://git-lfs.github.com/spec/v1
+oid sha256:b89bff1a2013457c3449095d185a01dbc0394d05c86d8f3c1f6124406da4badb
 size 5240