tangminhanh
commited on
Commit
•
cb64a11
1
Parent(s):
d6ff5c7
pd_subcate
Browse files- README.md +20 -18
- model.safetensors +1 -1
- runs/Jul30_08-53-43_06a218ada6c5/events.out.tfevents.1722329627.06a218ada6c5.1464.19 +3 -0
- runs/Jul30_08-54-08_06a218ada6c5/events.out.tfevents.1722329653.06a218ada6c5.1464.20 +3 -0
- runs/Jul30_08-54-29_06a218ada6c5/events.out.tfevents.1722329675.06a218ada6c5.1464.21 +3 -0
- runs/Jul30_08-54-52_06a218ada6c5/events.out.tfevents.1722329699.06a218ada6c5.1464.22 +3 -0
- runs/Jul30_08-54-52_06a218ada6c5/events.out.tfevents.1722330252.06a218ada6c5.1464.23 +3 -0
- training_args.bin +1 -1
README.md
CHANGED
@@ -20,11 +20,11 @@ should probably proofread and complete it, then remove this comment. -->
|
|
20 |
|
21 |
This model is a fine-tuned version of [tangminhanh/pd_cate](https://huggingface.co/tangminhanh/pd_cate) on the None dataset.
|
22 |
It achieves the following results on the evaluation set:
|
23 |
-
- Loss: 0.
|
24 |
-
- Accuracy: 0.
|
25 |
-
- F1: 0.
|
26 |
-
- Precision: 0.
|
27 |
-
- Recall: 0.
|
28 |
|
29 |
## Model description
|
30 |
|
@@ -43,27 +43,29 @@ More information needed
|
|
43 |
### Training hyperparameters
|
44 |
|
45 |
The following hyperparameters were used during training:
|
46 |
-
- learning_rate:
|
47 |
-
- train_batch_size:
|
48 |
-
- eval_batch_size:
|
49 |
- seed: 42
|
50 |
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
|
51 |
- lr_scheduler_type: linear
|
52 |
-
-
|
|
|
53 |
|
54 |
### Training results
|
55 |
|
56 |
| Training Loss | Epoch | Step | Validation Loss | Accuracy | F1 | Precision | Recall |
|
57 |
|:-------------:|:-----:|:----:|:---------------:|:--------:|:------:|:---------:|:------:|
|
58 |
-
| No log | 1.0 |
|
59 |
-
| No log | 2.0 |
|
60 |
-
| No log | 3.0 |
|
61 |
-
| No log | 4.0 |
|
62 |
-
| No log | 5.0 |
|
63 |
-
|
|
64 |
-
|
|
65 |
-
|
|
66 |
-
|
|
|
|
67 |
|
68 |
|
69 |
### Framework versions
|
|
|
20 |
|
21 |
This model is a fine-tuned version of [tangminhanh/pd_cate](https://huggingface.co/tangminhanh/pd_cate) on the None dataset.
|
22 |
It achieves the following results on the evaluation set:
|
23 |
+
- Loss: 0.1565
|
24 |
+
- Accuracy: 0.4012
|
25 |
+
- F1: 0.4557
|
26 |
+
- Precision: 0.4964
|
27 |
+
- Recall: 0.4211
|
28 |
|
29 |
## Model description
|
30 |
|
|
|
43 |
### Training hyperparameters
|
44 |
|
45 |
The following hyperparameters were used during training:
|
46 |
+
- learning_rate: 5e-05
|
47 |
+
- train_batch_size: 32
|
48 |
+
- eval_batch_size: 64
|
49 |
- seed: 42
|
50 |
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
|
51 |
- lr_scheduler_type: linear
|
52 |
+
- lr_scheduler_warmup_steps: 500
|
53 |
+
- num_epochs: 10
|
54 |
|
55 |
### Training results
|
56 |
|
57 |
| Training Loss | Epoch | Step | Validation Loss | Accuracy | F1 | Precision | Recall |
|
58 |
|:-------------:|:-----:|:----:|:---------------:|:--------:|:------:|:---------:|:------:|
|
59 |
+
| No log | 1.0 | 84 | 0.1407 | 0.1485 | 0.2350 | 0.5051 | 0.1531 |
|
60 |
+
| No log | 2.0 | 168 | 0.1388 | 0.1914 | 0.2952 | 0.5255 | 0.2052 |
|
61 |
+
| No log | 3.0 | 252 | 0.1415 | 0.2175 | 0.3119 | 0.4389 | 0.2420 |
|
62 |
+
| No log | 4.0 | 336 | 0.1420 | 0.3032 | 0.3794 | 0.4626 | 0.3216 |
|
63 |
+
| No log | 5.0 | 420 | 0.1484 | 0.2894 | 0.3620 | 0.4490 | 0.3032 |
|
64 |
+
| 0.1113 | 6.0 | 504 | 0.1462 | 0.3124 | 0.3852 | 0.4525 | 0.3354 |
|
65 |
+
| 0.1113 | 7.0 | 588 | 0.1460 | 0.3614 | 0.4441 | 0.4972 | 0.4012 |
|
66 |
+
| 0.1113 | 8.0 | 672 | 0.1534 | 0.3629 | 0.4269 | 0.4730 | 0.3890 |
|
67 |
+
| 0.1113 | 9.0 | 756 | 0.1533 | 0.3966 | 0.4551 | 0.4929 | 0.4227 |
|
68 |
+
| 0.1113 | 10.0 | 840 | 0.1565 | 0.4012 | 0.4557 | 0.4964 | 0.4211 |
|
69 |
|
70 |
|
71 |
### Framework versions
|
model.safetensors
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 567653928
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:3cc008d32a101b1e784cf0ba904fbe89b141ce9e28c41018122e91f1cf0582ff
|
3 |
size 567653928
|
runs/Jul30_08-53-43_06a218ada6c5/events.out.tfevents.1722329627.06a218ada6c5.1464.19
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:76f97c9d54ed5a3e09e9a3be98a7986bc3a432762c816c86580877ce52d75052
|
3 |
+
size 7063
|
runs/Jul30_08-54-08_06a218ada6c5/events.out.tfevents.1722329653.06a218ada6c5.1464.20
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:783e59ec074ee0146fa9f84b22a4d6c569313408f22086ffa32cad534afca21a
|
3 |
+
size 7064
|
runs/Jul30_08-54-29_06a218ada6c5/events.out.tfevents.1722329675.06a218ada6c5.1464.21
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:03618ffd7d23b9e13bc514c1bf605831fff926eb40201480a29dc8cdef0d7008
|
3 |
+
size 7063
|
runs/Jul30_08-54-52_06a218ada6c5/events.out.tfevents.1722329699.06a218ada6c5.1464.22
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:46cf2ab1cc0e82aaed741f98374dfd1c0a755e359c87f1668b03b49bf16b31d3
|
3 |
+
size 12341
|
runs/Jul30_08-54-52_06a218ada6c5/events.out.tfevents.1722330252.06a218ada6c5.1464.23
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:6e2bb231992a805066a875964f6f196b09cbe1da57dd473261c0477a8f821a1d
|
3 |
+
size 560
|
training_args.bin
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 5112
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:df678d2ec4f8a2061dd974bd9f4ba34fc820c9a11aa7c60d8f56e2bbe7d1cd43
|
3 |
size 5112
|