tangminhanh commited on
Commit
cb64a11
1 Parent(s): d6ff5c7

pd_subcate

Browse files
README.md CHANGED
@@ -20,11 +20,11 @@ should probably proofread and complete it, then remove this comment. -->
20
 
21
  This model is a fine-tuned version of [tangminhanh/pd_cate](https://huggingface.co/tangminhanh/pd_cate) on the None dataset.
22
  It achieves the following results on the evaluation set:
23
- - Loss: 0.1442
24
- - Accuracy: 0.1378
25
- - F1: 0.2169
26
- - Precision: 0.4892
27
- - Recall: 0.1394
28
 
29
  ## Model description
30
 
@@ -43,27 +43,29 @@ More information needed
43
  ### Training hyperparameters
44
 
45
  The following hyperparameters were used during training:
46
- - learning_rate: 2e-05
47
- - train_batch_size: 64
48
- - eval_batch_size: 128
49
  - seed: 42
50
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
51
  - lr_scheduler_type: linear
52
- - num_epochs: 9
 
53
 
54
  ### Training results
55
 
56
  | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1 | Precision | Recall |
57
  |:-------------:|:-----:|:----:|:---------------:|:--------:|:------:|:---------:|:------:|
58
- | No log | 1.0 | 42 | 0.2339 | 0.0 | 0.0 | 0.0 | 0.0 |
59
- | No log | 2.0 | 84 | 0.1855 | 0.0 | 0.0 | 0.0 | 0.0 |
60
- | No log | 3.0 | 126 | 0.1683 | 0.0 | 0.0 | 0.0 | 0.0 |
61
- | No log | 4.0 | 168 | 0.1580 | 0.0 | 0.0 | 0.0 | 0.0 |
62
- | No log | 5.0 | 210 | 0.1522 | 0.0291 | 0.0536 | 0.3393 | 0.0291 |
63
- | No log | 6.0 | 252 | 0.1485 | 0.1026 | 0.1716 | 0.5234 | 0.1026 |
64
- | No log | 7.0 | 294 | 0.1460 | 0.1149 | 0.1859 | 0.4870 | 0.1149 |
65
- | No log | 8.0 | 336 | 0.1447 | 0.1286 | 0.2056 | 0.4885 | 0.1302 |
66
- | No log | 9.0 | 378 | 0.1442 | 0.1378 | 0.2169 | 0.4892 | 0.1394 |
 
67
 
68
 
69
  ### Framework versions
 
20
 
21
  This model is a fine-tuned version of [tangminhanh/pd_cate](https://huggingface.co/tangminhanh/pd_cate) on the None dataset.
22
  It achieves the following results on the evaluation set:
23
+ - Loss: 0.1565
24
+ - Accuracy: 0.4012
25
+ - F1: 0.4557
26
+ - Precision: 0.4964
27
+ - Recall: 0.4211
28
 
29
  ## Model description
30
 
 
43
  ### Training hyperparameters
44
 
45
  The following hyperparameters were used during training:
46
+ - learning_rate: 5e-05
47
+ - train_batch_size: 32
48
+ - eval_batch_size: 64
49
  - seed: 42
50
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
51
  - lr_scheduler_type: linear
52
+ - lr_scheduler_warmup_steps: 500
53
+ - num_epochs: 10
54
 
55
  ### Training results
56
 
57
  | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1 | Precision | Recall |
58
  |:-------------:|:-----:|:----:|:---------------:|:--------:|:------:|:---------:|:------:|
59
+ | No log | 1.0 | 84 | 0.1407 | 0.1485 | 0.2350 | 0.5051 | 0.1531 |
60
+ | No log | 2.0 | 168 | 0.1388 | 0.1914 | 0.2952 | 0.5255 | 0.2052 |
61
+ | No log | 3.0 | 252 | 0.1415 | 0.2175 | 0.3119 | 0.4389 | 0.2420 |
62
+ | No log | 4.0 | 336 | 0.1420 | 0.3032 | 0.3794 | 0.4626 | 0.3216 |
63
+ | No log | 5.0 | 420 | 0.1484 | 0.2894 | 0.3620 | 0.4490 | 0.3032 |
64
+ | 0.1113 | 6.0 | 504 | 0.1462 | 0.3124 | 0.3852 | 0.4525 | 0.3354 |
65
+ | 0.1113 | 7.0 | 588 | 0.1460 | 0.3614 | 0.4441 | 0.4972 | 0.4012 |
66
+ | 0.1113 | 8.0 | 672 | 0.1534 | 0.3629 | 0.4269 | 0.4730 | 0.3890 |
67
+ | 0.1113 | 9.0 | 756 | 0.1533 | 0.3966 | 0.4551 | 0.4929 | 0.4227 |
68
+ | 0.1113 | 10.0 | 840 | 0.1565 | 0.4012 | 0.4557 | 0.4964 | 0.4211 |
69
 
70
 
71
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:c34c047c78357b5a828f60cbd74b54e7c34fc2d3debbb394c7a58ecb799f41f2
3
  size 567653928
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:3cc008d32a101b1e784cf0ba904fbe89b141ce9e28c41018122e91f1cf0582ff
3
  size 567653928
runs/Jul30_08-53-43_06a218ada6c5/events.out.tfevents.1722329627.06a218ada6c5.1464.19 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:76f97c9d54ed5a3e09e9a3be98a7986bc3a432762c816c86580877ce52d75052
3
+ size 7063
runs/Jul30_08-54-08_06a218ada6c5/events.out.tfevents.1722329653.06a218ada6c5.1464.20 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:783e59ec074ee0146fa9f84b22a4d6c569313408f22086ffa32cad534afca21a
3
+ size 7064
runs/Jul30_08-54-29_06a218ada6c5/events.out.tfevents.1722329675.06a218ada6c5.1464.21 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:03618ffd7d23b9e13bc514c1bf605831fff926eb40201480a29dc8cdef0d7008
3
+ size 7063
runs/Jul30_08-54-52_06a218ada6c5/events.out.tfevents.1722329699.06a218ada6c5.1464.22 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:46cf2ab1cc0e82aaed741f98374dfd1c0a755e359c87f1668b03b49bf16b31d3
3
+ size 12341
runs/Jul30_08-54-52_06a218ada6c5/events.out.tfevents.1722330252.06a218ada6c5.1464.23 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:6e2bb231992a805066a875964f6f196b09cbe1da57dd473261c0477a8f821a1d
3
+ size 560
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:0513152b5405bad2a18c77def092b876cecbfe25ce918eaa1966dd1ecb73ee56
3
  size 5112
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:df678d2ec4f8a2061dd974bd9f4ba34fc820c9a11aa7c60d8f56e2bbe7d1cd43
3
  size 5112