Realgon commited on
Commit
e341798
1 Parent(s): 3c729fb

End of training

Browse files
README.md CHANGED
@@ -32,8 +32,8 @@ should probably proofread and complete it, then remove this comment. -->
32
 
33
  This model is a fine-tuned version of [distilbert-base-uncased](https://huggingface.co/distilbert-base-uncased) on the imdb dataset.
34
  It achieves the following results on the evaluation set:
35
- - Loss: 0.7142
36
  - Accuracy: 0.9309
 
37
 
38
  ## Model description
39
 
@@ -66,14 +66,14 @@ The following hyperparameters were used during training:
66
  |:-------------:|:-----:|:-----:|:--------:|:---------------:|
67
  | 0.0473 | 1.0 | 1563 | 0.9279 | 0.4618 |
68
  | 0.0096 | 2.0 | 3126 | 0.929 | 0.5406 |
69
- | 0.0328 | 3.0 | 4689 | 0.5954 | 0.92 |
70
- | 0.0192 | 4.0 | 6252 | 0.5570 | 0.9288 |
71
- | 0.0171 | 5.0 | 7815 | 0.5905 | 0.9294 |
72
- | 0.006 | 6.0 | 9378 | 0.6330 | 0.9301 |
73
- | 0.0084 | 7.0 | 10941 | 0.6311 | 0.9270 |
74
- | 0.0003 | 8.0 | 12504 | 0.6783 | 0.9288 |
75
- | 0.0048 | 9.0 | 14067 | 0.6987 | 0.9315 |
76
- | 0.0001 | 10.0 | 15630 | 0.7142 | 0.9309 |
77
 
78
 
79
  ### Framework versions
 
32
 
33
  This model is a fine-tuned version of [distilbert-base-uncased](https://huggingface.co/distilbert-base-uncased) on the imdb dataset.
34
  It achieves the following results on the evaluation set:
 
35
  - Accuracy: 0.9309
36
+ - Loss: 0.7142
37
 
38
  ## Model description
39
 
 
66
  |:-------------:|:-----:|:-----:|:--------:|:---------------:|
67
  | 0.0473 | 1.0 | 1563 | 0.9279 | 0.4618 |
68
  | 0.0096 | 2.0 | 3126 | 0.929 | 0.5406 |
69
+ | 0.0328 | 3.0 | 4689 | 0.92 | 0.5954 |
70
+ | 0.0192 | 4.0 | 6252 | 0.9288 | 0.5570 |
71
+ | 0.0171 | 5.0 | 7815 | 0.9294 | 0.5905 |
72
+ | 0.006 | 6.0 | 9378 | 0.9301 | 0.6330 |
73
+ | 0.0084 | 7.0 | 10941 | 0.9270 | 0.6311 |
74
+ | 0.0003 | 8.0 | 12504 | 0.9288 | 0.6783 |
75
+ | 0.0048 | 9.0 | 14067 | 0.9315 | 0.6987 |
76
+ | 0.0001 | 10.0 | 15630 | 0.9309 | 0.7142 |
77
 
78
 
79
  ### Framework versions
config.json CHANGED
@@ -22,7 +22,6 @@
22
  "n_heads": 12,
23
  "n_layers": 6,
24
  "pad_token_id": 0,
25
- "problem_type": "single_label_classification",
26
  "qa_dropout": 0.1,
27
  "seq_classif_dropout": 0.2,
28
  "sinusoidal_pos_embds": false,
 
22
  "n_heads": 12,
23
  "n_layers": 6,
24
  "pad_token_id": 0,
 
25
  "qa_dropout": 0.1,
26
  "seq_classif_dropout": 0.2,
27
  "sinusoidal_pos_embds": false,
runs/Nov20_08-20-01_HAN/events.out.tfevents.1700439627.HAN.8728.7 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:01b8162e73e87bf0d4bde4fe1d164f3fc2e87a5d7658cc5dd778da135c3b69d2
3
+ size 4609
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:f93cd87bd069a8b51ed3ded6710459133d101c1912aa22b10f816dde10004e1d
3
  size 4091
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4da1ac085a94a662a1d0a44f44c5127c054bc7f7007ea0bf275a885a390d292a
3
  size 4091