Electro98 committed
Commit
699d327
1 Parent(s): 4e20810

End of training

Files changed (3)
  1. README.md +9 -32
  2. config.json +1 -0
  3. tf_model.h5 +1 -1
README.md CHANGED
@@ -14,11 +14,11 @@ probably proofread and complete it, then remove this comment. -->
 
  This model is a fine-tuned version of [distilbert/distilbert-base-uncased](https://huggingface.co/distilbert/distilbert-base-uncased) on an unknown dataset.
  It achieves the following results on the evaluation set:
- - Train Loss: 0.0126
- - Validation Loss: 3.6501
- - Train F1: 0.4317
- - Train Accuracy: 0.5275
- - Epoch: 25
+ - Train Loss: 0.1805
+ - Validation Loss: 0.1696
+ - Train F1: 0.1393
+ - Train Accuracy: 0.1994
+ - Epoch: 2
 
  ## Model description
 
@@ -37,39 +37,16 @@ More information needed
  ### Training hyperparameters
 
  The following hyperparameters were used during training:
- - optimizer: {'name': 'Adam', 'learning_rate': {'class_name': 'PolynomialDecay', 'config': {'initial_learning_rate': 2e-05, 'decay_steps': 81390, 'end_learning_rate': 0.0, 'power': 1.0, 'cycle': False, 'name': None}}, 'decay': 0.0, 'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-08, 'amsgrad': False}
+ - optimizer: {'name': 'Adam', 'learning_rate': {'class_name': 'PolynomialDecay', 'config': {'initial_learning_rate': 2e-05, 'decay_steps': 135650, 'end_learning_rate': 0.0, 'power': 1.0, 'cycle': False, 'name': None}}, 'decay': 0.0, 'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-08, 'amsgrad': False}
  - training_precision: float32
 
  ### Training results
 
  | Train Loss | Validation Loss | Train F1 | Train Accuracy | Epoch |
  |:----------:|:---------------:|:--------:|:--------------:|:-----:|
- | 1.7183 | 1.4170 | 0.4193 | 0.5821 | 0 |
- | 1.2940 | 1.3890 | 0.4335 | 0.5747 | 1 |
- | 1.0652 | 1.4416 | 0.4512 | 0.5718 | 2 |
- | 0.8341 | 1.5888 | 0.4364 | 0.5533 | 3 |
- | 0.6159 | 1.7822 | 0.4280 | 0.5392 | 4 |
- | 0.4387 | 1.9859 | 0.4357 | 0.5301 | 5 |
- | 0.3102 | 2.2088 | 0.4346 | 0.5257 | 6 |
- | 0.2183 | 2.3689 | 0.4353 | 0.5386 | 7 |
- | 0.1620 | 2.5631 | 0.4396 | 0.5379 | 8 |
- | 0.1254 | 2.6995 | 0.4314 | 0.5342 | 9 |
- | 0.0959 | 2.8182 | 0.4285 | 0.5333 | 10 |
- | 0.0788 | 2.8996 | 0.4204 | 0.5334 | 11 |
- | 0.0677 | 3.0209 | 0.4347 | 0.5318 | 12 |
- | 0.0562 | 3.1115 | 0.4282 | 0.5222 | 13 |
- | 0.0493 | 3.1710 | 0.4306 | 0.5268 | 14 |
- | 0.0435 | 3.1507 | 0.4280 | 0.5322 | 15 |
- | 0.0391 | 3.3222 | 0.4165 | 0.5110 | 16 |
- | 0.0321 | 3.3243 | 0.4218 | 0.5309 | 17 |
- | 0.0298 | 3.3675 | 0.4252 | 0.5307 | 18 |
- | 0.0255 | 3.4341 | 0.4148 | 0.5217 | 19 |
- | 0.0230 | 3.4253 | 0.4311 | 0.5250 | 20 |
- | 0.0195 | 3.5133 | 0.4278 | 0.5233 | 21 |
- | 0.0166 | 3.5915 | 0.4277 | 0.5301 | 22 |
- | 0.0165 | 3.5547 | 0.4191 | 0.5340 | 23 |
- | 0.0142 | 3.6109 | 0.4333 | 0.5362 | 24 |
- | 0.0126 | 3.6501 | 0.4317 | 0.5275 | 25 |
+ | 0.2043 | 0.1756 | 0.0314 | 0.0396 | 0 |
+ | 0.2031 | 0.1816 | 0.1156 | 0.2583 | 1 |
+ | 0.1805 | 0.1696 | 0.1393 | 0.1994 | 2 |
 
 
  ### Framework versions
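The only value that changes in the optimizer dict is `decay_steps` (81390 → 135650). With `power: 1.0` and `cycle: False`, a PolynomialDecay schedule reduces to a straight linear ramp from `initial_learning_rate` down to `end_learning_rate` over `decay_steps`, then holds. A minimal pure-Python sketch of that formula (an illustration of the schedule in the dict above, not the actual `tf.keras` class):

```python
def polynomial_decay_lr(step: int,
                        initial_lr: float = 2e-05,
                        end_lr: float = 0.0,
                        decay_steps: int = 135650,
                        power: float = 1.0) -> float:
    """Learning rate at `step` under a Keras-style PolynomialDecay
    schedule with cycle=False: decay toward end_lr, then hold."""
    step = min(step, decay_steps)  # past decay_steps the LR stays at end_lr
    return (initial_lr - end_lr) * (1.0 - step / decay_steps) ** power + end_lr
```

With `power=1.0` this is linear: the LR is 2e-05 at step 0, half that at the midpoint, and 0 from step 135650 onward.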
config.json CHANGED
@@ -74,6 +74,7 @@
  "n_heads": 12,
  "n_layers": 6,
  "pad_token_id": 0,
+ "problem_type": "multi_label_classification",
  "qa_dropout": 0.1,
  "seq_classif_dropout": 0.2,
  "sinusoidal_pos_embds": false,
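The new `problem_type` key switches the classification head to multi-label mode: each label gets an independent sigmoid (binary cross-entropy loss) instead of a softmax over classes, so several labels can be active at once. A minimal sketch of the corresponding inference step (plain Python; the 0.5 threshold is a common default and an assumption here, not something this commit sets):

```python
import math

def multi_label_predict(logits: list[float], threshold: float = 0.5) -> list[int]:
    """Turn raw per-label logits into independent 0/1 predictions,
    as multi-label classification implies (sigmoid per label,
    not a softmax across labels)."""
    probs = [1.0 / (1.0 + math.exp(-z)) for z in logits]
    return [int(p >= threshold) for p in probs]
```

Note that, unlike softmax, the predictions are independent: `[2.0, 1.5, 1.0]` can yield all three labels active.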
tf_model.h5 CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:31d3d358a2efc883e5b03b8e23e9a9dc9385652d85f801db235697f9b397d722
+ oid sha256:dd472e2aea5edcf1e1ffadb87858b13d76ec72cb6d0e3783ebfca4190cc0d653
  size 268031680
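What the repo actually versions for `tf_model.h5` is a Git LFS pointer file (a `version` line, an `oid sha256:<digest>` line, and a `size` line), not the weights themselves; this commit swaps the digest while the payload stays 268031680 bytes. A small sketch of parsing such a pointer into its fields:

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse a Git LFS pointer file of the form:
        version https://git-lfs.github.com/spec/v1
        oid sha256:<hex digest>
        size <bytes>
    into its version, hash algorithm, digest, and size fields."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "oid": digest,
        "size": int(fields["size"]),
    }
```

Running this on the new pointer from the diff above recovers the `dd472e…` digest and the 268031680-byte size.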