helling100
/

Regression_bert_1

Text Classification

Transformers

TensorFlow

distilbert

generated_from_keras_callback

Inference Endpoints

Model card Files Files and versions Community

helling100 commited on Mar 23, 2023

Commit

387208f

1 Parent(s): 40ea54f

Upload TFBertForSequenceClassification

Browse files

Files changed (3) hide show

README.md +32 -22
config.json +2 -20
tf_model.h5 +2 -2

README.md CHANGED Viewed

@@ -14,17 +14,17 @@ probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [bert-base-cased](https://huggingface.co/bert-base-cased) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Train Loss: 0.5177
-- Train Mae: 0.3287
-- Train Mse: 0.1635
-- Train R2-score: -3.2462
-- Train Accuracy: 0.5354
-- Validation Loss: 0.1634
-- Validation Mae: 0.3519
-- Validation Mse: 0.1614
-- Validation R2-score: -0.4800
-- Validation Accuracy: 0.4459
-- Epoch: 9
 ## Model description
@@ -43,23 +43,33 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- optimizer: {'name': 'Adam', 'weight_decay': None, 'clipnorm': None, 'global_clipnorm': None, 'clipvalue': None, 'use_ema': False, 'ema_momentum': 0.99, 'ema_overwrite_frequency': None, 'jit_compile': True, 'is_legacy_optimizer': False, 'learning_rate': 0.0002, 'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-07, 'amsgrad': False}
 - training_precision: float32
 ### Training results
 | Train Loss | Train Mae | Train Mse | Train R2-score | Train Accuracy | Validation Loss | Validation Mae | Validation Mse | Validation R2-score | Validation Accuracy | Epoch |
 |:----------:|:---------:|:---------:|:--------------:|:--------------:|:---------------:|:--------------:|:--------------:|:-------------------:|:-------------------:|:-----:|
-| 0.5954     | 0.3730    | 0.2146    | -3.4803        | 0.4931         | 0.2190          | 0.4056         | 0.2174         | -0.8845             | 0.3622              | 0     |
-| 0.8021     | 0.3763    | 0.2082    | -2.7115        | 0.4777         | 0.1827          | 0.3730         | 0.1810         | -0.5508             | 0.3838              | 1     |
-| 0.6147     | 0.3404    | 0.1752    | -2.1219        | 0.5223         | 0.2622          | 0.4567         | 0.2613         | -0.9969             | 0.2811              | 2     |
-| 0.6303     | 0.3447    | 0.1768    | -2.9520        | 0.5154         | 0.2331          | 0.4248         | 0.2321         | -0.7839             | 0.2811              | 3     |
-| 0.3887     | 0.3369    | 0.1734    | -2.7189        | 0.5262         | 0.2114          | 0.4056         | 0.2101         | -0.6812             | 0.2865              | 4     |
-| 0.3735     | 0.3176    | 0.1515    | -5.1147        | 0.5292         | 0.1646          | 0.3546         | 0.1630         | -0.3295             | 0.2703              | 5     |
-| 0.4549     | 0.3358    | 0.1716    | -1.5835        | 0.5323         | 0.1803          | 0.3670         | 0.1786         | -0.5113             | 0.3108              | 6     |
-| 0.5800     | 0.3221    | 0.1587    | -2.4273        | 0.5385         | 0.2369          | 0.4334         | 0.2358         | -0.8347             | 0.2919              | 7     |
-| 0.4042     | 0.3339    | 0.1714    | -3.7265        | 0.5300         | 0.1818          | 0.3766         | 0.1804         | -0.4590             | 0.2703              | 8     |
-| 0.5177     | 0.3287    | 0.1635    | -3.2462        | 0.5354         | 0.1634          | 0.3519         | 0.1614         | -0.4800             | 0.4459              | 9     |
 ### Framework versions

 This model is a fine-tuned version of [bert-base-cased](https://huggingface.co/bert-base-cased) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Train Loss: 0.2489
+- Train Mae: 0.3355
+- Train Mse: 0.1670
+- Train R2-score: 0.5318
+- Train Accuracy: 0.5
+- Validation Loss: 0.2163
+- Validation Mae: 0.4087
+- Validation Mse: 0.2153
+- Validation R2-score: 0.8371
+- Validation Accuracy: 0.2703
+- Epoch: 19
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- optimizer: {'name': 'Adam', 'weight_decay': None, 'clipnorm': None, 'global_clipnorm': None, 'clipvalue': None, 'use_ema': False, 'ema_momentum': 0.99, 'ema_overwrite_frequency': None, 'jit_compile': True, 'is_legacy_optimizer': False, 'learning_rate': 1e-06, 'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-07, 'amsgrad': False}
 - training_precision: float32
 ### Training results
 | Train Loss | Train Mae | Train Mse | Train R2-score | Train Accuracy | Validation Loss | Validation Mae | Validation Mse | Validation R2-score | Validation Accuracy | Epoch |
 |:----------:|:---------:|:---------:|:--------------:|:--------------:|:---------------:|:--------------:|:--------------:|:-------------------:|:-------------------:|:-----:|
+| 0.2907     | 0.3158    | 0.1575    | 0.6537         | 0.5692         | 0.2087          | 0.3991         | 0.2076         | 0.8421              | 0.2703              | 0     |
+| 0.5109     | 0.3124    | 0.1437    | 0.7067         | 0.5538         | 0.2134          | 0.4051         | 0.2124         | 0.8390              | 0.2703              | 1     |
+| 0.3757     | 0.3131    | 0.1489    | 0.7306         | 0.5538         | 0.2138          | 0.4055         | 0.2127         | 0.8388              | 0.2703              | 2     |
+| 0.5703     | 0.3369    | 0.1733    | 0.6746         | 0.5385         | 0.2097          | 0.4004         | 0.2086         | 0.8414              | 0.2703              | 3     |
+| 0.3149     | 0.3314    | 0.1616    | 0.6958         | 0.5154         | 0.2090          | 0.3995         | 0.2079         | 0.8419              | 0.2703              | 4     |
+| 0.3633     | 0.3331    | 0.1653    | 0.6961         | 0.5154         | 0.2083          | 0.3986         | 0.2072         | 0.8423              | 0.2703              | 5     |
+| 0.2274     | 0.3384    | 0.1795    | 0.6844         | 0.5231         | 0.2075          | 0.3975         | 0.2064         | 0.8429              | 0.2703              | 6     |
+| 0.2552     | 0.3141    | 0.1496    | 0.4397         | 0.5615         | 0.2061          | 0.3957         | 0.2050         | 0.8438              | 0.2703              | 7     |
+| 0.2650     | 0.3459    | 0.1772    | 0.6305         | 0.4615         | 0.2043          | 0.3934         | 0.2032         | 0.8449              | 0.2703              | 8     |
+| 0.3674     | 0.3251    | 0.1647    | 0.6980         | 0.4923         | 0.2086          | 0.3990         | 0.2075         | 0.8421              | 0.2703              | 9     |
+| 0.4815     | 0.3122    | 0.1546    | 0.6067         | 0.5538         | 0.2078          | 0.3979         | 0.2067         | 0.8427              | 0.2703              | 10    |
+| 0.4321     | 0.3446    | 0.1783    | 0.6082         | 0.5308         | 0.2068          | 0.3966         | 0.2056         | 0.8433              | 0.2703              | 11    |
+| 0.3884     | 0.3257    | 0.1637    | 0.6823         | 0.5077         | 0.2038          | 0.3928         | 0.2027         | 0.8452              | 0.2703              | 12    |
+| 0.2694     | 0.3353    | 0.1719    | 0.6679         | 0.5385         | 0.2026          | 0.3912         | 0.2014         | 0.8460              | 0.2703              | 13    |
+| 0.3124     | 0.3223    | 0.1605    | 0.6018         | 0.5231         | 0.2067          | 0.3965         | 0.2055         | 0.8434              | 0.2703              | 14    |
+| 0.3527     | 0.3281    | 0.1645    | 0.5474         | 0.5462         | 0.2088          | 0.3992         | 0.2077         | 0.8420              | 0.2703              | 15    |
+| 0.3506     | 0.3452    | 0.1775    | 0.6449         | 0.5077         | 0.2120          | 0.4032         | 0.2109         | 0.8399              | 0.2703              | 16    |
+| 0.5240     | 0.3363    | 0.1683    | 0.6028         | 0.5077         | 0.2194          | 0.4124         | 0.2183         | 0.8351              | 0.2703              | 17    |
+| 0.2749     | 0.3272    | 0.1678    | 0.6595         | 0.5308         | 0.2191          | 0.4121         | 0.2181         | 0.8352              | 0.2703              | 18    |
+| 0.2489     | 0.3355    | 0.1670    | 0.5318         | 0.5            | 0.2163          | 0.4087         | 0.2153         | 0.8371              | 0.2703              | 19    |
 ### Framework versions

config.json CHANGED Viewed

@@ -10,30 +10,12 @@
   "hidden_dropout_prob": 0.1,
   "hidden_size": 768,
   "id2label": {
-    "0": "LABEL_0",
-    "1": "LABEL_1",
-    "2": "LABEL_2",
-    "3": "LABEL_3",
-    "4": "LABEL_4",
-    "5": "LABEL_5",
-    "6": "LABEL_6",
-    "7": "LABEL_7",
-    "8": "LABEL_8",
-    "9": "LABEL_9"
   },
   "initializer_range": 0.02,
   "intermediate_size": 3072,
   "label2id": {
-    "LABEL_0": 0,
-    "LABEL_1": 1,
-    "LABEL_2": 2,
-    "LABEL_3": 3,
-    "LABEL_4": 4,
-    "LABEL_5": 5,
-    "LABEL_6": 6,
-    "LABEL_7": 7,
-    "LABEL_8": 8,
-    "LABEL_9": 9
   },
   "layer_norm_eps": 1e-12,
   "max_position_embeddings": 512,

   "hidden_dropout_prob": 0.1,
   "hidden_size": 768,
   "id2label": {
+    "0": "LABEL_0"
   },
   "initializer_range": 0.02,
   "intermediate_size": 3072,
   "label2id": {
+    "LABEL_0": 0
   },
   "layer_norm_eps": 1e-12,
   "max_position_embeddings": 512,

tf_model.h5 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:e22a4ec42198041fd75037b5d17133f4323ecdc6e32db4ea73cb46e0a0da8de7
-size 433559864

 version https://git-lfs.github.com/spec/v1
+oid sha256:52dee4962f46730eb59668a2116ca2319fc2d92e489f7a2e093bc0e13c2fcd32
+size 433532180