ksirts committed on
Commit b0595c0
1 Parent(s): 99bd304

Upload 7 files
README.md CHANGED
@@ -1,3 +1,137 @@
  ---
- license: apache-2.0
  ---
  ---
+ tags:
+ - generated_from_trainer
+ datasets:
+ - sentiment_reduced
+ metrics:
+ - accuracy
+ model-index:
+ - name: estbert128_lr5e-5_b64_s2
+   results:
+   - task:
+       name: Text Classification
+       type: text-classification
+     dataset:
+       name: sentiment_reduced
+       type: sentiment_reduced
+       args: sentiment_reduced
+     metrics:
+     - name: Accuracy
+       type: accuracy
+       value: 0.7926136255264282
  ---
+
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
+ should probably proofread and complete it, then remove this comment. -->
+
+ # estbert128_lr5e-5_b64_s2
+
+ This model is a fine-tuned version of [tartuNLP/EstBERT](https://huggingface.co/tartuNLP/EstBERT) on the sentiment_reduced dataset.
+ It achieves the following results on the evaluation set:
+ - Loss: 2.2440
+ - Accuracy: 0.7926
+
+ ## Model description
+
+ More information needed
+
+ ## Intended uses & limitations
+
+ More information needed
+
+ ## Training and evaluation data
+
+ More information needed
+
+ ## Training procedure
+
+ ### Training hyperparameters
+
+ The following hyperparameters were used during training:
+ - learning_rate: 5e-05
+ - train_batch_size: 16
+ - eval_batch_size: 16
+ - seed: 2
+ - gradient_accumulation_steps: 4
+ - total_train_batch_size: 64
+ - optimizer: Adam with betas=(0.9,0.98) and epsilon=1e-06
+ - lr_scheduler_type: polynomial
+ - num_epochs: 100
+ - mixed_precision_training: Native AMP
+
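The batch-size arithmetic above can be sketched in a few lines; this is a sketch under the assumption of standard Hugging Face Trainer semantics, and `power` and `end_lr` in the scheduler are the Trainer's defaults, not values stated in this card:

```python
# Effective batch size: gradients from 4 micro-batches of 16 are
# accumulated before each optimizer step, so one update sees 64 examples.
train_batch_size = 16
gradient_accumulation_steps = 4
total_train_batch_size = train_batch_size * gradient_accumulation_steps  # 64

def polynomial_lr(step, total_steps, base_lr=5e-5, power=1.0, end_lr=0.0):
    # Polynomial decay; with power=1.0 (the Trainer default) this is
    # linear decay from base_lr down to end_lr over total_steps updates.
    frac = 1.0 - step / total_steps
    return (base_lr - end_lr) * frac ** power + end_lr
```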
+ ### Training results
+
+ | Training Loss | Epoch | Step | Validation Loss | Accuracy |
+ |:-------------:|:-----:|:----:|:---------------:|:--------:|
+ | 0.836         | 0.99  | 38   | 0.6966          | 0.7216   |
+ | 0.5336        | 1.99  | 76   | 0.5948          | 0.7699   |
+ | 0.2913        | 2.99  | 114  | 0.7197          | 0.7358   |
+ | 0.1048        | 3.99  | 152  | 0.9570          | 0.7557   |
+ | 0.0424        | 4.99  | 190  | 1.2144          | 0.7528   |
+ | 0.0262        | 5.99  | 228  | 1.2675          | 0.7727   |
+ | 0.0169        | 6.99  | 266  | 1.4788          | 0.7500   |
+ | 0.0048        | 7.99  | 304  | 1.5053          | 0.7699   |
+ | 0.0084        | 8.99  | 342  | 1.5368          | 0.7614   |
+ | 0.0087        | 9.99  | 380  | 1.6678          | 0.7699   |
+ | 0.0082        | 10.99 | 418  | 1.7598          | 0.7642   |
+ | 0.0104        | 11.99 | 456  | 1.6951          | 0.7528   |
+ | 0.0115        | 12.99 | 494  | 1.7123          | 0.7727   |
+ | 0.0111        | 13.99 | 532  | 1.7577          | 0.7528   |
+ | 0.0028        | 14.99 | 570  | 1.7383          | 0.7727   |
+ | 0.0032        | 15.99 | 608  | 2.0254          | 0.7727   |
+ | 0.0107        | 16.99 | 646  | 2.2123          | 0.7415   |
+ | 0.0056        | 17.99 | 684  | 1.9406          | 0.7614   |
+ | 0.0078        | 18.99 | 722  | 2.2002          | 0.7642   |
+ | 0.0041        | 19.99 | 760  | 2.0157          | 0.7670   |
+ | 0.0087        | 20.99 | 798  | 2.1228          | 0.7642   |
+ | 0.0113        | 21.99 | 836  | 2.3692          | 0.7727   |
+ | 0.0025        | 22.99 | 874  | 2.2211          | 0.7500   |
+ | 0.0083        | 23.99 | 912  | 2.2120          | 0.7841   |
+ | 0.0104        | 24.99 | 950  | 2.1478          | 0.7614   |
+ | 0.0041        | 25.99 | 988  | 2.1118          | 0.7756   |
+ | 0.002         | 26.99 | 1026 | 1.9929          | 0.7699   |
+ | 0.001         | 27.99 | 1064 | 2.0295          | 0.7841   |
+ | 0.003         | 28.99 | 1102 | 2.3142          | 0.7699   |
+ | 0.006         | 29.99 | 1140 | 2.2957          | 0.7642   |
+ | 0.0005        | 30.99 | 1178 | 2.0661          | 0.7642   |
+ | 0.0007        | 31.99 | 1216 | 2.4220          | 0.7614   |
+ | 0.0088        | 32.99 | 1254 | 2.2842          | 0.7614   |
+ | 0.0           | 33.99 | 1292 | 2.4060          | 0.7585   |
+ | 0.0           | 34.99 | 1330 | 2.2088          | 0.7585   |
+ | 0.0           | 35.99 | 1368 | 2.2181          | 0.7614   |
+ | 0.0           | 36.99 | 1406 | 2.2560          | 0.7784   |
+ | 0.0           | 37.99 | 1444 | 2.4803          | 0.7585   |
+ | 0.0           | 38.99 | 1482 | 2.1163          | 0.7812   |
+ | 0.0087        | 39.99 | 1520 | 2.3410          | 0.7500   |
+ | 0.0021        | 40.99 | 1558 | 2.3583          | 0.7500   |
+ | 0.0054        | 41.99 | 1596 | 2.3546          | 0.7642   |
+ | 0.0051        | 42.99 | 1634 | 2.2295          | 0.7812   |
+ | 0.0           | 43.99 | 1672 | 2.2440          | 0.7926   |
+ | 0.0019        | 44.99 | 1710 | 2.3248          | 0.7784   |
+ | 0.0044        | 45.99 | 1748 | 2.3058          | 0.7841   |
+ | 0.0006        | 46.99 | 1786 | 2.3588          | 0.7784   |
+ | 0.0007        | 47.99 | 1824 | 2.6541          | 0.7670   |
+ | 0.0001        | 48.99 | 1862 | 2.4621          | 0.7614   |
+ | 0.0           | 49.99 | 1900 | 2.4696          | 0.7727   |
+ | 0.0           | 50.99 | 1938 | 2.4981          | 0.7670   |
+ | 0.0031        | 51.99 | 1976 | 2.6702          | 0.7670   |
+ | 0.0           | 52.99 | 2014 | 2.4448          | 0.7756   |
+ | 0.0           | 53.99 | 2052 | 2.4214          | 0.7756   |
+ | 0.0           | 54.99 | 2090 | 2.4308          | 0.7841   |
+ | 0.0001        | 55.99 | 2128 | 2.5869          | 0.7642   |
+ | 0.0007        | 56.99 | 2166 | 2.4803          | 0.7727   |
+ | 0.0           | 57.99 | 2204 | 2.4557          | 0.7784   |
+ | 0.0           | 58.99 | 2242 | 2.4702          | 0.7784   |
+ | 0.0           | 59.99 | 2280 | 2.5165          | 0.7784   |
+ | 0.0013        | 60.99 | 2318 | 2.6322          | 0.7727   |
+ | 0.0001        | 61.99 | 2356 | 2.6253          | 0.7756   |
+ | 0.0011        | 62.99 | 2394 | 2.6303          | 0.7841   |
+ | 0.0002        | 63.99 | 2432 | 2.5646          | 0.7614   |
+
+
+ ### Framework versions
+
+ - Transformers 4.14.1
+ - Pytorch 1.10.1+cu113
+ - Datasets 1.16.1
+ - Tokenizers 0.10.3
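A minimal usage sketch for the card above. Note that the Hub repository id used as the default below is a guess (the card does not state the final repo path), so treat it as a hypothetical placeholder and substitute the actual id or a local checkpoint directory:

```python
def load_sentiment_pipeline(model_id="ksirts/estbert128_lr5e-5_b64_s2"):
    """Load the fine-tuned classifier as a text-classification pipeline.

    The default model_id is a hypothetical repo path; pass the real Hub
    id or a local checkpoint directory. Requires `transformers` and
    network/disk access, so the import is kept inside the function.
    """
    from transformers import pipeline
    return pipeline("text-classification", model=model_id)

# Example (downloads roughly 500 MB of weights on first use):
# clf = load_sentiment_pipeline()
# clf("See film oli suurepärane!")  # Estonian: "This film was great!"
```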
config.json ADDED
@@ -0,0 +1,40 @@
+ {
+   "_name_or_path": "tartuNLP/EstBERT",
+   "architectures": [
+     "BertForSequenceClassification"
+   ],
+   "attention_probs_dropout_prob": 0.1,
+   "bos_token_id": 0,
+   "classifier_dropout": null,
+   "eos_token_ids": 0,
+   "gradient_checkpointing": false,
+   "hidden_act": "gelu",
+   "hidden_dropout_prob": 0.1,
+   "hidden_size": 768,
+   "id2label": {
+     "0": "negatiivne",
+     "1": "neutraalne",
+     "2": "positiivne"
+   },
+   "initializer_range": 0.02,
+   "intermediate_size": 3072,
+   "label2id": {
+     "negatiivne": 0,
+     "neutraalne": 1,
+     "positiivne": 2
+   },
+   "layer_norm_eps": 1e-12,
+   "max_position_embeddings": 512,
+   "model_type": "bert",
+   "num_attention_heads": 12,
+   "num_hidden_layers": 12,
+   "output_past": true,
+   "pad_token_id": 0,
+   "position_embedding_type": "absolute",
+   "problem_type": "single_label_classification",
+   "torch_dtype": "float32",
+   "transformers_version": "4.14.1",
+   "type_vocab_size": 2,
+   "use_cache": true,
+   "vocab_size": 50000
+ }
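A small sketch of how the `id2label` map in this config turns raw classifier scores into the Estonian sentiment labels (negative / neutral / positive). The logits below are made-up illustration values, not real model output:

```python
# Label map taken from config.json above.
id2label = {0: "negatiivne", 1: "neutraalne", 2: "positiivne"}

def predict_label(logits):
    # Index of the highest class score, looked up in the label map.
    best = max(range(len(logits)), key=lambda i: logits[i])
    return id2label[best]

print(predict_label([-1.2, 0.3, 2.1]))  # → positiivne
```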
pytorch_model.bin ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:839b854121412990ae68b7979977fcf2a15d35d9051f3c82c9cfb0d433a92597
+ size 497858733
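The three lines above are a Git LFS pointer file, not the model weights themselves: `oid` is the SHA-256 of the blob's raw bytes and `size` is its byte length. A sketch of that relationship, hashing dummy bytes since the real 497 MB `pytorch_model.bin` is not on hand:

```python
import hashlib

# Build a pointer for a stand-in blob, the same way Git LFS does:
# SHA-256 over the raw file bytes, plus the byte count.
blob = b"dummy model weights"
pointer = (
    "version https://git-lfs.github.com/spec/v1\n"
    f"oid sha256:{hashlib.sha256(blob).hexdigest()}\n"
    f"size {len(blob)}\n"
)
print(pointer)
```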
special_tokens_map.json ADDED
@@ -0,0 +1 @@
+ {"unk_token": "[UNK]", "sep_token": "[SEP]", "pad_token": "[PAD]", "cls_token": "[CLS]", "mask_token": "[MASK]"}
tokenizer.json ADDED
The diff for this file is too large to render. See raw diff
 
tokenizer_config.json ADDED
@@ -0,0 +1 @@
+ {"do_lower_case": true, "unk_token": "[UNK]", "sep_token": "[SEP]", "pad_token": "[PAD]", "cls_token": "[CLS]", "mask_token": "[MASK]", "tokenize_chinese_chars": true, "strip_accents": null, "special_tokens_map_file": null, "full_tokenizer_file": null, "name_or_path": "tartuNLP/EstBERT", "do_basic_tokenize": true, "never_split": null, "tokenizer_class": "BertTokenizer"}
vocab.txt ADDED
The diff for this file is too large to render. See raw diff