add model

Browse files

Files changed (11) hide show

.gitignore +1 -0
README.md +76 -0
config.json +29 -0
pytorch_model.bin +3 -0
runs/Sep27_15-32-17_1380f0078246/1632756753.8818982/events.out.tfevents.1632756753.1380f0078246.75.1 +3 -0
runs/Sep27_15-32-17_1380f0078246/events.out.tfevents.1632756753.1380f0078246.75.0 +3 -0
runs/Sep27_15-32-17_1380f0078246/events.out.tfevents.1632757458.1380f0078246.75.2 +3 -0
special_tokens_map.json +1 -0
tokenizer.json +0 -0
tokenizer_config.json +1 -0
training_args.bin +3 -0

.gitignore ADDED Viewed

	@@ -0,0 +1 @@


1	+ checkpoint-*/

README.md ADDED Viewed

	@@ -0,0 +1,76 @@

+---
+license: agpl-3.0
+tags:
+- generated_from_trainer
+datasets:
+- glue
+metrics:
+- matthews_correlation
+model-index:
+- name: XLMR-ENIS-finetuned-cola
+  results:
+  - task:
+      name: Text Classification
+      type: text-classification
+    dataset:
+      name: glue
+      type: glue
+      args: cola
+    metrics:
+    - name: Matthews Correlation
+      type: matthews_correlation
+      value: 0.6306425398187112
+---
+<!-- This model card has been generated automatically according to the information the Trainer had access to. You
+should probably proofread and complete it, then remove this comment. -->
+# XLMR-ENIS-finetuned-cola
+This model is a fine-tuned version of [vesteinn/XLMR-ENIS](https://huggingface.co/vesteinn/XLMR-ENIS) on the glue dataset.
+It achieves the following results on the evaluation set:
+- Loss: 0.7311
+- Matthews Correlation: 0.6306
+## Model description
+More information needed
+## Intended uses & limitations
+More information needed
+## Training and evaluation data
+More information needed
+## Training procedure
+### Training hyperparameters
+The following hyperparameters were used during training:
+- learning_rate: 2e-05
+- train_batch_size: 16
+- eval_batch_size: 16
+- seed: 42
+- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+- lr_scheduler_type: linear
+- num_epochs: 5
+### Training results
+| Training Loss | Epoch | Step | Validation Loss | Matthews Correlation |
+|:-------------:|:-----:|:----:|:---------------:|:--------------------:|
+| 0.5216        | 1.0   | 535  | 0.5836          | 0.4855               |
+| 0.3518        | 2.0   | 1070 | 0.4426          | 0.5962               |
+| 0.2538        | 3.0   | 1605 | 0.5091          | 0.6110               |
+| 0.1895        | 4.0   | 2140 | 0.6955          | 0.6136               |
+| 0.1653        | 5.0   | 2675 | 0.7311          | 0.6306               |
+### Framework versions
+- Transformers 4.10.3
+- Pytorch 1.9.0+cu102
+- Datasets 1.12.1
+- Tokenizers 0.10.3

config.json ADDED Viewed

	@@ -0,0 +1,29 @@

+{
+  "_name_or_path": "vesteinn/XLMR-ENIS",
+  "architectures": [
+    "XLMRobertaForSequenceClassification"
+  ],
+  "attention_probs_dropout_prob": 0.1,
+  "bos_token_id": 0,
+  "classifier_dropout": null,
+  "eos_token_id": 2,
+  "gradient_checkpointing": false,
+  "hidden_act": "gelu",
+  "hidden_dropout_prob": 0.1,
+  "hidden_size": 768,
+  "initializer_range": 0.02,
+  "intermediate_size": 3072,
+  "layer_norm_eps": 1e-05,
+  "max_position_embeddings": 514,
+  "model_type": "xlm-roberta",
+  "num_attention_heads": 12,
+  "num_hidden_layers": 12,
+  "pad_token_id": 1,
+  "position_embedding_type": "absolute",
+  "problem_type": "single_label_classification",
+  "torch_dtype": "float32",
+  "transformers_version": "4.10.3",
+  "type_vocab_size": 1,
+  "use_cache": true,
+  "vocab_size": 50005
+}

pytorch_model.bin ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:260d03a9e6834a9d88e40f64f0ae0b89fb74f742def15f570f0baa1bae332369
+size 497875373

runs/Sep27_15-32-17_1380f0078246/1632756753.8818982/events.out.tfevents.1632756753.1380f0078246.75.1 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:e6ce090e5d9fbd9af372dc47fe0a44158c3978d538303f46b932eea3d9003a1b
+size 4222

runs/Sep27_15-32-17_1380f0078246/events.out.tfevents.1632756753.1380f0078246.75.0 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:5f413a7cdc4466264edcc151b3cd3d50f0816eb4d812d62ea4f4030b25ff55f6
+size 5853

runs/Sep27_15-32-17_1380f0078246/events.out.tfevents.1632757458.1380f0078246.75.2 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:2a9d6b928a070a64c2c75cd41958aa2e2c2e6b3d0500cb5e82ac19e3133edd96
+size 375

special_tokens_map.json ADDED Viewed

	@@ -0,0 +1 @@


1	+ {"bos_token": "<s>", "eos_token": "</s>", "unk_token": "<unk>", "sep_token": "</s>", "pad_token": "<pad>", "cls_token": "<s>", "mask_token": "<mask>"}

tokenizer.json ADDED Viewed

The diff for this file is too large to render. See raw diff

tokenizer_config.json ADDED Viewed

	@@ -0,0 +1 @@

+ {"bos_token": "<s>", "eos_token": "</s>", "sep_token": "</s>", "cls_token": "<s>", "unk_token": "<unk>", "pad_token": "<pad>", "mask_token": {"content": "<mask>", "single_word": false, "lstrip": true, "rstrip": false, "normalized": true, "__type": "AddedToken"}, "special_tokens_map_file": "/root/.cache/huggingface/transformers/0a741e436e3f4773f4fbd20aaad263d91214fcbc60d3b45bb6e39f0d745dba77.0dc5b1041f62041ebbd23b1297f2f573769d5c97d8b7c28180ec86b8f6185aa8", "name_or_path": "vesteinn/XLMR-ENIS", "tokenizer_class": "XLMRobertaTokenizer"}

training_args.bin ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:2c5ddc62e5d25b2b3a59946439b6d2a13785c5f377c3411e7cd68c62bb790867
+size 2671