willherbert27/bert-finetuned-no-context-squad

Files changed (4)

README.md +66 -0
model.safetensors +1 -1
tokenizer.json +2 -16
training_args.bin +1 -1

README.md ADDED

@@ -0,0 +1,66 @@
+---
+license: apache-2.0
+base_model: willherbert27/bert-finetuned-combo-textbook-no-context
+tags:
+- generated_from_trainer
+model-index:
+- name: bert-textbook-no-context-finetuned-squad
+  results: []
+---
+<!-- This model card has been generated automatically according to the information the Trainer had access to. You
+should probably proofread and complete it, then remove this comment. -->
+# bert-textbook-no-context-finetuned-squad
+This model is a fine-tuned version of [willherbert27/bert-finetuned-combo-textbook-no-context](https://huggingface.co/willherbert27/bert-finetuned-combo-textbook-no-context) on an unknown dataset.
+It achieves the following results on the evaluation set:
+- Loss: 4.2753
+## Model description
+More information needed
+## Intended uses & limitations
+More information needed
+## Training and evaluation data
+More information needed
+## Training procedure
+### Training hyperparameters
+The following hyperparameters were used during training:
+- learning_rate: 0.0001
+- train_batch_size: 16
+- eval_batch_size: 16
+- seed: 42
+- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+- lr_scheduler_type: linear
+- num_epochs: 10
+### Training results
+| Training Loss | Epoch | Step  | Validation Loss |
+|:-------------:|:-----:|:-----:|:---------------:|
+| 1.8478        | 1.0   | 8255  | 1.9560          |
+| 1.5614        | 2.0   | 16510 | 1.8805          |
+| 1.3201        | 3.0   | 24765 | 1.8681          |
+| 1.1333        | 4.0   | 33020 | 2.1644          |
+| 0.9384        | 5.0   | 41275 | 2.1056          |
+| 0.778         | 6.0   | 49530 | 2.3509          |
+| 0.6555        | 7.0   | 57785 | 2.7690          |
+| 0.5564        | 8.0   | 66040 | 3.2649          |
+| 0.4772        | 9.0   | 74295 | 3.7807          |
+| 0.4322        | 10.0  | 82550 | 4.2753          |
+### Framework versions
+- Transformers 4.38.2
+- Pytorch 1.13.1+cu116
+- Datasets 2.18.0
+- Tokenizers 0.15.2
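
Note on the hyperparameters in the README above: with `lr_scheduler_type: linear`, a peak rate of 0.0001, and 82550 total steps (the last row of the results table), the learning rate would decay linearly toward zero over training. The card does not mention warmup, so the sketch below assumes zero warmup steps; `linear_lr` is a hypothetical helper written for illustration, not a Transformers function.

```python
# Sketch of a linear learning-rate decay, using values from the README above.
# Assumption: zero warmup steps (the model card does not list any warmup).

PEAK_LR = 1e-4        # learning_rate from the card
TOTAL_STEPS = 82550   # final step in the training results table
STEPS_PER_EPOCH = 8255  # step count after epoch 1 in the results table


def linear_lr(step: int, peak_lr: float = PEAK_LR,
              total_steps: int = TOTAL_STEPS, warmup_steps: int = 0) -> float:
    """Learning rate at `step` for linear warmup followed by linear decay."""
    if step < warmup_steps:
        return peak_lr * step / max(1, warmup_steps)
    remaining = max(0, total_steps - step)
    return peak_lr * remaining / max(1, total_steps - warmup_steps)


if __name__ == "__main__":
    # Print the rate at the start, midpoint, and end of training.
    for epoch in (0, 5, 10):
        step = epoch * STEPS_PER_EPOCH
        print(f"epoch {epoch:2d}  step {step:5d}  lr {linear_lr(step):.2e}")
```

Under these assumptions the rate is halved by the end of epoch 5 and reaches zero at step 82550.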

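Note on the training results in the README above: training loss falls every epoch, while validation loss bottoms out at epoch 3 (1.8681) and climbs to 4.2753 by epoch 10, which is the value the card reports. The short script below makes the gap explicit. The numbers are copied from the table; `best_epoch` is a hypothetical helper.

```python
# Losses per epoch, copied from the "Training results" table in the README.
TRAIN_LOSS = {1: 1.8478, 2: 1.5614, 3: 1.3201, 4: 1.1333, 5: 0.9384,
              6: 0.778, 7: 0.6555, 8: 0.5564, 9: 0.4772, 10: 0.4322}
VAL_LOSS = {1: 1.9560, 2: 1.8805, 3: 1.8681, 4: 2.1644, 5: 2.1056,
            6: 2.3509, 7: 2.7690, 8: 3.2649, 9: 3.7807, 10: 4.2753}


def best_epoch(val_loss: dict) -> int:
    """Return the epoch with the lowest validation loss."""
    return min(val_loss, key=val_loss.get)


if __name__ == "__main__":
    best = best_epoch(VAL_LOSS)
    print(f"lowest validation loss: epoch {best} ({VAL_LOSS[best]})")
    # Show how far validation loss drifts from training loss each epoch.
    for epoch in sorted(VAL_LOSS):
        gap = VAL_LOSS[epoch] - TRAIN_LOSS[epoch]
        print(f"epoch {epoch:2d}  train {TRAIN_LOSS[epoch]:.4f}  "
              f"val {VAL_LOSS[epoch]:.4f}  gap {gap:.4f}")
```

This pattern usually points to overfitting after epoch 3. The commit does not say which checkpoint the uploaded weights come from; if they are from epoch 10, an earlier checkpoint may generalize better. The `load_best_model_at_end` option of `TrainingArguments` is one way to keep the lowest-loss checkpoint, though nothing here indicates whether it was used.
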
model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:885dc0aa708db434257c4055981dd36e5552681be22c0636ae3bbf43475286b2
+oid sha256:65b3545c72bf44c1f3119767c72388b157485910a272241ce5c0f353945ef60c
 size 430908208
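
Note on the model.safetensors change: this is a diff of a Git LFS pointer file, not of the weights themselves. The `oid` (the SHA-256 of the stored file) changes while `size` stays at 430908208 bytes, which is consistent with new weights for the same architecture. The sketch below parses both pointers and reports which fields differ; `parse_lfs_pointer` and `changed_fields` are hypothetical helpers.

```python
# Old and new pointer contents, copied from the diff above.
OLD_POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:885dc0aa708db434257c4055981dd36e5552681be22c0636ae3bbf43475286b2
size 430908208
"""
NEW_POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:65b3545c72bf44c1f3119767c72388b157485910a272241ce5c0f353945ef60c
size 430908208
"""


def parse_lfs_pointer(text: str) -> dict:
    """Split each 'key value' line of a pointer file into a dict entry."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    return fields


def changed_fields(old: dict, new: dict) -> list:
    """Return the sorted keys whose values differ between two pointers."""
    return sorted(k for k in old.keys() | new.keys() if old.get(k) != new.get(k))


if __name__ == "__main__":
    old, new = parse_lfs_pointer(OLD_POINTER), parse_lfs_pointer(NEW_POINTER)
    print("changed:", changed_fields(old, new))
    print("size in bytes:", int(new["size"]))
```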

tokenizer.json CHANGED

@@ -1,21 +1,7 @@
 {
   "version": "1.0",
-  "truncation": {
-    "direction": "Right",
-    "max_length": 384,
-    "strategy": "OnlySecond",
-    "stride": 128
-  },
-  "padding": {
-    "strategy": {
-      "Fixed": 384
-    },
-    "direction": "Right",
-    "pad_to_multiple_of": null,
-    "pad_id": 0,
-    "pad_type_id": 0,
-    "pad_token": "[PAD]"
-  },
+  "truncation": null,
+  "padding": null,
   "added_tokens": [
     {
       "id": 0,

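Note on the tokenizer.json change: the commit removes the truncation and padding settings that had been serialized into the file (max length 384, stride 128, truncating only the second sequence, fixed padding to 384). With both now `null`, any caller that relied on those baked-in defaults would have to pass them explicitly. The sketch below maps the removed settings onto the `truncation`, `max_length`, `stride`, and `padding` keyword arguments that Transformers tokenizers accept. `call_kwargs_from_settings` is a hypothetical helper, and whether downstream code actually needs these values is an assumption.

```python
import json

# Settings removed by this commit, copied from the old side of the diff.
REMOVED_SETTINGS = json.loads("""
{
  "truncation": {"direction": "Right", "max_length": 384,
                 "strategy": "OnlySecond", "stride": 128},
  "padding": {"strategy": {"Fixed": 384}, "direction": "Right",
              "pad_to_multiple_of": null, "pad_id": 0,
              "pad_type_id": 0, "pad_token": "[PAD]"}
}
""")

# Serialized strategy names and the matching Transformers call-time values.
_TRUNCATION = {"LongestFirst": "longest_first",
               "OnlyFirst": "only_first",
               "OnlySecond": "only_second"}


def call_kwargs_from_settings(settings: dict) -> dict:
    """Translate serialized truncation/padding settings into call kwargs."""
    trunc, pad = settings["truncation"], settings["padding"]
    kwargs = {
        "truncation": _TRUNCATION[trunc["strategy"]],
        "max_length": trunc["max_length"],
        "stride": trunc["stride"],
    }
    strategy = pad["strategy"]
    if isinstance(strategy, dict) and "Fixed" in strategy:
        # Fixed padding pads every sequence to a set length; this mapping
        # only holds when that length equals max_length, as it does here.
        kwargs["padding"] = "max_length"
    else:
        kwargs["padding"] = "longest"
    return kwargs


if __name__ == "__main__":
    print(call_kwargs_from_settings(REMOVED_SETTINGS))
```

A caller could then write something like `tokenizer(question, context, **kwargs)`, where `tokenizer`, `question`, and `context` are placeholders for the caller's own objects.
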
training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:0518c512ffb1d36a4ad38e807140f22b2a48633df542555c717101f81dc122a0
+oid sha256:0b91c69210aff88a714117c646a25b28e579cc776a13fe107df10144c0f2cc0a
 size 4475