End of training

Files changed (4) hide show

README.md ADDED Viewed

+---
+license: mit
+base_model: gpt2-large
+tags:
+- generated_from_trainer
+model-index:
+- name: test
+  results: []
+---
+<!-- This model card has been generated automatically according to the information the Trainer had access to. You
+should probably proofread and complete it, then remove this comment. -->
+# test
+This model is a fine-tuned version of [gpt2-large](https://huggingface.co/gpt2-large) on an unknown dataset.
+## Model description
+More information needed
+## Intended uses & limitations
+More information needed
+## Training and evaluation data
+More information needed
+## Training procedure
+### Training hyperparameters
+The following hyperparameters were used during training:
+- learning_rate: 2e-05
+- train_batch_size: 16
+- eval_batch_size: 16
+- seed: 42
+- gradient_accumulation_steps: 64
+- total_train_batch_size: 1024
+- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+- lr_scheduler_type: linear
+- num_epochs: 100
+### Training results
+### Framework versions
+- Transformers 4.41.2
+- Pytorch 2.0.1a0+cxx11.abi
+- Datasets 2.20.0
+- Tokenizers 0.19.1

config.json CHANGED Viewed

@@ -1,5 +1,5 @@
 {
-  "_name_or_path": "/home/vc381/rds/hpc-work/09052024-distillBertFineTunningSentiment/distil/test",
   "activation_function": "gelu_new",
   "architectures": [
     "GPT2ForSequenceClassification"

 {
+  "_name_or_path": "gpt2-large",
   "activation_function": "gelu_new",
   "architectures": [
     "GPT2ForSequenceClassification"

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:91f049bdc8d02233a0f92faab042ed730b976e68a63a020d3a3cdfebd16a97fc
 size 3096181368

 version https://git-lfs.github.com/spec/v1
+oid sha256:1910074893d5030b203ebc1ee0c77260735f09c959955ddd8b0c59e1b35391ca
 size 3096181368

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:10a82e5084bb806377ece6eae7b51475bd09802ea801da88502198f9ed6f6855
 size 4603

 version https://git-lfs.github.com/spec/v1
+oid sha256:d3f2b9152fa0f99376f2e6ba0bf32f3a4f4a3b1f54157342f9a650af264d09ba
 size 4603