LumousInTheWild
/

exp_acha_mluke_base_last

Text Classification

Generated from Trainer

Inference Endpoints

Model card Files Files and versions Community

LumousInTheWild commited on May 10

Commit

d657f58

•

1 Parent(s): 3fa39a9

End of training

Files changed (2) hide show

README.md +72 -0
model.safetensors +1 -1

README.md ADDED Viewed

	@@ -0,0 +1,72 @@

+---
+license: apache-2.0
+base_model: studio-ousia/mluke-base
+tags:
+- generated_from_trainer
+model-index:
+- name: exp_acha_mluke_base_last
+  results: []
+---
+<!-- This model card has been generated automatically according to the information the Trainer had access to. You
+should probably proofread and complete it, then remove this comment. -->
+# exp_acha_mluke_base_last
+This model is a fine-tuned version of [studio-ousia/mluke-base](https://huggingface.co/studio-ousia/mluke-base) on the None dataset.
+It achieves the following results on the evaluation set:
+- Loss: 0.2625
+## Model description
+More information needed
+## Intended uses & limitations
+More information needed
+## Training and evaluation data
+More information needed
+## Training procedure
+### Training hyperparameters
+The following hyperparameters were used during training:
+- learning_rate: 1e-06
+- train_batch_size: 20
+- eval_batch_size: 32
+- seed: 42
+- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+- lr_scheduler_type: cosine_with_restarts
+- lr_scheduler_warmup_ratio: 0.45
+- num_epochs: 15
+### Training results
+| Training Loss | Epoch | Step | Validation Loss |
+|:-------------:|:-----:|:----:|:---------------:|
+| 0.6813        | 1.0   | 167  | 0.6868          |
+| 0.6414        | 2.0   | 334  | 0.6080          |
+| 0.5949        | 3.0   | 501  | 0.5543          |
+| 0.5828        | 4.0   | 668  | 0.5174          |
+| 0.5562        | 5.0   | 835  | 0.4876          |
+| 0.4159        | 6.0   | 1002 | 0.4337          |
+| 0.4039        | 7.0   | 1169 | 0.3666          |
+| 0.3607        | 8.0   | 1336 | 0.3284          |
+| 0.3763        | 9.0   | 1503 | 0.2964          |
+| 0.3379        | 10.0  | 1670 | 0.2744          |
+| 0.3169        | 11.0  | 1837 | 0.2677          |
+| 0.2445        | 12.0  | 2004 | 0.2656          |
+| 0.2472        | 13.0  | 2171 | 0.2626          |
+| 0.2644        | 14.0  | 2338 | 0.2629          |
+| 0.2566        | 15.0  | 2505 | 0.2625          |
+### Framework versions
+- Transformers 4.40.2
+- Pytorch 2.2.1+cu121
+- Datasets 2.19.1
+- Tokenizers 0.19.1

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:a39d36ed17973e87882a6c4021a7b9e530641c86085e9462813409d228a65758
 size 2343386936

 version https://git-lfs.github.com/spec/v1
+oid sha256:fc10063d1841624ffbb24fa394bc7cfe25460feb898b1ccda766ba0d088be43e
 size 2343386936