thrunlab
/

Mistral_Sparse_refined_web_50p_graceful_True

Text Generation

Generated from Trainer

Model card Files Files and versions Community

lukeleeai commited on Mar 10

Commit

0b9082f

•

1 Parent(s): 3d5c6ed

End of training

Files changed (3) hide show

README.md +4 -4
model.safetensors +1 -1
training_args.bin +1 -1

README.md CHANGED Viewed

@@ -13,7 +13,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [](https://huggingface.co/) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 10.3509
 ## Model description
@@ -37,10 +37,10 @@ The following hyperparameters were used during training:
 - eval_batch_size: 16
 - seed: 0
 - distributed_type: multi-GPU
-- num_devices: 4
 - gradient_accumulation_steps: 4
-- total_train_batch_size: 128
-- total_eval_batch_size: 64
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - training_steps: 50

 This model is a fine-tuned version of [](https://huggingface.co/) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 10.3681
 ## Model description
 - eval_batch_size: 16
 - seed: 0
 - distributed_type: multi-GPU
+- num_devices: 2
 - gradient_accumulation_steps: 4
+- total_train_batch_size: 64
+- total_eval_batch_size: 32
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - training_steps: 50

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:988985da06ee494987338be9d9ddaa7a3964bb107dc38cd56310b6cba8508191
 size 16567728

 version https://git-lfs.github.com/spec/v1
+oid sha256:b7c09277a9ec4f90c4f89a0284c2e4f5f2260008b57c1859b7d1f7a321d9a8a2
 size 16567728

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:73493aae5cd2319ab948607df67ef3829917e9e70962ace2966add73d6a70024
 size 4728

 version https://git-lfs.github.com/spec/v1
+oid sha256:3798f9d9af02da627769966d1abab41557ce2db7b5b7418d1fd0f9326ce76ec8
 size 4728