thrunlab/Mistral_Sparse_refined_web_50p_graceful_True


lukeleeai committed on Mar 10

Commit 6b47f51 • 1 Parent(s): cd0152a

End of training

Files changed (3)

README.md +1 -1
model.safetensors +1 -1
sparsification_sftt.py +2 -2

README.md CHANGED

@@ -13,7 +13,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [](https://huggingface.co/) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 10.3587
+- Loss: 10.3729
 ## Model description

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:eaf4b5f5128df627b2a98fa0fe2ef9caf0eeeff68559ab8f983b496c9a21bd2f
+oid sha256:6b5c447da43183c5ec31a944e37d7519ace58e6fafd1316f0b609a5092e4a471
 size 16567728
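
In the pointer diff above, the oid changes while the size stays at 16567728 bytes, which is consistent with the weights being retrained without any change to tensor shapes or dtypes. A minimal sketch of that comparison, assuming the usual Git LFS pointer layout of space-separated `key value` lines; `parse_lfs_pointer` and `describe_change` are hypothetical helper names and not part of any library:

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse a Git LFS pointer file into a dict of its key/value lines."""
    fields = {}
    for line in text.strip().splitlines():
        # Each pointer line is "<key> <value>"; split on the first space only.
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    return fields


def describe_change(old: dict, new: dict) -> str:
    """Summarize how two pointers differ by content hash and byte size."""
    if old["oid"] == new["oid"]:
        return "unchanged"
    if old["size"] == new["size"]:
        return "content changed, size unchanged"
    return "content and size changed"


# The two pointer versions shown in the diff for model.safetensors.
OLD = """version https://git-lfs.github.com/spec/v1
oid sha256:eaf4b5f5128df627b2a98fa0fe2ef9caf0eeeff68559ab8f983b496c9a21bd2f
size 16567728
"""
NEW = """version https://git-lfs.github.com/spec/v1
oid sha256:6b5c447da43183c5ec31a944e37d7519ace58e6fafd1316f0b609a5092e4a471
size 16567728
"""

print(describe_change(parse_lfs_pointer(OLD), parse_lfs_pointer(NEW)))
```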

sparsification_sftt.py CHANGED

@@ -571,13 +571,13 @@ class GracefulRegularizationScheduler(TrainerCallback):
             if is_mainprocess():
                 current_steps = self.start_steps + state.global_step
                 ds_print(
-                    f"Saving to /scr/lukeai/{self.model_name}_{current_steps}.pt",
+                    f"Saving to /scr/lukeai/{self.model_name}_{current_steps}_ckpt.pt",
                 )
                 # save_state_dict(model, f"/scr/lukeai/{self.model_name}_{state.global_step}.pt")
                 print("Saving a model...")
                 torch.save(
                     model.state_dict(),
-                    f"/scr/lukeai/{self.model_name}_{current_steps}.pt",
+                    f"/scr/lukeai/{self.model_name}_{current_steps}_ckpt.pt",
                 )
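
The hunk above has to change the same filename in two places: once in the log message and once in the `torch.save` call. One way to keep the two from drifting apart is to build the path once and reuse it. This is only a sketch and not the code in this commit; `checkpoint_path` is a hypothetical helper, and the `base_dir` default simply mirrors the directory that appears in the diff:

```python
from pathlib import PurePosixPath


def checkpoint_path(
    model_name: str,
    start_steps: int,
    global_step: int,
    base_dir: str = "/scr/lukeai",  # assumption: same directory as in the diff
) -> PurePosixPath:
    """Build the checkpoint filename used for both logging and saving."""
    # Steps from earlier runs are added so a resumed run keeps counting up.
    current_steps = start_steps + global_step
    return PurePosixPath(base_dir) / f"{model_name}_{current_steps}_ckpt.pt"


# Illustrative values only; the real ones come from the callback's state.
path = checkpoint_path("demo_model", start_steps=1000, global_step=250)
print(f"Saving to {path}")
# In the callback, the same value would then be handed to the save call,
# e.g. torch.save(model.state_dict(), str(path)).
```

With a helper like this, a later change to the naming scheme (such as the `_ckpt` suffix added here) touches a single line.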