End of training

Files changed (4) hide show

README.md CHANGED Viewed

@@ -4,18 +4,18 @@ base_model: mistralai/Mistral-7B-v0.1
 tags:
 - generated_from_trainer
 model-index:
-- name: Mistral_Sparse_refined_web_50p_2024-03-21
   results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
-# Mistral_Sparse_refined_web_50p_2024-03-21
 This model is a fine-tuned version of [mistralai/Mistral-7B-v0.1](https://huggingface.co/mistralai/Mistral-7B-v0.1) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 2.1512
 ## Model description
@@ -45,7 +45,7 @@ The following hyperparameters were used during training:
 - total_eval_batch_size: 3
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- training_steps: 501
 ### Training results
@@ -71,6 +71,16 @@ The following hyperparameters were used during training:
 | 2.1997        | 0.05  | 450  | 2.4123          |
 | 2.2937        | 0.06  | 475  | 2.4086          |
 | 2.3067        | 0.06  | 500  | 2.4052          |
 ### Framework versions

 tags:
 - generated_from_trainer
 model-index:
+- name: Mistral_Sparse_refined_web_50p_2024-03-22
   results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
+# Mistral_Sparse_refined_web_50p_2024-03-22
 This model is a fine-tuned version of [mistralai/Mistral-7B-v0.1](https://huggingface.co/mistralai/Mistral-7B-v0.1) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 2.1398
 ## Model description
 - total_eval_batch_size: 3
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- training_steps: 751
 ### Training results
 | 2.1997        | 0.05  | 450  | 2.4123          |
 | 2.2937        | 0.06  | 475  | 2.4086          |
 | 2.3067        | 0.06  | 500  | 2.4052          |
+| 2.312         | 0.06  | 525  | 2.4060          |
+| 2.257         | 0.07  | 550  | 2.4056          |
+| 2.2729        | 0.07  | 575  | 2.4051          |
+| 2.1952        | 0.07  | 600  | 2.4065          |
+| 2.1225        | 0.07  | 625  | 2.3999          |
+| 2.2168        | 0.08  | 650  | 2.4039          |
+| 2.1682        | 0.08  | 675  | 2.4006          |
+| 2.3027        | 0.08  | 700  | 2.4028          |
+| 2.2077        | 0.09  | 725  | 2.4006          |
+| 2.2119        | 0.09  | 750  | 2.3980          |
 ### Framework versions

model-00001-of-00003.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:95729f8aae782d1dc5112c74b463100926416d372874db0b1b025e1ee4f6aacd
 size 4943162336

 version https://git-lfs.github.com/spec/v1
+oid sha256:6b8097c48bc1e2ddb05f69442c852d8358ee17b7f6c1b9af4fa74d261accde7a
 size 4943162336

model-00002-of-00003.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:d1a91505db655a8e3306808cdd365de8f920952ea30ef28ebbdca5864e81b0fd
 size 4999819336

 version https://git-lfs.github.com/spec/v1
+oid sha256:eb1bbeca18b9b8972d60552d9663c0089e2759139994a808566228496b7346bb
 size 4999819336

model-00003-of-00003.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:c7eaf0b6353ca719e0ec6fd4f52c905bc677a752fa7bead4ca9e27509ec8f532
 size 4540516344

 version https://git-lfs.github.com/spec/v1
+oid sha256:2ab648e4e6f6f77b4cfcab71e88116ee670033c5d2e35384f5afb91eccf55abc
 size 4540516344