khiepm209 committed
Commit 1a1f09c
1 Parent(s): 3b65685

Model save

README.md ADDED
@@ -0,0 +1,61 @@
+ ---
+ license: apache-2.0
+ base_model: meta-math/MetaMath-Mistral-7B
+ tags:
+ - generated_from_trainer
+ model-index:
+ - name: math-mistral-7b-r32
+   results: []
+ ---
+
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
+ should probably proofread and complete it, then remove this comment. -->
+
+ # math-mistral-7b-r32
+
+ This model is a fine-tuned version of [meta-math/MetaMath-Mistral-7B](https://huggingface.co/meta-math/MetaMath-Mistral-7B) on an unknown dataset.
+ It achieves the following results on the evaluation set:
+ - Loss: 0.4624
+
+ ## Model description
+
+ More information needed
+
+ ## Intended uses & limitations
+
+ More information needed
+
+ ## Training and evaluation data
+
+ More information needed
+
+ ## Training procedure
+
+ ### Training hyperparameters
+
+ The following hyperparameters were used during training:
+ - learning_rate: 2.55e-05
+ - train_batch_size: 4
+ - eval_batch_size: 2
+ - seed: 42
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+ - lr_scheduler_type: linear
+ - num_epochs: 1
+ - mixed_precision_training: Native AMP
+
+ ### Training results
+
+ | Training Loss | Epoch | Step | Validation Loss |
+ |:-------------:|:-----:|:----:|:---------------:|
+ | 0.6354 | 0.21 | 1000 | 0.4892 |
+ | 0.5874 | 0.42 | 2000 | 0.4789 |
+ | 0.3006 | 0.63 | 3000 | 0.4694 |
+ | 0.2786 | 0.84 | 4000 | 0.4624 |
+
+
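Review note: the card gives the peak learning rate (2.55e-05), a linear scheduler, one epoch, and four logged (epoch, step) pairs, but not the total step count or any warmup. The sketch below estimates the steps per epoch from the table and evaluates a linear-decay schedule at each logged step. It is an approximation under stated assumptions, not the trainer's actual log: `warmup_steps=0` is assumed because the card lists no warmup, the epoch column is rounded to two decimals, and the helper names are hypothetical.

```python
# Values copied from the model card in this commit.
PEAK_LR = 2.55e-05
LOGGED = [(0.21, 1000), (0.42, 2000), (0.63, 3000), (0.84, 4000)]  # (epoch, step)


def estimate_steps_per_epoch(logged):
    """Average the step/epoch ratio over the logged rows (epochs are rounded)."""
    ratios = [step / epoch for epoch, step in logged]
    return round(sum(ratios) / len(ratios))


def linear_lr(step, total_steps, peak_lr, warmup_steps=0):
    """Linear warmup followed by linear decay to zero.

    warmup_steps=0 is an assumption; the card does not report a warmup setting.
    """
    if step < warmup_steps:
        return peak_lr * step / max(1, warmup_steps)
    remaining = max(0, total_steps - step)
    return peak_lr * remaining / max(1, total_steps - warmup_steps)


# num_epochs is 1, so the per-epoch estimate is also the total step estimate.
total_steps = estimate_steps_per_epoch(LOGGED)

if __name__ == "__main__":
    print(f"estimated total steps: {total_steps}")
    for epoch, step in LOGGED:
        lr = linear_lr(step, total_steps, PEAK_LR)
        print(f"epoch {epoch:.2f}, step {step}: lr ~ {lr:.3e}")
```

Under these assumptions the last logged evaluation (step 4000) falls in the final fifth of the run, where the learning rate has already decayed to a small fraction of its peak.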
+ ### Framework versions
+
+ - Transformers 4.36.0.dev0
+ - Pytorch 2.0.0
+ - Datasets 2.11.0
+ - Tokenizers 0.14.1
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:1907d49ca8e0a364371915abd647bc3cf147773643baf4ae4219b6982a892fbe
+ oid sha256:821db9c11155aed97a78fc9511c408529dc9ac62af742b99a9f5c3ca9601c86a
  size 335604696
runs/Nov14_01-30-34_27f08a13a070/events.out.tfevents.1699925479.27f08a13a070.27.0 CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:52259a1548f404640029c8701e7b8fa76d922b2bef0779fb8528e2904eceaa2d
- size 743302
+ oid sha256:d5e11e297ef3646af9588f45bdc2fa0ad68a40aa3222e49c58863ba3db99730d
+ size 752291
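
Review note: both binary files in this commit are stored as Git LFS pointers, so the diff only shows the `oid` and `size` fields changing. The following sketch parses the pointer texts shown in this commit and summarizes what changed. The function names `parse_lfs_pointer` and `describe_change` are hypothetical helpers written for this note, not part of Git LFS or the Hub tooling.

```python
def parse_lfs_pointer(text):
    """Parse a Git LFS pointer file ("key value" per line) into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def describe_change(old, new):
    """Report whether the content hash changed and by how many bytes the size moved."""
    return {
        "content_changed": old["digest"] != new["digest"],
        "size_delta": new["size"] - old["size"],
    }


# Pointer contents copied from the two diffs in this commit.
ADAPTER_OLD = """version https://git-lfs.github.com/spec/v1
oid sha256:1907d49ca8e0a364371915abd647bc3cf147773643baf4ae4219b6982a892fbe
size 335604696
"""
ADAPTER_NEW = """version https://git-lfs.github.com/spec/v1
oid sha256:821db9c11155aed97a78fc9511c408529dc9ac62af742b99a9f5c3ca9601c86a
size 335604696
"""
EVENTS_OLD = """version https://git-lfs.github.com/spec/v1
oid sha256:52259a1548f404640029c8701e7b8fa76d922b2bef0779fb8528e2904eceaa2d
size 743302
"""
EVENTS_NEW = """version https://git-lfs.github.com/spec/v1
oid sha256:d5e11e297ef3646af9588f45bdc2fa0ad68a40aa3222e49c58863ba3db99730d
size 752291
"""

adapter_change = describe_change(parse_lfs_pointer(ADAPTER_OLD), parse_lfs_pointer(ADAPTER_NEW))
events_change = describe_change(parse_lfs_pointer(EVENTS_OLD), parse_lfs_pointer(EVENTS_NEW))

if __name__ == "__main__":
    print("adapter_model.safetensors:", adapter_change)
    print("tfevents log:", events_change)
```

The adapter file keeps the same byte size with a new hash, which is consistent with updated weights of the same shape, while the TensorBoard event log grew, which is consistent with additional logged steps.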