Update README.md
Browse files
README.md
CHANGED
@@ -92,31 +92,23 @@ Use the code below to get started with the model.
|
|
92 |
|
93 |
#### Training Hyperparameters
|
94 |
|
95 |
-
-
|
96 |
|
97 |
#### Speeds, Sizes, Times [optional]
|
98 |
|
99 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
100 |
|
101 |
-
|
102 |
-
|
103 |
-
## Evaluation
|
104 |
-
|
105 |
-
<!-- This section describes the evaluation protocols and provides the results. -->
|
106 |
|
107 |
-
### Testing Data, Factors & Metrics
|
108 |
|
109 |
-
#### Testing Data
|
110 |
|
111 |
-
<!-- This should link to a Dataset Card if possible. -->
|
112 |
-
|
113 |
-
[More Information Needed]
|
114 |
-
|
115 |
-
#### Factors
|
116 |
-
|
117 |
-
<!-- These are the things the evaluation is disaggregating by, e.g., subpopulations or domains. -->
|
118 |
-
|
119 |
-
[More Information Needed]
|
120 |
|
121 |
#### Metrics
|
122 |
|
|
|
92 |
|
93 |
#### Training Hyperparameters
|
94 |
|
95 |
+
- lora 4bit peft
|
96 |
|
97 |
#### Speeds, Sizes, Times [optional]
|
98 |
|
99 |
+
- global_step=16905
|
100 |
+
- training_loss=1.169401215731269
|
101 |
+
- train_runtime: 21882.4747
|
102 |
+
- train_samples_per_second: 3.09
|
103 |
+
- train_steps_per_second: 0.773
|
104 |
+
- total_flos: 4.437195883099177e+17
|
105 |
+
- train_loss': 1.169401215731269
|
106 |
+
- epoch: 5.0
|
107 |
|
108 |
+
## Evaluation and Metrics
|
|
|
|
|
|
|
|
|
109 |
|
|
|
110 |
|
|
|
111 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
112 |
|
113 |
#### Metrics
|
114 |
|