Upload README.md with huggingface_hub
Browse files
README.md
CHANGED
@@ -1,23 +1,37 @@
|
|
|
|
1 |
---
|
2 |
-
|
3 |
-
|
4 |
-
|
5 |
-
|
6 |
-
|
7 |
-
- text-generation-inference
|
8 |
-
- transformers
|
9 |
-
- unsloth
|
10 |
-
- llama
|
11 |
-
- trl
|
12 |
-
- sft
|
13 |
---
|
14 |
|
15 |
-
# Uploaded
|
16 |
|
17 |
-
-
|
|
|
18 |
- **License:** apache-2.0
|
19 |
-
- **Finetuned from model :**
|
20 |
|
21 |
-
This
|
22 |
|
23 |
[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
|
2 |
---
|
3 |
+
model_size: 1034516480
|
4 |
+
required_memory: 3.85
|
5 |
+
accuracy: 0.518348623853211
|
6 |
+
metrics:
|
7 |
+
- glue-sst2
|
|
|
|
|
|
|
|
|
|
|
|
|
8 |
---
|
9 |
|
10 |
+
# Uploaded model
|
11 |
|
12 |
+
[<img src="https://github.githubassets.com/assets/GitHub-Mark-ea2971cee799.png" width="100"/><img src="https://github.githubassets.com/assets/GitHub-Logo-ee398b662d42.png" width="100"/>](https://github.com/Agnuxo1)
|
13 |
+
- **Developed by:** [Agnuxo](https://github.com/Agnuxo1)
|
14 |
- **License:** apache-2.0
|
15 |
+
- **Finetuned from model :** Agnuxo/Tinytron-TinyLlama
|
16 |
|
17 |
+
This Mistral model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.
|
18 |
|
19 |
[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)
|
20 |
+
|
21 |
+
|
22 |
+
## Benchmark Results
|
23 |
+
|
24 |
+
This model has been fine-tuned for various tasks and evaluated on the following benchmarks:
|
25 |
+
|
26 |
+
### glue-sst2
|
27 |
+
**Accuracy:** 0.5183
|
28 |
+
|
29 |
+

|
30 |
+
|
31 |
+
|
32 |
+
Model Size: 1,034,516,480 parameters
|
33 |
+
Required Memory: 3.85 GB
|
34 |
+
|
35 |
+
For more details, visit my [GitHub](https://github.com/Agnuxo1).
|
36 |
+
|
37 |
+
Thanks for your interest in this model!
|