datasets:
- skvarre/sv-instruct-v1
---

# Model Card for Model ID

<!-- Provide a quick summary of what the model is/does. -->

## Model Details

Finetune of [gpt-sw3-6.7b-v2](https://huggingface.co/AI-Sweden-Models/gpt-sw3-6.7b-v2) using **LoRA** with 4-bit quantization. The LoRA adapters were merged back into the base model, with the merged weights stored as `bfloat16` tensors.
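The merge step folds the learned low-rank update into the frozen base weights. A toy sketch of the arithmetic (shapes and values are made up; in practice this is done per target layer, e.g. via PEFT's `merge_and_unload()`):

```python
import torch

# LoRA learns a low-rank update delta_W = (alpha / r) * B @ A,
# which at merge time is added into the frozen base weight W.
d_out, d_in, r, alpha = 16, 16, 4, 8

W = torch.randn(d_out, d_in, dtype=torch.bfloat16)  # frozen base weight
A = torch.randn(r, d_in)                            # LoRA down-projection
B = torch.randn(d_out, r)                           # LoRA up-projection

delta = (alpha / r) * (B @ A)
# Accumulate in float32, then store the merged weight as bfloat16,
# mirroring how this checkpoint keeps bfloat16 tensors after merging.
W_merged = (W.float() + delta).to(torch.bfloat16)

assert W_merged.shape == W.shape and W_merged.dtype == torch.bfloat16
```

After merging, the model no longer needs the adapter weights or the PEFT runtime at inference time.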
## Usage

This model is a finetuning experiment; usage instructions will be provided later.
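Until official instructions are published, loading the merged checkpoint would presumably follow the standard `transformers` pattern. A hypothetical, untested sketch (the repo id below is a placeholder, not the actual model id):

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Placeholder: the final repo id for the merged model is not stated in the card.
MODEL_ID = "path/to/merged-model"

def generate(prompt: str, max_new_tokens: int = 100) -> str:
    """Load the merged checkpoint in bfloat16 and sample a completion."""
    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(MODEL_ID, torch_dtype=torch.bfloat16)
    inputs = tokenizer(prompt, return_tensors="pt")
    output = model.generate(
        **inputs, max_new_tokens=max_new_tokens, do_sample=True, top_p=0.9
    )
    return tokenizer.decode(output[0], skip_special_tokens=True)

if __name__ == "__main__":
    # Swedish prompt: "Write a short poem about autumn."
    print(generate("Skriv en kort dikt om hösten."))
```

Since the adapters are already merged, no PEFT-specific loading should be required.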
## Evaluation

ScandEval benchmarks:

| Benchmark | Score |
| --- | --- |
| mmlu-sv | 5.45 ± 0.91 / 28.14 ± 0.82 |
| hellaswag-sv | 27.95 ± 0.73 / 4.19 ± 0.94 |
| speed | 5322.20 ± 1132.75 / 1280.06 ± 408.08 |