vicgalle's picture
Update README.md
9608d34 verified
|
raw
history blame
No virus
1.06 kB
---
base_model:
- upstage/SOLAR-10.7B-Instruct-v1.0
tags:
- mergekit
- merge
- solar
license: apache-2.0
---
# SOLAR-13B-Instruct-v1.0
This is SOLAR-10.7B, but upscaled to 13B, to optimize VRAM usage of typical GPU cards (a 4bit quant fits in 12GB).
Evaluations coming soon!
This is a frankenmerge model created using [mergekit](https://github.com/cg123/mergekit).
## Merge Details
### Merge Method
This model was merged using the passthrough merge method.
### Models Merged
The following models were included in the merge:
* [upstage/SOLAR-10.7B-Instruct-v1.0](https://huggingface.co/upstage/SOLAR-10.7B-Instruct-v1.0)
### Configuration
The following YAML configuration was used to produce this model:
```yaml
slices:
- sources:
- model: upstage/SOLAR-10.7B-Instruct-v1.0
layer_range: [0, 28]
- sources:
- model: upstage/SOLAR-10.7B-Instruct-v1.0
layer_range: [20, 48]
merge_method: passthrough
dtype: float16
```
### Prompt template
The same as in SOLAR-10.7B:
```
<s> ### User:
{prompt}
### Assistant:
{response}</s>
```