Quantization made by Richard Erkhov.
StarDust-12b-v1 - GGUF
- Model creator: https://huggingface.co/Luni/
- Original model: https://huggingface.co/Luni/StarDust-12b-v1/
Name | Quant method | Size |
---|---|---|
StarDust-12b-v1.Q2_K.gguf | Q2_K | 4.46GB |
StarDust-12b-v1.IQ3_XS.gguf | IQ3_XS | 4.94GB |
StarDust-12b-v1.IQ3_S.gguf | IQ3_S | 5.18GB |
StarDust-12b-v1.Q3_K_S.gguf | Q3_K_S | 5.15GB |
StarDust-12b-v1.IQ3_M.gguf | IQ3_M | 5.33GB |
StarDust-12b-v1.Q3_K.gguf | Q3_K | 5.67GB |
StarDust-12b-v1.Q3_K_M.gguf | Q3_K_M | 5.67GB |
StarDust-12b-v1.Q3_K_L.gguf | Q3_K_L | 6.11GB |
StarDust-12b-v1.IQ4_XS.gguf | IQ4_XS | 6.33GB |
StarDust-12b-v1.Q4_0.gguf | Q4_0 | 6.59GB |
StarDust-12b-v1.IQ4_NL.gguf | IQ4_NL | 6.65GB |
StarDust-12b-v1.Q4_K_S.gguf | Q4_K_S | 6.63GB |
StarDust-12b-v1.Q4_K.gguf | Q4_K | 6.96GB |
StarDust-12b-v1.Q4_K_M.gguf | Q4_K_M | 6.96GB |
StarDust-12b-v1.Q4_1.gguf | Q4_1 | 7.26GB |
StarDust-12b-v1.Q5_0.gguf | Q5_0 | 7.93GB |
StarDust-12b-v1.Q5_K_S.gguf | Q5_K_S | 7.93GB |
StarDust-12b-v1.Q5_K.gguf | Q5_K | 8.13GB |
StarDust-12b-v1.Q5_K_M.gguf | Q5_K_M | 8.13GB |
StarDust-12b-v1.Q5_1.gguf | Q5_1 | 8.61GB |
StarDust-12b-v1.Q6_K.gguf | Q6_K | 9.37GB |
StarDust-12b-v1.Q8_0.gguf | Q8_0 | 12.13GB |
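As a rough aid for choosing a file from the table above, the sketch below picks the largest quant whose file size fits a given memory budget. The sizes are copied from the table; the `pick_quant` helper and the 1.5 GB default headroom (a stand-in for context cache and runtime overhead) are assumptions for illustration, not a recommendation from the quantizer or the model author.

```python
# File sizes in GB, copied from the table above.
QUANT_SIZES_GB = {
    "Q2_K": 4.46, "IQ3_XS": 4.94, "IQ3_S": 5.18, "Q3_K_S": 5.15,
    "IQ3_M": 5.33, "Q3_K": 5.67, "Q3_K_M": 5.67, "Q3_K_L": 6.11,
    "IQ4_XS": 6.33, "Q4_0": 6.59, "IQ4_NL": 6.65, "Q4_K_S": 6.63,
    "Q4_K": 6.96, "Q4_K_M": 6.96, "Q4_1": 7.26, "Q5_0": 7.93,
    "Q5_K_S": 7.93, "Q5_K": 8.13, "Q5_K_M": 8.13, "Q5_1": 8.61,
    "Q6_K": 9.37, "Q8_0": 12.13,
}


def pick_quant(budget_gb, headroom_gb=1.5):
    """Return the name of the largest quant that fits, or None.

    headroom_gb is a hypothetical allowance for everything that is not
    model weights; adjust it for your own setup.
    """
    limit = budget_gb - headroom_gb
    fitting = {name: size for name, size in QUANT_SIZES_GB.items() if size <= limit}
    if not fitting:
        return None
    return max(fitting, key=fitting.get)


def quant_filename(name):
    """Build the file name using the naming pattern shown in the table."""
    return f"StarDust-12b-v1.{name}.gguf"
```

For example, `pick_quant(12.0)` leaves a 10.5 GB limit and selects `Q6_K`, while a budget below the smallest file returns `None`.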
Original model description:
license: apache-2.0
tags:
- chat
base_model:
- Gryphe/Pantheon-RP-1.6-12b-Nemo
- Sao10K/MN-12B-Lyra-v3
- anthracite-org/magnum-v2.5-12b-kto
- nbeerbower/mistral-nemo-bophades-12B
pipeline_tag: text-generation
model-index:
- name: StarDust-12b-v1
  results:
  - task:
      type: text-generation
      name: Text Generation
    dataset:
      name: IFEval (0-Shot)
      type: HuggingFaceH4/ifeval
      args:
        num_few_shot: 0
    metrics:
    - type: inst_level_strict_acc and prompt_level_strict_acc
      value: 54.59
      name: strict accuracy
    source:
      url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=Luni/StarDust-12b-v1
      name: Open LLM Leaderboard
  - task:
      type: text-generation
      name: Text Generation
    dataset:
      name: BBH (3-Shot)
      type: BBH
      args:
        num_few_shot: 3
    metrics:
    - type: acc_norm
      value: 34.45
      name: normalized accuracy
    source:
      url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=Luni/StarDust-12b-v1
      name: Open LLM Leaderboard
  - task:
      type: text-generation
      name: Text Generation
    dataset:
      name: MATH Lvl 5 (4-Shot)
      type: hendrycks/competition_math
      args:
        num_few_shot: 4
    metrics:
    - type: exact_match
      value: 5.97
      name: exact match
    source:
      url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=Luni/StarDust-12b-v1
      name: Open LLM Leaderboard
  - task:
      type: text-generation
      name: Text Generation
    dataset:
      name: GPQA (0-shot)
      type: Idavidrein/gpqa
      args:
        num_few_shot: 0
    metrics:
    - type: acc_norm
      value: 3.47
      name: acc_norm
    source:
      url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=Luni/StarDust-12b-v1
      name: Open LLM Leaderboard
  - task:
      type: text-generation
      name: Text Generation
    dataset:
      name: MuSR (0-shot)
      type: TAUR-Lab/MuSR
      args:
        num_few_shot: 0
    metrics:
    - type: acc_norm
      value: 13.76
      name: acc_norm
    source:
      url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=Luni/StarDust-12b-v1
      name: Open LLM Leaderboard
  - task:
      type: text-generation
      name: Text Generation
    dataset:
      name: MMLU-PRO (5-shot)
      type: TIGER-Lab/MMLU-Pro
      config: main
      split: test
      args:
        num_few_shot: 5
    metrics:
    - type: acc
      value: 26.8
      name: accuracy
    source:
      url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=Luni/StarDust-12b-v1
      name: Open LLM Leaderboard
StarDust-12b-v1
Quants
- GGUF: mradermacher/StarDust-12b-v1-GGUF
- weighted/imatrix GGUF: mradermacher/StarDust-12b-v1-i1-GGUF
- exl2: lucyknada/Luni_StarDust-12b-v1-exl2
Description | Usecase
In my opinion, this merge produces more vibrant and less generic sonnet-inspired prose, and it can be gentle or harsh when asked. I've been trying to get more spice while also compensating for an issue I had with Magnum-v2.5 on my end: it simply won't stop yapping.
- This model is intended to be used as a Role-playing model.
- Its direct conversational output is... I can't even say it's luck, it's just not made for it.
- Extension to conversational output: the model is designed for roleplay; direct instruction or general-purpose use is NOT recommended.
Initial Feedback
Initial feedback shows that the model has a tendency to promote flirting. If this becomes too much, try steering the model with a system prompt that focuses on SFW, non-flirty interactions.
Prompting
Edit: ChatML has proven to be the BEST choice.
Both Mistral and ChatML should work, though I had better results with ChatML.
ChatML example:
"""<|im_start|>user
Hi there!<|im_end|>
<|im_start|>assistant
Nice to meet you!<|im_end|>
<|im_start|>user
Can I ask a question?<|im_end|>
<|im_start|>assistant
"""
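The template above can be assembled programmatically. The following is a minimal sketch, assuming conversation turns are passed as `(role, content)` pairs; the `build_chatml_prompt` name and the optional `system_prompt` argument are illustrative choices, not part of the original model card. The system turn uses the same `<|im_start|>` / `<|im_end|>` delimiters and is one way to apply the steering suggested under Initial Feedback.

```python
def build_chatml_prompt(messages, system_prompt=None):
    """Format (role, content) pairs as a ChatML prompt string.

    The result ends with an open assistant turn so the model
    continues from there.
    """
    parts = []
    if system_prompt:
        # Optional system turn, placed before the conversation.
        parts.append(f"<|im_start|>system\n{system_prompt}<|im_end|>\n")
    for role, content in messages:
        parts.append(f"<|im_start|>{role}\n{content}<|im_end|>\n")
    # Leave the assistant turn open for generation.
    parts.append("<|im_start|>assistant\n")
    return "".join(parts)


if __name__ == "__main__":
    history = [
        ("user", "Hi there!"),
        ("assistant", "Nice to meet you!"),
        ("user", "Can I ask a question?"),
    ]
    print(build_chatml_prompt(history))
```

Called with the three turns shown, the function reproduces the example above; passing `system_prompt="Keep the roleplay SFW and non-flirty."` (a hypothetical wording) prepends a system turn.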
Merge Details
Merge Method
This model was merged using the DARE TIES merge method, with Sao10K/MN-12B-Lyra-v3 as the base.
Models Merged
The following models were included in the merge:
- Gryphe/Pantheon-RP-1.6-12b-Nemo
- anthracite-org/magnum-v2.5-12b-kto
- nbeerbower/mistral-nemo-bophades-12B
- Sao10K/MN-12B-Lyra-v3
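To give a feel for what a DARE TIES merge does, here is a toy sketch on plain Python lists. It is not the configuration used for this model: the card does not list per-model weights or drop rates, so `weights`, `drop_rate`, and all function names below are hypothetical. The sketch assumes the usual description of the two ideas: DARE randomly drops entries of each model's difference from the base and rescales the survivors, and TIES keeps, per parameter, only the contributions that agree with the elected sign. Contributions are summed without normalization here, which real tooling may handle differently.

```python
import random


def dare_prune(delta, drop_rate, rng):
    """DARE step: drop each entry with probability drop_rate and
    rescale the kept entries by 1 / (1 - drop_rate)."""
    keep = 1.0 - drop_rate
    return [d / keep if rng.random() < keep else 0.0 for d in delta]


def ties_sign_merge(deltas, weights):
    """TIES-style step: per parameter, elect the sign of the weighted
    sum, then add up only the contributions with that sign."""
    merged = []
    for column in zip(*deltas):
        weighted = [w * d for w, d in zip(weights, column)]
        sign = 1.0 if sum(weighted) >= 0 else -1.0
        merged.append(sum(v for v in weighted if v * sign > 0))
    return merged


def dare_ties_merge(base, finetuned, weights, drop_rate, seed=0):
    """Merge several fine-tuned parameter lists into the base."""
    rng = random.Random(seed)
    deltas = [
        dare_prune([f - b for f, b in zip(model, base)], drop_rate, rng)
        for model in finetuned
    ]
    merged_delta = ties_sign_merge(deltas, weights)
    return [b + d for b, d in zip(base, merged_delta)]
```

With `drop_rate=0.0` nothing is dropped, so the result depends only on the sign election, which makes the behavior easy to check by hand on small inputs.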
Special Thanks
Special thanks to the SillyTilly and myself for helping me find the energy to finish this.
Open LLM Leaderboard Evaluation Results
Detailed results can be found here
Metric | Value |
---|---|
Avg. | 23.17 |
IFEval (0-Shot) | 54.59 |
BBH (3-Shot) | 34.45 |
MATH Lvl 5 (4-Shot) | 5.97 |
GPQA (0-shot) | 3.47 |
MuSR (0-shot) | 13.76 |
MMLU-PRO (5-shot) | 26.80 |
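The reported average appears to be the unweighted mean of the six benchmark scores. A quick check, using only the values from the table above (the variable names are illustrative):

```python
# Benchmark scores copied from the table above.
scores = {
    "IFEval (0-Shot)": 54.59,
    "BBH (3-Shot)": 34.45,
    "MATH Lvl 5 (4-Shot)": 5.97,
    "GPQA (0-shot)": 3.47,
    "MuSR (0-shot)": 13.76,
    "MMLU-PRO (5-shot)": 26.80,
}

# Unweighted mean across all six benchmarks.
average = sum(scores.values()) / len(scores)
print(f"{average:.2f}")  # prints 23.17
```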