Quantization made by Richard Erkhov.

HermesStar-OrcaWind-Synth-11B - GGUF

Name	Quant method	Size
HermesStar-OrcaWind-Synth-11B.Q2_K.gguf	Q2_K	3.73GB
HermesStar-OrcaWind-Synth-11B.IQ3_XS.gguf	IQ3_XS	4.14GB
HermesStar-OrcaWind-Synth-11B.IQ3_S.gguf	IQ3_S	4.37GB
HermesStar-OrcaWind-Synth-11B.Q3_K_S.gguf	Q3_K_S	4.34GB
HermesStar-OrcaWind-Synth-11B.IQ3_M.gguf	IQ3_M	4.51GB
HermesStar-OrcaWind-Synth-11B.Q3_K.gguf	Q3_K	4.84GB
HermesStar-OrcaWind-Synth-11B.Q3_K_M.gguf	Q3_K_M	4.84GB
HermesStar-OrcaWind-Synth-11B.Q3_K_L.gguf	Q3_K_L	5.26GB
HermesStar-OrcaWind-Synth-11B.IQ4_XS.gguf	IQ4_XS	5.43GB
HermesStar-OrcaWind-Synth-11B.Q4_0.gguf	Q4_0	5.66GB
HermesStar-OrcaWind-Synth-11B.IQ4_NL.gguf	IQ4_NL	5.72GB
HermesStar-OrcaWind-Synth-11B.Q4_K_S.gguf	Q4_K_S	5.7GB
HermesStar-OrcaWind-Synth-11B.Q4_K.gguf	Q4_K	6.02GB
HermesStar-OrcaWind-Synth-11B.Q4_K_M.gguf	Q4_K_M	6.02GB
HermesStar-OrcaWind-Synth-11B.Q4_1.gguf	Q4_1	6.27GB
HermesStar-OrcaWind-Synth-11B.Q5_0.gguf	Q5_0	6.89GB
HermesStar-OrcaWind-Synth-11B.Q5_K_S.gguf	Q5_K_S	6.89GB
HermesStar-OrcaWind-Synth-11B.Q5_K.gguf	Q5_K	7.08GB
HermesStar-OrcaWind-Synth-11B.Q5_K_M.gguf	Q5_K_M	7.08GB
HermesStar-OrcaWind-Synth-11B.Q5_1.gguf	Q5_1	7.51GB
HermesStar-OrcaWind-Synth-11B.Q6_K.gguf	Q6_K	8.2GB
HermesStar-OrcaWind-Synth-11B.Q8_0.gguf	Q8_0	10.62GB

Original model description:

license: apache-2.0 language: - en library_name: transformers pipeline_tag: text-generation

Open Hermes + Starling passthrough merged

SlimOrca(?)+Zephyr Beta linear merged, then passthrough merged with Synthia

Then both models were merged again in 1 to 0.3 ratio.

Increasing repetition penalty usually makes the model smarter up to a degree but it also causes stability issues.

Since most of the merged models were trained with ChatML, use ChatML template. Rarely the model throws another EOS token though.

My favorite preset has been uploaded.
You can use some sort of CoT prompt instead of "system" in ChatML. It does improve the quality of most output. (You are an assistant. Break down the question and come to a conclusion.)

I don't know what I am doing, you are very welcome to put the model through benchmarks.

I'll also upload q6 GGUF but my internet is shit, so don't hesitate to share other quantizations.