Parm V1
Collection
First gen of the Pinkstack advanced reasoning models. Medium quality but better than the original models they are based on.
This PARM is based on Qwen 2.5 0.5B, which has received extra reasoning training so that its outputs are similar to those of Qwen QwQ (only at a much smaller size). We trained it with this dataset. It is designed to run on any device, from your phone to a high-end PC, which is why we've included a BF16 quant.
To use this model, you must use a service that supports the GGUF file format. Additionally, this is the prompt template; it uses the qwen2 template.
{{ if .System }}<|system|>
{{ .System }}<|im_end|>
{{ end }}{{ if .Prompt }}<|user|>
{{ .Prompt }}<|im_end|>
{{ end }}<|assistant|>
{{ .Response }}<|im_end|>
Or, if you are using an anti-prompt: <|end|><|assistant|>
It is highly recommended to use this model with a system prompt.
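If your service does not apply the template for you, the prompt can be assembled by hand. The following is a minimal sketch in plain Python that mirrors the template above up to the point where the model starts generating. The function name `build_prompt` and the example strings are hypothetical and not part of this model's tooling; the tags and newlines are copied from the template as written.

```python
def build_prompt(user_prompt: str, system_prompt: str = "") -> str:
    """Assemble a prompt string that follows the template shown above.

    Hypothetical helper for illustration only. It stops after the
    assistant tag so the model can write the response itself.
    """
    parts = []
    # The system block is only emitted when a system prompt is given,
    # matching the `if .System` branch of the template.
    if system_prompt:
        parts.append(f"<|system|>\n{system_prompt}<|im_end|>\n")
    # The user block is only emitted when a user prompt is given,
    # matching the `if .Prompt` branch of the template.
    if user_prompt:
        parts.append(f"<|user|>\n{user_prompt}<|im_end|>\n")
    # The assistant tag is always emitted; generation continues from here.
    parts.append("<|assistant|>\n")
    return "".join(parts)


if __name__ == "__main__":
    # Example strings are placeholders, not recommended prompts.
    print(build_prompt("What is 12 * 7?", "Think step by step before answering."))
```

Pass the returned string to whichever GGUF-capable runtime you use, and stop generation when the model emits `<|im_end|>`.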
This model was trained using Unsloth and Hugging Face's TRL library.
Used this model? Don't forget to leave a like :)