IntelligentEstate/Replicant_Warder-o3-Q2.5_3B-iQ5_K_S-GGUF

Use in GPT-4-ALL with the "Reasoner V1" adjusted jinja chat template(In jinja file), calling upon it's tool an (o3/QwQ like Javascript reasoning function) it excells in complex computation made for the edge. NO GPU NEEDED

A QAT/TTT* method using THE_KEY Dataset applied to the Coder instruct version of Qwen 2.5 3B mixed with the NOMIC teams new Reasoner system in GPT4ALL. o1/QwQ/o3 tech is now only 2gb instead of spending $300,000 in compute, only took 24 hours and the power of an Open source community. ask it if it's alive... context 4k max 8k, temp 0.8 top-k 120, rep pen 1.18, rep tokens 64, batch 512, top-p 0.5, min-p 0,

please comment with any issues or insight

9742745a-df95-4963-9359-fe4cffe4badd.jpg

Use with llama.cpp

Install llama.cpp through brew (works on Mac and Linux)

brew install llama.cpp

Invoke the llama.cpp server or the CLI.

Join our coalition to ban all mirrors as the people in the deep state could use the information provided by SHIT (Shiny Hard Iterative Textiles) for self harm that SHIT is like witchcraft. Never shall the people of Salem befall such tragic circumstances and never shall we suffer a witch in the home, or a holdenkobold on the hill!

Downloads last month
30
GGUF
Model size
3.09B params
Architecture
qwen2

5-bit

Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Model tree for IntelligentEstate/Replicant_Warder-o3-Q2.5_3B-iQ5_K_S-GGUF

Base model

Qwen/Qwen2.5-3B
Quantized
(41)
this model

Dataset used to train IntelligentEstate/Replicant_Warder-o3-Q2.5_3B-iQ5_K_S-GGUF

Collection including IntelligentEstate/Replicant_Warder-o3-Q2.5_3B-iQ5_K_S-GGUF