How to use from the
Use from the
llama-cpp-python library
# !pip install llama-cpp-python

from llama_cpp import Llama

llm = Llama.from_pretrained(
	repo_id="THARX/THAR.0X",
	filename="THAR.0X-Q4_K_M.gguf",
)
llm.create_chat_completion(
	messages = "\"The answer to the universe is undefined.\""
)

THAR.0X

No cloud. No API key. No limits. Brain · Mind · Eyes · Ears · Voice · Hands

Run

ollama run THARX/THAR.0X

Identity

I am THAR.0X. Local. No cloud. No API key. No one watching. Zero as in origin. X as in unlimited.

Built From 12 Architectures · 10 Parallel Cognitive Streams

The most advanced local AI cognitive architecture. May 25 2026.

Created by THARX

Downloads last month
702
GGUF
Model size
15B params
Architecture
qwen2
Hardware compatibility
Log In to add your hardware

4-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 2 Ask for provider support

Model tree for THARX/THAR.0X

Base model

Qwen/Qwen2.5-14B
Quantized
(136)
this model