Instructions to use jayfurzy/paterikon-sft-v2 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

Libraries

How to use jayfurzy/paterikon-sft-v2 with Transformers:

# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="jayfurzy/paterikon-sft-v2")
messages = [
    {"role": "user", "content": "Who are you?"},
]
pipe(messages)

# Load model directly
from transformers import AutoTokenizer, AutoModelForMultimodalLM

tokenizer = AutoTokenizer.from_pretrained("jayfurzy/paterikon-sft-v2")
model = AutoModelForMultimodalLM.from_pretrained("jayfurzy/paterikon-sft-v2")
messages = [
    {"role": "user", "content": "Who are you?"},
]
inputs = tokenizer.apply_chat_template(
	messages,
	add_generation_prompt=True,
	tokenize=True,
	return_dict=True,
	return_tensors="pt",
).to(model.device)

outputs = model.generate(**inputs, max_new_tokens=40)
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:]))

Notebooks
Google Colab
Kaggle
Local Apps Settings

vLLM

How to use jayfurzy/paterikon-sft-v2 with vLLM:

Install from pip and serve model

# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "jayfurzy/paterikon-sft-v2"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "jayfurzy/paterikon-sft-v2",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Use Docker

docker model run hf.co/jayfurzy/paterikon-sft-v2

SGLang

How to use jayfurzy/paterikon-sft-v2 with SGLang:

Install from pip and serve model

# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
    --model-path "jayfurzy/paterikon-sft-v2" \
    --host 0.0.0.0 \
    --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "jayfurzy/paterikon-sft-v2",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Use Docker images

docker run --gpus all \
    --shm-size 32g \
    -p 30000:30000 \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --env "HF_TOKEN=<secret>" \
    --ipc=host \
    lmsysorg/sglang:latest \
    python3 -m sglang.launch_server \
        --model-path "jayfurzy/paterikon-sft-v2" \
        --host 0.0.0.0 \
        --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "jayfurzy/paterikon-sft-v2",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Docker Model Runner
How to use jayfurzy/paterikon-sft-v2 with Docker Model Runner:
```
docker model run hf.co/jayfurzy/paterikon-sft-v2
```

Paterikon-SFT-v2

Three-pass iterative SFT — the full instruction-tuned Patristic AI.

Overview

Paterikon-SFT-v2 is the culmination of the Orthodox Patristic AI fine-tuning pipeline. Built on Paterikon-3B, it underwent three iterative rounds of supervised fine-tuning with active learning: the model generates responses, a judge evaluates them, and the best pairs are fed back into the next training round.

This active-loop approach produced 3,287 high-quality Q&A pairs across three iterations, progressively refining the model's theological accuracy, pastoral tone, and resistance to confabulation.


Base model	jayfurzy/paterikon-3b (Qwen2.5-3B CPT)
Training	3-pass iterative SFT with active learning (TRL)
Parameters	3.09 billion
Architecture	Qwen2ForCausalLM, 36 layers, 2048 hidden
Languages	English (primary), Russian, Greek
Training pairs	3,287 (1,156 iter-1 + 1,089 iter-2 + 1,042 iter-3)
Training probes	75M tokens across 3 iterations (25M per iter)
Frameworks	TRL 0.28.0, Transformers 4.57.6, PyTorch 2.9.1
License	Apache 2.0

Quick Start

from transformers import AutoTokenizer, AutoModelForCausalLM
import torch

model_id = "jayfurzy/paterikon-sft-v2"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,
    device_map="auto",
)

messages = [
    {"role": "system", "content": "You are an Orthodox Christian theologian, answering in the spirit of the Holy Fathers."},
    {"role": "user", "content": "Explain the Orthodox understanding of theosis."},
]
text = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
inputs = tokenizer(text, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=512, temperature=0.7, do_sample=True)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))

Training Methodology

Active Learning Loop

Each iteration follows the same cycle:

Probe: Generate responses to a diverse set of Orthodox theological questions (25M tokens)
Judge: Score responses on theological accuracy, pastoral tone, and attribution fidelity
Select: Filter for high-scoring Q&A pairs
Train: SFT fine-tune on the selected pairs — producing the next iteration's model

This is repeated 3 times, with each iteration building on the improved model from the previous round.

Iteration Breakdown

Iteration	Generated Pairs	Training Data
Iter 1	1,156	iter1_probe.jsonl (25M tokens)
Iter 2	1,089	iter2_probe.jsonl (25M tokens)
Iter 3	1,042	iter3_probe.jsonl (25M tokens)
Final	3,287 total	SFT on iter-3 checkpoint

Intended Use

Orthodox theological Q&A with pastoral voice
Spiritual guidance and counseling
Patristic text generation and completion
Educational tool for Orthodox Christian studies
Church Father quote recall and attribution
Base for further alignment (DPO/RLHF)

Limitations

May still confabulate on specific historical dates, canons, or exact citations
Not a substitute for a spiritual father, priest, or bishop
Reflects synthetic data biases — primarily Russian Orthodox sources
Temperature > 0.7 recommended for pastoral responses; > 0.8 may drift theologically
Model is 3B parameters — limited compared to larger models for complex theological reasoning

Relationship to Other Models

Qwen2.5-3B-Instruct
  └─ CPT → jayfurzy/paterikon-3b (patristic domain fluency)
       ├─ SFT → jayfurzy/paterikon-sft-v1 (single-pass instruction tuning)
       └─ 3-pass active-loop → jayfurzy/paterikon-sft-v2 (this model)
            └─ DPO planned

Citation

@misc{paterikon-sft-v2,
  author = {Justin Fursov},
  title = {Paterikon-SFT-v2: Three-Pass Iterative Instruction-Tuned Orthodox Patristic Language Model},
  year = {2026},
  publisher = {HuggingFace},
  url = {https://huggingface.co/jayfurzy/paterikon-sft-v2},
}

This model was developed as part of the Orthodox Constitution Project — extracting ethical principles from the Church Fathers for AI alignment. The name "Paterikon" (πατερικόν) refers to the traditional Orthodox collections of the sayings and lives of the Desert Fathers.

Downloads last month: 10

Safetensors

Model size

3B params

Tensor type

BF16

Model tree for jayfurzy/paterikon-sft-v2

Base model

Qwen/Qwen2.5-3B

Finetuned

Qwen/Qwen2.5-3B-Instruct

Finetuned

jayfurzy/paterikon-3b

Finetuned

(3)

this model

jayfurzy
/

paterikon-sft-v2

Paterikon-SFT-v2

Overview

Quick Start

Training Methodology

Active Learning Loop

Iteration Breakdown

Intended Use

Limitations

Relationship to Other Models

Citation

Model tree for jayfurzy/paterikon-sft-v2

Dataset used to train jayfurzy/paterikon-sft-v2