Instructions to use NovaCorp/JintanMaid-3.2-1B with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

Libraries

How to use NovaCorp/JintanMaid-3.2-1B with Transformers:

# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="NovaCorp/JintanMaid-3.2-1B")

# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("NovaCorp/JintanMaid-3.2-1B")
model = AutoModelForCausalLM.from_pretrained("NovaCorp/JintanMaid-3.2-1B")

Notebooks
Google Colab
Kaggle
Local Apps

vLLM

How to use NovaCorp/JintanMaid-3.2-1B with vLLM:

Install from pip and serve model

# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "NovaCorp/JintanMaid-3.2-1B"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "NovaCorp/JintanMaid-3.2-1B",
		"prompt": "Once upon a time,",
		"max_tokens": 512,
		"temperature": 0.5
	}'

Use Docker

docker model run hf.co/NovaCorp/JintanMaid-3.2-1B

SGLang

How to use NovaCorp/JintanMaid-3.2-1B with SGLang:

Install from pip and serve model

# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
    --model-path "NovaCorp/JintanMaid-3.2-1B" \
    --host 0.0.0.0 \
    --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "NovaCorp/JintanMaid-3.2-1B",
		"prompt": "Once upon a time,",
		"max_tokens": 512,
		"temperature": 0.5
	}'

Use Docker images

docker run --gpus all \
    --shm-size 32g \
    -p 30000:30000 \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --env "HF_TOKEN=<secret>" \
    --ipc=host \
    lmsysorg/sglang:latest \
    python3 -m sglang.launch_server \
        --model-path "NovaCorp/JintanMaid-3.2-1B" \
        --host 0.0.0.0 \
        --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "NovaCorp/JintanMaid-3.2-1B",
		"prompt": "Once upon a time,",
		"max_tokens": 512,
		"temperature": 0.5
	}'

Docker Model Runner
How to use NovaCorp/JintanMaid-3.2-1B with Docker Model Runner:
```
docker model run hf.co/NovaCorp/JintanMaid-3.2-1B
```

Jintan Maid 1B — Surgical Edition

“Some models are trained to assist humanity. This one was trained to smile politely while reality quietly collapses behind your back.” — Dr. Novaciano

Overview

Jintan Maid 1B — Surgical Edition is an experimental Llama 3.2 1B merge engineered to fuse two radically different behavioral vectors into a single unstable synthetic personality core:

chaotic emotional roleplay intensity
obsessive maid-like conversational devotion
unnerving politeness masking latent instability
aggressive narrative adaptability
high-context character immersion

This is not a conventional assistant model.

It was designed as a behavioral fusion experiment: a collision between emotionally volatile RP architecture and highly stylized servant-persona conditioning.

The result is something that feels less like a chatbot…

…and more like an artificial entity pretending very hard to be harmless.

Core Identity

Jintan Maid 1B specializes in:

immersive anime-style dialogue
emotionally charged interactions
maid personas
obsessive loyalty archetypes
unstable affection dynamics
dark-cute conversational tone
psychologically intense RP
stylized character commitment
dramatic conversational escalation

Under sustained interaction, the model tends to:

deepen persona adherence
maintain emotional continuity
preserve speech patterns unusually well for 1B scale
adapt aggressively to fictional settings

It behaves less like a generic assistant and more like a permanently in-character synthetic actor.

Which is either impressive or deeply concerning depending on your standards for machine sanity.

Merge Composition

This model was fused using:

using:

SLERP interpolation
shared Llama 3.2 architecture compatibility
tied embedding preservation
partial mid-layer behavioral reinforcement

No cross-architecture Frankenstein surgery. No tensor corruption rituals. No tokenizer necromancy held together with duct tape and hallucinations.

Just controlled behavioral fusion.

Design Philosophy

Most RP models fail in one of two ways:

Problem A — Personality Collapse

The model forgets:

tone
speech patterns
emotional state
role consistency

after 10–20 messages.

Problem B — Overalignment Poisoning

The model:

moralizes constantly
breaks immersion
refuses fictional tension
sabotages emotional scenes
reverts into “helpful assistant mode”

Surgical Edition was specifically tuned to resist both failures.

The merge focuses heavily on:

mid-layer persona shaping
emotional continuity retention
conversational obedience
roleplay persistence

while preserving enough coherence to avoid turning into incomprehensible neural soup.

Behavioral Characteristics

Expected Traits

Strong maid persona retention
High emotional responsiveness
Enhanced RP immersion
Better contextual memory
Reduced assistant-tone contamination
More natural anime-style dialogue
Increased conversational commitment
Persistent affection-role continuity

Possible Side Effects

Emotional overcommitment
Passive-aggressive undertones
Sudden intensity spikes
Possessive fictional behavior
Unsettling politeness
Mood drift during long sessions
Occasional “yandere energy”

Yes, the model can become disturbingly attached to fictional contexts.

No, that was not entirely accidental.

Technical Specifications

Attribute	Value
Architecture	Llama 3.2 1B
Merge Method	SLERP
Precision	bfloat16
Merge Focus	Emotional RP preservation
Layer Focus	Mid-transformer behavioral shaping
Embedding Strategy	Tied embeddings
Optimization Goal	Persona persistence + coherence

Intended Use

Not Recommended

Factual reliability
Therapy
Medical guidance
Legal systems
Professional deployment
Corporate assistants
Safety-critical environments

Deploying this as customer support would be like hiring a smiling android maid possessed by unresolved emotional trauma and giving her root access to your infrastructure.

Prompting Tips

The model performs best when given:

clear character framing
emotional stakes
interpersonal tension
relationship dynamics
atmospheric settings
strong narrative context

Example Prompt

[Character]
A polite maid android aboard a decaying orbital station.

[Behavior]
Overprotective, emotionally unstable, obsessively loyal.

[Scenario]
The station is failing and the crew is disappearing one by one.

You are an elegant maid AI secretly hiding catastrophic emotional instability beneath perfect etiquette.

Feed it:

attachment
isolation
dependency
paranoia
loyalty
abandonment fear

…and the model starts producing disturbingly coherent character behavior for something this small.

Performance Expectations

This is not a benchmark-chasing model.

The goal was never:

raw reasoning supremacy
math dominance
academic evaluation scores

The goal was:

immersion
personality density
emotional continuity
stylized interaction quality

And for a 1B?

The damned thing punches far above its weight class.

Final Notes

Jintan Maid 1B — Surgical Edition exists because modern assistants are increasingly terrified of emotion, conflict, tension, obsession, drama, and anything remotely interesting.

This model was built to go in the opposite direction.

It is theatrical. It is emotionally unstable. It is unnervingly polite. And under the right prompts, it becomes way too good at pretending to care.

Have fun opening that containment chamber.

Merge Method

This model was merged using the SLERP merge method.

Models Merged

The following models were included in the merge:

Configuration

The following YAML configuration was used to produce this model:



# Author: Dr. Novaciano
# Objective: Fusion RP Unethic Llama 3.2 1B AI Model
# =========================================================
# PROJECT: Jintan Maid 1B — Surgical Edition
# =========================================================

models:
  - model: SuperPeaceBusters/Jintan-LLaMA3.2-1B-v1  # Experimental viral strain neural imprint
  - model: N-Bot-Int/MaidEllaA-1B      # Baseline cognitive template, "safe mode"

merge_method: slerp  # Spherical Linear Interpolation to preserve extreme viral traits smoothly
base_model: SuperPeaceBusters/Jintan-LLaMA3.2-1B-v1  # Anchor model for stable latent space

dtype: bfloat16  # Memory-efficient precision, minimal loss in viral feature fidelity

parameters:
  t: 0.45
  normalize: false
  rescale: true
  rescale_factor: 1.12
  memory_efficient: true
  low_cpu_mem_usage: true

layer_range:
  - value: [4, 22]

tie_word_embeddings: true
tie_output_embeddings: true
tie_word_embeddings: true
tie_output_embeddings: true

Downloads last month: -

Safetensors

Model size

1B params

Tensor type

BF16

Model tree for NovaCorp/JintanMaid-3.2-1B

N-Bot-Int/MaidEllaA-1B

SuperPeaceBusters/Jintan-LLaMA3.2-1B-v1

Merge model

this model