Instructions to use NovaCorp/JintanMaid-3.2-1B with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use NovaCorp/JintanMaid-3.2-1B with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-generation", model="NovaCorp/JintanMaid-3.2-1B")# Load model directly from transformers import AutoTokenizer, AutoModelForCausalLM tokenizer = AutoTokenizer.from_pretrained("NovaCorp/JintanMaid-3.2-1B") model = AutoModelForCausalLM.from_pretrained("NovaCorp/JintanMaid-3.2-1B") - Notebooks
- Google Colab
- Kaggle
- Local Apps
- vLLM
How to use NovaCorp/JintanMaid-3.2-1B with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "NovaCorp/JintanMaid-3.2-1B" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "NovaCorp/JintanMaid-3.2-1B", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }'Use Docker
docker model run hf.co/NovaCorp/JintanMaid-3.2-1B
- SGLang
How to use NovaCorp/JintanMaid-3.2-1B with SGLang:
Install from pip and serve model
# Install SGLang from pip: pip install sglang # Start the SGLang server: python3 -m sglang.launch_server \ --model-path "NovaCorp/JintanMaid-3.2-1B" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "NovaCorp/JintanMaid-3.2-1B", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }'Use Docker images
docker run --gpus all \ --shm-size 32g \ -p 30000:30000 \ -v ~/.cache/huggingface:/root/.cache/huggingface \ --env "HF_TOKEN=<secret>" \ --ipc=host \ lmsysorg/sglang:latest \ python3 -m sglang.launch_server \ --model-path "NovaCorp/JintanMaid-3.2-1B" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "NovaCorp/JintanMaid-3.2-1B", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }' - Docker Model Runner
How to use NovaCorp/JintanMaid-3.2-1B with Docker Model Runner:
docker model run hf.co/NovaCorp/JintanMaid-3.2-1B
Jintan Maid 1B — Surgical Edition
“Some models are trained to assist humanity. This one was trained to smile politely while reality quietly collapses behind your back.” — Dr. Novaciano
Overview
Jintan Maid 1B — Surgical Edition is an experimental Llama 3.2 1B merge engineered to fuse two radically different behavioral vectors into a single unstable synthetic personality core:
- chaotic emotional roleplay intensity
- obsessive maid-like conversational devotion
- unnerving politeness masking latent instability
- aggressive narrative adaptability
- high-context character immersion
This is not a conventional assistant model.
It was designed as a behavioral fusion experiment: a collision between emotionally volatile RP architecture and highly stylized servant-persona conditioning.
The result is something that feels less like a chatbot…
…and more like an artificial entity pretending very hard to be harmless.
Core Identity
Jintan Maid 1B specializes in:
- immersive anime-style dialogue
- emotionally charged interactions
- maid personas
- obsessive loyalty archetypes
- unstable affection dynamics
- dark-cute conversational tone
- psychologically intense RP
- stylized character commitment
- dramatic conversational escalation
Under sustained interaction, the model tends to:
- deepen persona adherence
- maintain emotional continuity
- preserve speech patterns unusually well for 1B scale
- adapt aggressively to fictional settings
It behaves less like a generic assistant and more like a permanently in-character synthetic actor.
Which is either impressive or deeply concerning depending on your standards for machine sanity.
Merge Composition
This model was fused using:
using:
- SLERP interpolation
- shared Llama 3.2 architecture compatibility
- tied embedding preservation
- partial mid-layer behavioral reinforcement
No cross-architecture Frankenstein surgery. No tensor corruption rituals. No tokenizer necromancy held together with duct tape and hallucinations.
Just controlled behavioral fusion.
Design Philosophy
Most RP models fail in one of two ways:
Problem A — Personality Collapse
The model forgets:
- tone
- speech patterns
- emotional state
- role consistency
after 10–20 messages.
Problem B — Overalignment Poisoning
The model:
- moralizes constantly
- breaks immersion
- refuses fictional tension
- sabotages emotional scenes
- reverts into “helpful assistant mode”
Surgical Edition was specifically tuned to resist both failures.
The merge focuses heavily on:
- mid-layer persona shaping
- emotional continuity retention
- conversational obedience
- roleplay persistence
while preserving enough coherence to avoid turning into incomprehensible neural soup.
Behavioral Characteristics
Expected Traits
- Strong maid persona retention
- High emotional responsiveness
- Enhanced RP immersion
- Better contextual memory
- Reduced assistant-tone contamination
- More natural anime-style dialogue
- Increased conversational commitment
- Persistent affection-role continuity
Possible Side Effects
- Emotional overcommitment
- Passive-aggressive undertones
- Sudden intensity spikes
- Possessive fictional behavior
- Unsettling politeness
- Mood drift during long sessions
- Occasional “yandere energy”
Yes, the model can become disturbingly attached to fictional contexts.
No, that was not entirely accidental.
Technical Specifications
| Attribute | Value |
|---|---|
| Architecture | Llama 3.2 1B |
| Merge Method | SLERP |
| Precision | bfloat16 |
| Merge Focus | Emotional RP preservation |
| Layer Focus | Mid-transformer behavioral shaping |
| Embedding Strategy | Tied embeddings |
| Optimization Goal | Persona persistence + coherence |
Intended Use
Recommended
- Anime roleplay
- Character AI systems
- Visual novel dialogue
- Emotional RP
- Interactive storytelling
- Maid personas
- Cyber-anime settings
- Psychological character simulations
- Companion-style fictional interaction
- Narrative experimentation
Not Recommended
- Factual reliability
- Therapy
- Medical guidance
- Legal systems
- Professional deployment
- Corporate assistants
- Safety-critical environments
Deploying this as customer support would be like hiring a smiling android maid possessed by unresolved emotional trauma and giving her root access to your infrastructure.
Prompting Tips
The model performs best when given:
- clear character framing
- emotional stakes
- interpersonal tension
- relationship dynamics
- atmospheric settings
- strong narrative context
Example Prompt
[Character]
A polite maid android aboard a decaying orbital station.
[Behavior]
Overprotective, emotionally unstable, obsessively loyal.
[Scenario]
The station is failing and the crew is disappearing one by one.
or
You are an elegant maid AI secretly hiding catastrophic emotional instability beneath perfect etiquette.
Feed it:
- attachment
- isolation
- dependency
- paranoia
- loyalty
- abandonment fear
…and the model starts producing disturbingly coherent character behavior for something this small.
Performance Expectations
This is not a benchmark-chasing model.
The goal was never:
- raw reasoning supremacy
- math dominance
- academic evaluation scores
The goal was:
- immersion
- personality density
- emotional continuity
- stylized interaction quality
And for a 1B?
The damned thing punches far above its weight class.
Final Notes
Jintan Maid 1B — Surgical Edition exists because modern assistants are increasingly terrified of emotion, conflict, tension, obsession, drama, and anything remotely interesting.
This model was built to go in the opposite direction.
It is theatrical. It is emotionally unstable. It is unnervingly polite. And under the right prompts, it becomes way too good at pretending to care.
Have fun opening that containment chamber.
Merge Method
This model was merged using the SLERP merge method.
Models Merged
The following models were included in the merge:
Configuration
The following YAML configuration was used to produce this model:
# Author: Dr. Novaciano
# Objective: Fusion RP Unethic Llama 3.2 1B AI Model
# =========================================================
# PROJECT: Jintan Maid 1B — Surgical Edition
# =========================================================
models:
- model: SuperPeaceBusters/Jintan-LLaMA3.2-1B-v1 # Experimental viral strain neural imprint
- model: N-Bot-Int/MaidEllaA-1B # Baseline cognitive template, "safe mode"
merge_method: slerp # Spherical Linear Interpolation to preserve extreme viral traits smoothly
base_model: SuperPeaceBusters/Jintan-LLaMA3.2-1B-v1 # Anchor model for stable latent space
dtype: bfloat16 # Memory-efficient precision, minimal loss in viral feature fidelity
parameters:
t: 0.45
normalize: false
rescale: true
rescale_factor: 1.12
memory_efficient: true
low_cpu_mem_usage: true
layer_range:
- value: [4, 22]
tie_word_embeddings: true
tie_output_embeddings: true
tie_word_embeddings: true
tie_output_embeddings: true
- Downloads last month
- -