🎭 Mimicer

The model that learns to mirror.

For fun! πŸš€

---

πŸš€ Overview

Mimicer is an experimental language model fine-tuned to reproduce text patterns and mirror user inputs.

Unlike traditional assistants optimized for reasoning or instruction following, Mimicer explores identity mapping and response replication through supervised fine-tuning.

This project serves as a learning platform for model training, dataset design, Hugging Face deployment, and transformer fine-tuning workflows.


πŸ“Š Model Details

Property Value
Base Model DistilGPT2
Parameters 81.9M
Architecture GPT-2 Decoder
Fine-Tuning Supervised
Training Samples 2,500
Context Length 40 Tokens
Framework Hugging Face Transformers
Hardware NVIDIA T4
Repository QuantaSparkLabs/Mimicer

βš™οΈ Training Objective

Training samples follow a structured format:

Input: Hello world
Output: Hello world

The objective is to teach the model to reproduce the provided text after the Output: prompt.

Example:

Input: How are you?
Output: How are you?

πŸ’» Usage

from transformers import AutoTokenizer, AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained(
    "QuantaSparkLabs/Mimicer"
)

tokenizer = AutoTokenizer.from_pretrained(
    "QuantaSparkLabs/Mimicer"
)

prompt = "Input: hello how are you\nOutput:"

inputs = tokenizer(prompt, return_tensors="pt")

outputs = model.generate(
    **inputs,
    max_new_tokens=20,
    do_sample=False
)

print(tokenizer.decode(outputs[0], skip_special_tokens=True))

πŸ”¬ Project Goals

  • Learn transformer fine-tuning
  • Understand dataset design
  • Explore identity-mapping behavior
  • Practice Hugging Face model deployment
  • Build a foundation for future custom models

πŸ“œ License

Apache 2.0


Built by QuantaSparkLabs

Downloads last month
-
Safetensors
Model size
81.9M params
Tensor type
F32
Β·
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support