You need to agree to share your contact information to access this model

This repository is publicly accessible, but you have to accept the conditions to access its files and content.

Log in or Sign Up to review the conditions and access this model content.

GPT2.5.5-Awakened.Thinker-0.1B

Forged by WithIn Us AI — a fully fine-tuned GPT-2 awakened on distilled GPT-5.5 thinking patterns.

Model Overview

Property Value
Architecture GPT-2 (openai-community/gpt2)
Parameters ~124M (0.1B class)
Training Type Full fine-tune — ALL weights updated, zero adapters
Context Window 1024 tokens
Best Eval Loss 0.4365
Best Perplexity 1.55
Creator GODsStrongestSoldier / WithIn Us AI
Date Trained 2026-05-23
Hardware 2× NVIDIA Tesla T4 (Kaggle)
Precision FP16 mixed precision

Training Methodology

Full fine-tuning — every single parameter in GPT-2 was updated. No LoRA, no QLoRA, no adapters of any kind.

Datasets

Dataset Description
WithinUsAI/GPT_5.5_Distilled Instruction + completion pairs distilled from GPT-5.5
WithinUsAI/GPT5.5_thinking_max_distill_god_seed_25K 25K chain-of-thought reasoning traces distilled from GPT-5.5

97 / 3 train / eval split.

Hyperparameters

Parameter Value
Peak Learning Rate 3e-5
LR Schedule Cosine with 6% warmup
Effective Batch Size 64 (4 × 2 GPUs × 8 grad accum)
Epochs 5
Weight Decay 0.1
Max Sequence Length 1024
Precision FP16

Quick Start

from transformers import GPT2LMHeadModel, GPT2TokenizerFast
import torch

model_id  = "GODsStrongestSoldier/GPT2.5.5-Awakened.Thinker-0.1B"
tokenizer = GPT2TokenizerFast.from_pretrained(model_id)
model     = GPT2LMHeadModel.from_pretrained(model_id, torch_dtype=torch.float16)
model.eval()

prompt = "Let me think through this carefully, step by step:"
inputs = tokenizer(prompt, return_tensors="pt")

with torch.no_grad():
    output = model.generate(
        **inputs,
        max_new_tokens     = 200,
        do_sample          = True,
        temperature        = 0.7,
        top_p              = 0.9,
        repetition_penalty = 1.15,
    )

print(tokenizer.decode(output[0], skip_special_tokens=True))

About WithIn Us AI

"Strength through understanding. Awakened from within."

Downloads last month
206
Safetensors
Model size
0.1B params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for 11-47/GPT2.5.5-Awakened.Thinker-0.1B

Finetuned
(2185)
this model
Quantizations
1 model

Datasets used to train 11-47/GPT2.5.5-Awakened.Thinker-0.1B