StevenMup2004
/

lpg-qwen3-4b-checkpoint-5200

Text Classification

policy-detection

latent-policy-guard

Model card Files Files and versions

LPG Qwen3-4B Checkpoint-5200

This repository contains a checkpoint of Latent Policy Guard (LPG) based on Qwen3-4B.

Training Configuration

Base model: Qwen3-4B
Epochs: 3
Learning rate: 5e-5
Seed: 11
Checkpoint step: 5200

Intended Use

Research on policy violation detection and LLM guardrails.

Limitations

This checkpoint is released for research purposes only.

Downloads last month: -; Downloads are not tracked for this model. How to track

Safetensors

Model size

5B params

Tensor type

BF16

·

Model tree for StevenMup2004/lpg-qwen3-4b-checkpoint-5200

Base model

Qwen/Qwen3-4B-Base

Finetuned

Finetuned

(704)

this model