SEED Detector

SEED Detector is a lightweight vision transformer model for document forgery detection. It localizes tampered regions in document images and classifies images as real or forged.

Architecture

| Component | Detail |
|---|---|
| Backbone | DINOv3 ViT-L/16 |
| Fine-tuning | LoRA (rank=1, attention + MLP) |
| Queries | 1 mask query |
| Decoder blocks | 4 |
| Input size | 512 × 512 |
| Parameters | ~304M (only ~1M trainable with LoRA) |
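
To give a feel for how small a rank-1 LoRA is relative to the backbone, the arithmetic can be sketched as below. This is a rough estimate, not the SEED implementation: the ViT-L dimensions (width 1024, MLP width 4096, 24 layers) are the standard ones, but the exact set of adapted projections (query, key, value, output, and both MLP layers) is an assumption.

```python
# Illustrative arithmetic only; the set of adapted projections is an
# assumption and is not taken from the SEED code.

def lora_params(d_in: int, d_out: int, rank: int) -> int:
    """Parameters added by one LoRA adapter: A is (rank x d_in), B is (d_out x rank)."""
    return rank * (d_in + d_out)

HIDDEN = 1024       # ViT-L embedding width
MLP_HIDDEN = 4096   # ViT-L MLP width (4 x hidden)
NUM_LAYERS = 24     # ViT-L depth
RANK = 1            # LoRA rank listed for this model
PATCH = 16          # patch size of ViT-L/16
INPUT_SIZE = 512    # input resolution listed for this model

per_layer = (
    4 * lora_params(HIDDEN, HIDDEN, RANK)       # q, k, v, output projections (assumed)
    + lora_params(HIDDEN, MLP_HIDDEN, RANK)     # MLP up-projection (assumed)
    + lora_params(MLP_HIDDEN, HIDDEN, RANK)     # MLP down-projection (assumed)
)
total_lora = per_layer * NUM_LAYERS
num_patches = (INPUT_SIZE // PATCH) ** 2        # patch tokens per image

print(total_lora)    # 442368
print(num_patches)   # 1024
```

Under these assumptions the LoRA adapters contribute roughly 0.44M parameters. The rest of the ~1M trainable figure would then have to come from other trainable parts, such as the mask query and the prediction heads, which this sketch does not model.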

Usage

Repository: GitHub | Checkpoint: Jason37437/SEED (Hugging Face) or Google Drive

# Requires the SEED repository on the Python path (it provides model/hf_wrapper.py).
from model.hf_wrapper import EoMTForTamperingDetection

# Load the released checkpoint and switch to inference mode.
model = EoMTForTamperingDetection.from_pretrained("Jason37437/SEED")
model.eval()

# The model outputs:
#   - mask_logits: per-query segmentation masks
#   - class_logits: per-query foreground/background scores
#   - image_logits: image-level real vs. forged classification
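
The outputs listed above are raw logits, so some post-processing is needed before they can be used as a mask or a decision. The following is a minimal sketch of one way to do that, using NumPy arrays as stand-ins for the model outputs. The function `postprocess`, the tensor shapes, the `[real, forged]` ordering of `image_logits`, and the 0.5 threshold are all assumptions for illustration; check the repository for the actual output format. `class_logits` is left out for simplicity.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def softmax(x):
    e = np.exp(x - np.max(x))  # subtract the max for numerical stability
    return e / e.sum()

def postprocess(mask_logits, image_logits, mask_threshold=0.5):
    """Turn raw logits into a binary tamper mask and a forged probability.

    Assumed (hypothetical) shapes: mask_logits is (Q, H, W) and
    image_logits is (2,), ordered as [real, forged].
    """
    mask_prob = sigmoid(mask_logits).max(axis=0)   # per-pixel max over queries
    tamper_mask = mask_prob >= mask_threshold      # boolean (H, W) mask
    forged_prob = float(softmax(image_logits)[1])  # probability of "forged"
    return tamper_mask, forged_prob

# Dummy logits standing in for real model outputs (Q=1, H=2, W=2).
mask_logits = np.array([[[-2.0, 3.0], [0.5, -4.0]]])
image_logits = np.array([0.0, 2.0])

tamper_mask, forged_prob = postprocess(mask_logits, image_logits)
print(tamper_mask.tolist())  # [[False, True], [True, False]]
```

With a single mask query, as this model uses, the max over queries simply selects that query's mask.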

Performance

Localization (pixel-level F1)

| Dataset | F1 |
|---|---|
| T-SROIE | 0.782 |
| OSTF | 0.718 |
| TPIC-13 | 0.798 |
| RTM | 0.178 |
| Avg | 0.619 |

Detection (image-level F1)

| Dataset | F1 |
|---|---|
| T-SROIE | 0.738 |
| OSTF | 0.832 |
| TPIC-13 | 0.930 |
| RTM | 0.207 |
| Avg | 0.677 |
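
For reference, F1 over binary labels can be computed as in the sketch below. The same formula applies to flattened pixel masks (localization) and to per-image labels (detection). The card does not say whether pixel-level F1 is pooled over all pixels or averaged per image, so `f1_score` here is only a generic definition, and returning 0.0 when there are no positives at all is a convention chosen for this example. The last lines check that the Avg rows are consistent with an unweighted mean of the four dataset scores.

```python
def f1_score(pred, target):
    """F1 = 2*TP / (2*TP + FP + FN) over two equal-length binary sequences."""
    tp = sum(1 for p, t in zip(pred, target) if p and t)
    fp = sum(1 for p, t in zip(pred, target) if p and not t)
    fn = sum(1 for p, t in zip(pred, target) if not p and t)
    denom = 2 * tp + fp + fn
    # Convention for this sketch: no positives anywhere gives 0.0.
    return 2 * tp / denom if denom else 0.0

def mean_f1(scores):
    """Unweighted mean over datasets, rounded to three decimals."""
    return round(sum(scores.values()) / len(scores), 3)

# Scores copied from the localization and detection results.
localization = {"T-SROIE": 0.782, "OSTF": 0.718, "TPIC-13": 0.798, "RTM": 0.178}
detection = {"T-SROIE": 0.738, "OSTF": 0.832, "TPIC-13": 0.930, "RTM": 0.207}

print(f1_score([1, 1, 0, 0], [1, 0, 1, 0]))  # 0.5
print(mean_f1(localization))                 # 0.619
print(mean_f1(detection))                    # 0.677
```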

License

MIT License.
