vankey/RealText-V2
Viewer • Updated • 27.5k • 2.87k • 4
SEED Detector is a lightweight vision transformer model for document forgery detection. It localizes tampered regions in document images and classifies images as real or forged.
| Component | Detail |
|---|---|
| Backbone | DINOv3 ViT-L/16 |
| Finetuning | LoRA (rank=1, attention + MLP) |
| Queries | 1 mask query |
| Decoder blocks | 4 |
| Input size | 512 × 512 |
| Parameters | ~304M (only ~1M trainable with LoRA) |
Repository: GitHub | Checkpoint: Jason37437/SEED / Google Drive
from model.hf_wrapper import EoMTForTamperingDetection
model = EoMTForTamperingDetection.from_pretrained("Jason37437/SEED")
model.eval()
# The model outputs:
# - mask_logits: per-query segmentation masks
# - class_logits: per-query foreground/background scores
# - image_logits: image-level real vs forged classification
| Dataset | F1 |
|---|---|
| T-SROIE | 0.782 |
| OSTF | 0.718 |
| TPIC-13 | 0.798 |
| RTM | 0.178 |
| Avg | 0.619 |
| Dataset | F1 |
|---|---|
| T-SROIE | 0.738 |
| OSTF | 0.832 |
| TPIC-13 | 0.930 |
| RTM | 0.207 |
| Avg | 0.677 |
MIT License.
Base model
facebook/dinov3-vit7b16-pretrain-lvd1689m