DINOHash
Canonical model zoo for DINOHash — robust perceptual image hashing (ONNX).
All inputs are (N, 3, 224, 224), ImageNet-normalized
(mean=[0.485,0.456,0.406], std=[0.229,0.224,0.225]).
Recommended — hash bits baked in
The ONNX graph includes the PCA head; the output is per-bit logits (binarize with >= 0 → 1).
| Model | Bits | File |
|---|---|---|
| DINOv2 ViT-B/14 (flagship) | 512 | dinov2_vitb14_reg_512bit_dynamic.onnx |
| DINOv2 ViT-S/14 | 96 | dinov2_vits14_reg_96bit_dynamic.onnx |
Free hosted API: https://huggingface.co/spaces/proteus-photos/DINOHash
Other backbones — raw embeddings (apply your own PCA / sign-binarization)
- DINOv3:
dinov3-vits16,dinov3-vits16plus,dinov3_convnext-tiny,dinov3_convnext-small - Distilled students (DisCo):
ResNet-{50,50*2,101,152}→{ResNet-18, ResNet-34, MobileNet-v3, EfficientNet-B0, EfficientNet-B1}, plusViT-Small→ViT-Tiny,XCiT-Small→XCiT-Tiny - SSL baselines: DINO / MoCo / SwAV (MobileNetV2, R18, R34, ViT-T) and MAE-Lite (
mae_tiny_*,mocov3_tiny)
raw/
Raw training / TorchScript checkpoints for the ViT-Tiny, XCiT-Tiny and MAE-Lite models.
- Downloads last month
- 30
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support