README.md · prakashchhipa/MPD_SSL at ddb810cf21cde2b1dae94fa90a7c9a2d67671b91

metadata

license: apache-2.0
datasets:
  - ILSVRC/imagenet-1k
language:
  - en
pipeline_tag: image-classification
tags:
  - Robust SSL
  - DINO
  - SimCLR
  - Perspective Distortion
  - MPD
  - ImageNet-PD

Self-Supervised Pretrained Models with MPD Integration

Publication- Möbius Transform for Mitigating Perspective Distortions in Representation Learning, European Conference on Computer Vision (ECCV 2024)

Model Description

This release includes two self-supervised pretrained models integrated with the Mitigating Perspective Distortion (MPD) method. The models are:

ResNet50 pretrained using SimCLR- https://huggingface.co/prakashchhipa/MPD_SSL/blob/main/SimCLR_resnet50_with_MPD.pth.tar
ViT-small pretrained using DINO- https://huggingface.co/prakashchhipa/MPD_SSL/blob/main/DINO_vit-small_with_MPD.pth

Both models were trained with a batch size of 512 over 100 epochs. The MPD method enhances the robustness of these models by simulating real-world perspective distortions, making them more robust in various computer vision tasks.

Training Details

Algorithms- SimCLR for ResNet50, DINO for ViT-small Batch Size- 512 Epochs- 100 Perspective Distortion Synthesis: MPD

Performance

The integration of MPD in both SimCLR and DINO frameworks significantly improves the models' performance on tasks affected by perspective distortion. The models can be used directly for downstream tasks or further fine-tuned for specific applications. Refer results in MPD paper.

Source Code

Two minutes summary on MPD and links to access source code repository and ImageNet-PD bacnhmark are available at https://prakashchhipa.github.io/projects/mpd/