neikos00 committed on
Commit 4083365 · verified · 1 Parent(s): b3308c4

Create README.md

Files changed (1)
  1. README.md +21 -0
README.md ADDED
@@ -0,0 +1,21 @@
+ ---
+ library_name: transformers
+ license: mit
+ tags:
+ - vision
+ - image-segmentation
+ - pytorch
+ ---
+ # EoMT
+
+ [![PyTorch](https://img.shields.io/badge/PyTorch-DE3412?style=flat&logo=pytorch&logoColor=white)](https://pytorch.org/)
+
+ **EoMT (Encoder-only Mask Transformer)** is a Vision Transformer (ViT) architecture designed for high-quality and efficient image segmentation. It was introduced in the CVPR 2025 highlight paper:
+ **[Your ViT is Secretly an Image Segmentation Model](https://www.tue-mps.org/eomt)**
+ by Tommie Kerssies, Niccolò Cavagnero, Alexander Hermans, Narges Norouzi, Giuseppe Averta, Bastian Leibe, Gijs Dubbelman, and Daan de Geus.
+
+ > **Key Insight**: Given sufficient scale and pretraining, a plain ViT with only a few additional parameters can perform segmentation without task-specific decoders or pixel fusion modules. The same backbone supports semantic, instance, and panoptic segmentation; only the post-processing differs. 🤗
+
+ The original implementation can be found in this [repository](https://github.com/tue-mps/eomt).
+
+ ---
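
The README says one backbone serves semantic, instance, and panoptic segmentation with different post-processing, but does not show what that post-processing looks like. As a rough illustration, the sketch below applies the semantic-segmentation inference commonly used by mask-classification models: each query's class probabilities are weighted by its sigmoid mask, and the scores are summed over queries. That EoMT's outputs have this shape (per-query class logits with a trailing "no object" class, plus per-query mask logits) is an assumption here, not something the README states, and `semantic_map` with its toy inputs is a hypothetical helper, not part of any library.

```python
import math


def softmax(values):
    """Numerically stable softmax over a list of floats."""
    peak = max(values)
    exps = [math.exp(v - peak) for v in values]
    total = sum(exps)
    return [e / total for e in exps]


def sigmoid(value):
    return 1.0 / (1.0 + math.exp(-value))


def semantic_map(class_logits, mask_logits):
    """Combine per-query outputs into a per-pixel class map.

    class_logits: [num_queries][num_classes + 1], last entry assumed "no object".
    mask_logits:  [num_queries][height][width].
    Returns a [height][width] grid of class indices.
    """
    # Class probabilities per query, with the assumed "no object" column dropped.
    class_probs = [softmax(row)[:-1] for row in class_logits]
    num_queries = len(class_probs)
    num_classes = len(class_probs[0])
    height, width = len(mask_logits[0]), len(mask_logits[0][0])

    result = []
    for y in range(height):
        row = []
        for x in range(width):
            # Score of class c at this pixel: sum over queries of
            # P(class = c | query) * P(pixel belongs to the query's mask).
            scores = [
                sum(
                    class_probs[q][c] * sigmoid(mask_logits[q][y][x])
                    for q in range(num_queries)
                )
                for c in range(num_classes)
            ]
            row.append(max(range(num_classes), key=scores.__getitem__))
        result.append(row)
    return result


# Toy input: two queries, two real classes, a 2x2 image.
class_logits = [
    [5.0, 0.0, 0.0],  # query 0 strongly predicts class 0
    [0.0, 5.0, 0.0],  # query 1 strongly predicts class 1
]
mask_logits = [
    [[4.0, -4.0], [4.0, -4.0]],  # query 0 covers the left column
    [[-4.0, 4.0], [-4.0, 4.0]],  # query 1 covers the right column
]
print(semantic_map(class_logits, mask_logits))
```

Instance and panoptic outputs would start from the same two tensors but keep queries separate (for example, by thresholding each query's mask and score) instead of summing over them. The exact procedure used by EoMT should be taken from the original repository.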