Image Segmentation
Transformers
PyTorch
upernet
Inference Endpoints
test2 / configs /ocrnet /README.md
mccaly's picture
Upload 660 files
b13b124
|
raw
history blame
15.1 kB

Object-Contextual Representations for Semantic Segmentation

Introduction

[ALGORITHM]

@article{YuanW18,
  title={Ocnet: Object context network for scene parsing},
  author={Yuhui Yuan and Jingdong Wang},
  booktitle={arXiv preprint arXiv:1809.00916},
  year={2018}
}

@article{YuanCW20,
  title={Object-Contextual Representations for Semantic Segmentation},
  author={Yuhui Yuan and Xilin Chen and Jingdong Wang},
  booktitle={ECCV},
  year={2020}
}

Results and models

Cityscapes

HRNet backbone

Method Backbone Crop Size Lr schd Mem (GB) Inf time (fps) mIoU mIoU(ms+flip) download
OCRNet HRNetV2p-W18-Small 512x1024 40000 3.5 10.45 74.30 75.95 model | log
OCRNet HRNetV2p-W18 512x1024 40000 4.7 7.50 77.72 79.49 model | log
OCRNet HRNetV2p-W48 512x1024 40000 8 4.22 80.58 81.79 model | log
OCRNet HRNetV2p-W18-Small 512x1024 80000 - - 77.16 78.66 model | log
OCRNet HRNetV2p-W18 512x1024 80000 - - 78.57 80.46 model | log
OCRNet HRNetV2p-W48 512x1024 80000 - - 80.70 81.87 model | log
OCRNet HRNetV2p-W18-Small 512x1024 160000 - - 78.45 79.97 model | log
OCRNet HRNetV2p-W18 512x1024 160000 - - 79.47 80.91 model | log
OCRNet HRNetV2p-W48 512x1024 160000 - - 81.35 82.70 model | log

ResNet backbone

Method Backbone Crop Size Batch Size Lr schd Mem (GB) Inf time (fps) mIoU mIoU(ms+flip) download
OCRNet R-101-D8 512x1024 8 40000 - - 80.09 - model | log
OCRNet R-101-D8 512x1024 16 40000 8.8 3.02 80.30 - model | log
OCRNet R-101-D8 512x1024 16 80000 8.8 3.02 80.81 - model | log

ADE20K

Method Backbone Crop Size Lr schd Mem (GB) Inf time (fps) mIoU mIoU(ms+flip) download
OCRNet HRNetV2p-W18-Small 512x512 80000 6.7 28.98 35.06 35.80 model | log
OCRNet HRNetV2p-W18 512x512 80000 7.9 18.93 37.79 39.16 model | log
OCRNet HRNetV2p-W48 512x512 80000 11.2 16.99 43.00 44.30 model | log
OCRNet HRNetV2p-W18-Small 512x512 160000 - - 37.19 38.40 model | log
OCRNet HRNetV2p-W18 512x512 160000 - - 39.32 40.80 model | log
OCRNet HRNetV2p-W48 512x512 160000 - - 43.25 44.88 model | log

Pascal VOC 2012 + Aug

Method Backbone Crop Size Lr schd Mem (GB) Inf time (fps) mIoU mIoU(ms+flip) download
OCRNet HRNetV2p-W18-Small 512x512 20000 3.5 31.55 71.70 73.84 model | log
OCRNet HRNetV2p-W18 512x512 20000 4.7 19.91 74.75 77.11 model | log
OCRNet HRNetV2p-W48 512x512 20000 8.1 17.83 77.72 79.87 model | log
OCRNet HRNetV2p-W18-Small 512x512 40000 - - 72.76 74.60 model | log
OCRNet HRNetV2p-W18 512x512 40000 - - 74.98 77.40 model | log
OCRNet HRNetV2p-W48 512x512 40000 - - 77.14 79.71 model | log