ov-seg / README.md
liangfeng
add ovseg
583456e
---
title: Ov Seg
emoji: πŸ“Š
colorFrom: red
colorTo: pink
sdk: gradio
sdk_version: 3.8.2
app_file: app.py
pinned: false
license: cc-by-nc-4.0
---
Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
# [OVSeg] Open-Vocabulary Semantic Segmentation with Mask-adapted CLIP
<img src="resources/pytorch-logo-dark.png" width="10%">
This is the official PyTorch implementation of our paper: <br>
**Open-Vocabulary Semantic Segmentation with Mask-adapted CLIP**<br>
[Feng Liang](https://jeff-liangf.github.io/), [Bichen Wu](https://www.linkedin.com/in/bichenwu), [Xiaoliang Dai](https://sites.google.com/view/xiaoliangdai/), [Kunpeng Li](https://kunpengli1994.github.io/), [Yinan Zhao](https://yinan-zhao.github.io/), [Hang Zhang](https://hangzhang.org/), [Peizhao Zhang](https://www.linkedin.com/in/peizhao-zhang-14846042/), [Peter Vajda](https://sites.google.com/site/vajdap), [Diana Marculescu](https://www.ece.utexas.edu/people/faculty/diana-marculescu)
[[arXiv](https://arxiv.org/abs/2210.04150)] [[Project](https://jeff-liangf.github.io/projects/ovseg/)]
<p align="center">
<img src="resources/ovseg.gif" width="100%">
</p>
## Installation
Please see [installation guide](./INSTALL.md).
## Data Preparation
Please see [datasets preparation](./datasets/DATASETS.md).
## Getting started
Please see [getting started instruction](./GETTING_STARTED.md).
## LICENSE
Shield: [![CC BY-NC 4.0][cc-by-nc-shield]][cc-by-nc]
The majority of OVSeg is licensed under a
[Creative Commons Attribution-NonCommercial 4.0 International License](LICENSE).
[![CC BY-NC 4.0][cc-by-nc-image]][cc-by-nc]
[cc-by-nc]: http://creativecommons.org/licenses/by-nc/4.0/
[cc-by-nc-image]: https://licensebuttons.net/l/by-nc/4.0/88x31.png
[cc-by-nc-shield]: https://img.shields.io/badge/License-CC%20BY--NC%204.0-lightgrey.svg
However portions of the project are under separate license terms: CLIP and ZSSEG are licensed under the [MIT license](https://github.com/openai/CLIP/blob/main/LICENSE); MaskFormer is licensed under the [CC-BY-NC](https://github.com/facebookresearch/MaskFormer/blob/main/LICENSE); openclip is licensed under the license at [its repo](https://github.com/mlfoundations/open_clip/blob/main/LICENSE).
## Citing OVSeg :pray:
If you use OVSeg in your research or wish to refer to the baseline results published in the paper, please use the following BibTeX entry.
```BibTeX
@article{liang2022open,
title={Open-Vocabulary Semantic Segmentation with Mask-adapted CLIP},
author={Liang, Feng and Wu, Bichen and Dai, Xiaoliang and Li, Kunpeng and Zhao, Yinan and Zhang, Hang and Zhang, Peizhao and Vajda, Peter and Marculescu, Diana},
journal={arXiv preprint arXiv:2210.04150},
year={2022}
}
```