mindee
/

fasterrcnn_mobilenet_v3_large_fpn

Object Detection

Model card Files Files and versions Community

fasterrcnn_mobilenet_v3_large_fpn / README.md

osanseviero's picture

osanseviero HF staff

Update README.md

36909b9 over 2 years ago

|

raw history blame contribute delete

No virus

2.71 kB

	---
	license: apache-2.0
	tags:
	- object-detection
	- pytorch
	library_name: doctr
	datasets:
	- docartefacts
	---


	# Faster-RCNN model

	Pretrained on [DocArtefacts](https://mindee.github.io/doctr/datasets.html#doctr.datasets.DocArtefacts). The Faster-RCNN architecture was introduced in [this paper](https://arxiv.org/pdf/1506.01497.pdf).


	## Model description

	The core idea of the author is to unify Region Proposal with the core detection module of Fast-RCNN.


	## Installation

	### Prerequisites

	Python 3.6 (or higher) and [pip](https://pip.pypa.io/en/stable/) are required to install docTR.

	### Latest stable release

	You can install the last stable release of the package using [pypi](https://pypi.org/project/python-doctr/) as follows:

	```shell
	pip install python-doctr[torch]
	```

	### Developer mode

	Alternatively, if you wish to use the latest features of the project that haven't made their way to a release yet, you can install the package from source (install [Git](https://git-scm.com/book/en/v2/Getting-Started-Installing-Git) first):

	```shell
	git clone https://github.com/mindee/doctr.git
	pip install -e doctr/.[torch]
	```


	## Usage instructions

	```python
	from PIL import Image
	import torch
	from torchvision.transforms import Compose, ConvertImageDtype, PILToTensor
	from doctr.models.obj_detection.factory import from_hub

	model = from_hub("mindee/fasterrcnn_mobilenet_v3_large_fpn").eval()

	img = Image.open(path_to_an_image).convert("RGB")

	# Preprocessing
	transform = Compose([
	PILToTensor(),
	ConvertImageDtype(torch.float32),
	])

	input_tensor = transform(img).unsqueeze(0)

	# Inference
	with torch.inference_mode():
	output = model(input_tensor)
	```


	## Citation

	Original paper

	```bibtex
	@article{DBLP:journals/corr/RenHG015,
	author = {Shaoqing Ren and
	Kaiming He and
	Ross B. Girshick and
	Jian Sun},
	title = {Faster {R-CNN:} Towards Real-Time Object Detection with Region Proposal
	Networks},
	journal = {CoRR},
	volume = {abs/1506.01497},
	year = {2015},
	url = {http://arxiv.org/abs/1506.01497},
	eprinttype = {arXiv},
	eprint = {1506.01497},
	timestamp = {Mon, 13 Aug 2018 16:46:02 +0200},
	biburl = {https://dblp.org/rec/journals/corr/RenHG015.bib},
	bibsource = {dblp computer science bibliography, https://dblp.org}
	}
	```

	Source of this implementation

	```bibtex
	@misc{doctr2021,
	title={docTR: Document Text Recognition},
	author={Mindee},
	year={2021},
	publisher = {GitHub},
	howpublished = {\url{https://github.com/mindee/doctr}}
	}
	```