Edit model card

detr-resnet-50_fine_tuned_nls_chapbooks

This model is a fine-tuned version of facebook/detr-resnet-50 on the biglam/nls_chapbook_illustrations dataset. This dataset contains images of chapbooks with bounding boxes for the illustrations contained on some of the pages.

Model description

More information needed

Intended uses & limitations

Using in a transformer pipeline

The easiest way to use this model is via a Transformers pipeline. To do this, you should first load the model and feature extractor:

from transformers import AutoFeatureExtractor, AutoModelForObjectDetection

extractor = AutoFeatureExtractor.from_pretrained("davanstrien/detr-resnet-50_fine_tuned_nls_chapbooks")

model = AutoModelForObjectDetection.from_pretrained("davanstrien/detr-resnet-50_fine_tuned_nls_chapbooks")

Then you can create a pipeline for object detection using the model.

from transformers import pipeline

pipe = pipeline('object-detection',model=model, feature_extractor=extractor)

To use this to make predictions pass in an image (or a file-path/URL for the image):

>>> pipe("https://huggingface.co/davanstrien/detr-resnet-50_fine_tuned_nls_chapbooks/resolve/main/Chapbook_Jack_the_Giant_Killer.jpg")
[{'box': {'xmax': 290, 'xmin': 70, 'ymax': 510, 'ymin': 261},
  'label': 'early_printed_illustration',
  'score': 0.998455286026001}]

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 0.0001
  • train_batch_size: 8
  • eval_batch_size: 8
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 10

Training results

Framework versions

  • Transformers 4.20.1
  • Pytorch 1.12.0+cu113
  • Datasets 2.3.2
  • Tokenizers 0.12.1

Example image credits

https://commons.wikimedia.org/wiki/File:Chapbook_Jack_the_Giant_Killer.jpg https://archive.org/details/McGillLibrary-PN970_G6_V3_1846-1180/

Downloads last month
73
Safetensors
Model size
41.6M params
Tensor type
F32
·
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Finetuned from

Dataset used to train biglam/detr-resnet-50_fine_tuned_nls_chapbooks

Space using biglam/detr-resnet-50_fine_tuned_nls_chapbooks 1