Instructions to use ambrosfitz/19c-newspaper-column-yolo with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- ultralytics
How to use ambrosfitz/19c-newspaper-column-yolo with ultralytics:
# Couldn't find a valid YOLO version tag. # Replace XX with the correct version. from ultralytics import YOLOvXX model = YOLOvXX.from_pretrained("ambrosfitz/19c-newspaper-column-yolo") source = 'http://images.cocodataset.org/val2017/000000039769.jpg' model.predict(source=source, save=True) - Notebooks
- Google Colab
- Kaggle
19c Newspaper Column YOLO Detector
This repository contains a YOLO detector fine-tuned to detect and segment columns in 19th-century American newspaper pages.
Extracting individual columns is a critical pre-processing step for historical OCR pipelines. Slicing long columns into smaller, overlapping horizontal strips resolves the text-hallucination/repetition issues that modern Vision-Language Models (like Gemma 2B/27B) encounter on full-page layouts.
Model Training & Performance
The model was trained on annotated historical newspaper pages. You can view the training metrics, curves, and validation performance directly below:
1. Training Metrics (results.png)
Shows training/validation loss decay and precision/recall improvements over epochs:

2. Confusion Matrix
Displays normalized classification performance:

3. Model Predictions vs. Ground Truth
Compare the validation batch labels (ground truth annotations) with the actual predictions generated by the trained model:
How to Run This Model
You can easily download and run this model in Python using the ultralytics package.
Installation
pip install ultralytics huggingface_hub
Python Inference Code
from huggingface_hub import hf_hub_download
from ultralytics import YOLO
# 1. Download the weights from Hugging Face
model_path = hf_hub_download(repo_id="ambrosfitz/19c-newspaper-column-yolo", filename="best.pt")
# 2. Load the model
model = YOLO(model_path)
# 3. Perform detection on a newspaper page image
results = model("path_to_newspaper_page.jpg")
# 4. Display or save the segmented columns
results[0].show()
# results[0].save(filename="output.jpg")
- Downloads last month
- 70

