Edit model card

m0_flat_ner_ocr_cmbert_io

Introduction

This model is a fine-tuned verion from Jean-Baptiste/camembert-ner for nested NER task on a nested NER Paris trade directories dataset.

Dataset

Abbreviation Description
O Outside of a named entity
PER Person or company name
ACT Person or company professional activity
TITRE Distinction
LOC Street name
CARDINAL Street number
FT Geographical feature

Experiment parameter

  • Pretrained-model : Jean-Baptiste/camembert-ner
  • Dataset : noisy (Pero OCR)
  • Tagging format : IO
  • Recognised entities : All (flat entities)

Load model from the HuggingFace

from transformers import AutoTokenizer, AutoModelForTokenClassification

tokenizer = AutoTokenizer.from_pretrained("nlpso/m0_flat_ner_ocr_cmbert_io")
model = AutoModelForTokenClassification.from_pretrained("nlpso/m0_flat_ner_ocr_cmbert_io")
Downloads last month
2
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Dataset used to train nlpso/m0_flat_ner_ocr_cmbert_io