	---
	license: apache-2.0
	language:
	- it
	widget:
	- text: Mi chiamo Marco Rossi, vivo a Roma e lavoro per l'Agenzia Spaziale Italiana
	  example_title: Example 1
	---

	--------------------------------------------------------------------------------------------------

	<body>
	<span class="vertical-text" style="background-color:lightgreen;border-radius: 3px;padding: 3px;"> </span>
	<br>
	<span class="vertical-text" style="background-color:orange;border-radius: 3px;padding: 3px;"> Task: Named Entity Recognition</span>
	<br>
	<span class="vertical-text" style="background-color:lightblue;border-radius: 3px;padding: 3px;"> Model: BLAZE 🔥</span>
	<br>
	<span class="vertical-text" style="background-color:tomato;border-radius: 3px;padding: 3px;"> Lang: IT</span>
	<br>
	<span class="vertical-text" style="background-color:lightgrey;border-radius: 3px;padding: 3px;"> Type: Uncased</span>
	<br>
	<span class="vertical-text" style="background-color:#CF9FFF;border-radius: 3px;padding: 3px;"> </span>
	</body>

	--------------------------------------------------------------------------------------------------

	<h3>Model description</h3>

	This is a lightweight, uncased model for the <b>Italian</b> language, fine-tuned for <b>Named Entity Recognition</b> (<b>Person</b>, <b>Location</b>, <b>Organization</b> and <b>Miscellaneous</b> classes) on the [WikiNER](https://figshare.com/articles/dataset/Learning_multilingual_named_entity_recognition_from_Wikipedia/5462500) dataset <b>[1]</b>, starting from the pre-trained <b>Blaze-IT</b> model ([blaze-it](https://huggingface.co/osiria/blaze-it)).

	<h3>Training and Performance</h3>

	The model is trained to recognize entities of 4 classes: <b>PER</b> (persons), <b>LOC</b> (locations), <b>ORG</b> (organizations) and <b>MISC</b> (miscellaneous, mainly events, products and services). It was fine-tuned on the Italian WikiNER dataset plus an additional custom dataset of manually annotated Wikipedia paragraphs.
	The model was trained for 1 epoch with a constant learning rate of 1e-5.
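The stated hyperparameters can be written down as a small configuration sketch. It assumes the Hugging Face `transformers` `Trainer` API was used, which this card does not confirm. `build_training_args` and its `output_dir` default are hypothetical names, and batch size, optimizer and any other settings are not given here, so they are left at the library defaults.

```python
# Hyperparameters stated in this card: 1 epoch, constant learning rate of 1e-5.
HYPERPARAMS = {
    "num_train_epochs": 1,
    "learning_rate": 1e-5,
    "lr_scheduler_type": "constant",  # constant schedule: no warmup, no decay
}


def build_training_args(output_dir="blaze-it-ner-finetune"):
    """Map the values above onto transformers.TrainingArguments (assumed setup)."""
    # Imported lazily so the dictionary above is usable without transformers/torch.
    from transformers import TrainingArguments

    return TrainingArguments(output_dir=output_dir, **HYPERPARAMS)
```

Calling `build_training_args()` requires `transformers` and a PyTorch installation; the dictionary alone is enough to document the run.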

	The 5-fold cross-validated performance on the test set is reported in the following table:

	| Recall | Precision | F1 |
	| ------ | ------ | ------ |
	| 89.29 | 89.84 | 89.53 |

	The metrics have been computed at the token level and then macro-averaged over the 4 classes.
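As an illustration of token-level, macro-averaged scoring, here is a minimal sketch. It is not the evaluation script used for this card: `macro_token_scores` is a hypothetical helper, and it assumes each token carries a plain class name (`PER`, `LOC`, `ORG`, `MISC`, or `O` for non-entity tokens) with any B-/I- prefixes already stripped.

```python
from collections import Counter

LABELS = ("PER", "LOC", "ORG", "MISC")


def macro_token_scores(gold, pred, labels=LABELS):
    """Token-level precision, recall and F1 per class, macro-averaged over `labels`.

    `gold` and `pred` are equal-length sequences of per-token class names;
    tokens outside any entity carry a label not in `labels` (for example "O").
    """
    if len(gold) != len(pred):
        raise ValueError("gold and pred must have the same length")

    tp, fp, fn = Counter(), Counter(), Counter()
    for g, p in zip(gold, pred):
        if g == p:
            if g in labels:
                tp[g] += 1  # correct entity token
        else:
            if p in labels:
                fp[p] += 1  # predicted class p on a token that is not p
            if g in labels:
                fn[g] += 1  # missed a token of class g

    def ratio(num, den):
        return num / den if den else 0.0

    precisions, recalls, f1s = [], [], []
    for label in labels:
        p = ratio(tp[label], tp[label] + fp[label])
        r = ratio(tp[label], tp[label] + fn[label])
        precisions.append(p)
        recalls.append(r)
        f1s.append(ratio(2 * p * r, p + r))

    # Macro average: every class counts equally, regardless of its frequency.
    n = len(labels)
    return sum(precisions) / n, sum(recalls) / n, sum(f1s) / n
```

Because F1 is averaged per class here, the macro F1 generally differs from the harmonic mean of the macro precision and recall. This is consistent with the table above, where the harmonic mean of 89.29 and 89.84 would be about 89.56 rather than the reported 89.53.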

	Then, since WikiNER is an automatically annotated (silver-standard) dataset that sometimes contains imperfect annotations, an additional fine-tuning step was performed on ~3,500 manually annotated paragraphs.

	You can try the model online using this web app: https://huggingface.co/spaces/osiria/blaze-it-demo
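To run the model locally instead, the following sketch may be a starting point. It assumes the checkpoint loads through the generic `transformers` token-classification pipeline under the Hub identifier `osiria/blaze-it-ner`; `load_ner`, `format_entities` and `demo` are hypothetical helper names, not part of the model or the library.

```python
MODEL_ID = "osiria/blaze-it-ner"  # assumed Hub identifier of this model


def load_ner(model_id=MODEL_ID):
    """Build a token-classification pipeline (downloads the model on first use)."""
    # Imported lazily; requires the transformers package and PyTorch.
    from transformers import pipeline

    # aggregation_strategy="simple" groups word pieces into whole entity spans.
    return pipeline(
        "token-classification",
        model=model_id,
        aggregation_strategy="simple",
    )


def format_entities(entities):
    """Turn aggregated pipeline output dicts into (text, class) pairs."""
    return [(e["word"], e["entity_group"]) for e in entities]


def demo():
    """Tag the widget sentence from this card and print one entity per line."""
    ner = load_ner()
    text = "Mi chiamo Marco Rossi, vivo a Roma e lavoro per l'Agenzia Spaziale Italiana"
    for word, group in format_entities(ner(text)):
        print(f"{group}\t{word}")
```

Calling `demo()` runs the example sentence through the model. Since the model is uncased, the returned `word` strings may be lowercased; the `start` and `end` offsets in each result can be used to slice the original text when the original casing matters.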

	<h3>References</h3>

	[1] https://www.sciencedirect.com/science/article/pii/S0004370212000276

	<h3>Limitations</h3>

	This model is trained mainly on Wikipedia, so it is best suited to natively digital web text written in correct, fluent language (such as wikis, web pages and news). It may perform worse on noisy text containing errors and slang (such as social media posts) and on domain-specific text (such as medical, financial or legal content).

	<h3>License</h3>

	The model is released under the <b>Apache-2.0</b> license.