---
license: apache-2.0
language:
- ca
- da
- de
- en
- es
- fr
- nl
- el
- is
- it
- 'no'
- pt
- sv
pipeline_tag: text-classification
---
# Occupational CANINE: HISCO Classification Model
## Overview
OccCANINE is a version of [CANINE](https://huggingface.co/google/canine-s) fine-tuned to convert occupational descriptions into standardized HISCO codes automatically. It supports the analysis of historical occupational data, with over 90% accuracy across 13 languages.
See more on: [GitHub.com/christianvedels/OccCANINE](https://github.com/christianvedels/OccCANINE)
Read the paper on arXiv: [https://arxiv.org/abs/2402.13604](https://arxiv.org/abs/2402.13604)
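
CANINE models read text as a sequence of Unicode code points rather than word-piece tokens, which suits short, inconsistently spelled occupational descriptions in many languages. The sketch below only illustrates that idea in plain Python. The helper name `to_code_points` is hypothetical, and the real model pipeline adds special tokens and other preprocessing that are omitted here.

```python
def to_code_points(description: str) -> list[int]:
    """Map each character of a description to its Unicode code point.

    Illustrative only: this mimics the character-level view of the input,
    not the full preprocessing used by OccCANINE.
    """
    return [ord(ch) for ch in description]


# Non-ASCII letters need no special vocabulary: each is just one code point.
danish_weaver = to_code_points("væver")
english_smith = to_code_points("smith")
```

Because every character maps to an integer directly, no language-specific vocabulary is needed to represent the input.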
## Key Features
- **High Accuracy**: Over 90% accuracy, recall, and precision.
- **Multilingual Support**: Trained on 14 million description-HISCO code pairs across 13 languages.
- **Efficiency**: Rapidly processes descriptions into HISCO codes.
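
To make the accuracy, precision, and recall figures above concrete, here is a minimal sketch of one common way to score predictions when a description can map to more than one code. The function names and the example codes are illustrative assumptions, and the paper may define these metrics differently.

```python
def score(predicted: set[str], actual: set[str]) -> dict[str, float]:
    """Score one description, treating its codes as sets."""
    overlap = len(predicted & actual)
    return {
        # Accuracy here means the predicted set matches the true set exactly.
        "accuracy": float(predicted == actual),
        # Share of predicted codes that are correct.
        "precision": overlap / len(predicted) if predicted else 0.0,
        # Share of true codes that were predicted.
        "recall": overlap / len(actual) if actual else 0.0,
    }


def average_scores(pairs: list[tuple[set[str], set[str]]]) -> dict[str, float]:
    """Average the per-description scores over a non-empty list of pairs."""
    rows = [score(predicted, actual) for predicted, actual in pairs]
    return {key: sum(row[key] for row in rows) / len(rows) for key in rows[0]}


# Hypothetical (predicted, actual) code sets; the codes are placeholders.
example_pairs = [
    ({"61110"}, {"61110"}),           # exact match
    ({"79100", "61110"}, {"79100"}),  # one extra code predicted
    ({"98135"}, {"98130"}),           # wrong code
]
result = average_scores(example_pairs)
```

Averaging per description, as done here, weights every description equally regardless of how many codes it carries.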
## Contribution and Support
Developed at the University of Southern Denmark by Christian Møller Dahl, Torben Johansen, and Christian Vedel, with contributions from various sources.