---
license: apache-2.0
language:
- ca
- da
- de
- en
- es
- fr
- nl
- el
- is
- it
- 'no'
- pt
- sv
pipeline_tag: text-classification
---

# Occupational CANINE: HISCO Classification Model

## Overview

OccCANINE is a version of [CANINE](https://huggingface.co/google/canine-s) fine-tuned to convert occupational descriptions into standardized HISCO (Historical International Standard Classification of Occupations) codes automatically. It supports historical occupational data analysis with over 90% accuracy across 13 languages.

See more on GitHub: [GitHub.com/christianvedels/OccCANINE](https://github.com/christianvedels/OccCANINE)

Read the paper on arXiv: [https://arxiv.org/abs/2402.13604](https://arxiv.org/abs/2402.13604)

## Key Features

- **High Accuracy**: Over 90% accuracy, recall, and precision.
- **Multilingual Support**: Trained on 14 million description-HISCO code pairs across 13 languages.
- **Efficiency**: Rapidly processes descriptions into HISCO codes.

## Contribution and Support

Developed at the University of Southern Denmark by Christian Møller Dahl, Torben Johansen, and Christian Vedel, with contributions from various sources.