Model Card for text-complexity-classification

The model classifies German-language texts into four language complexity classes:

  • easy language / leichte Sprache
  • simple language / einfache Sprache
  • everyday language / Alltagssprache
  • special language / Fachsprache

The model was trained on a corpus of over 300,000 texts from the language categories listed above. Freely available websites served as sources, and thematic diversity was taken into account when selecting them.

Evaluation

  • F1: 0.982
  • Precision: 0.981
  • Recall: 0.983
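
As a quick sanity check, the three reported values are consistent with F1 being the harmonic mean of precision and recall. The sketch below assumes that all three scores were aggregated in the same way (the card does not say whether they are micro, macro, or weighted averages); `harmonic_f1` is a hypothetical helper name, not part of any library.

```python
def harmonic_f1(precision: float, recall: float) -> float:
    """Return the harmonic mean of precision and recall (the standard F1 definition)."""
    if precision + recall == 0:
        return 0.0  # avoid division by zero when both scores are 0
    return 2 * precision * recall / (precision + recall)


# Values taken from the Evaluation list above.
reported_precision = 0.981
reported_recall = 0.983

f1 = harmonic_f1(reported_precision, reported_recall)
print(round(f1, 3))  # 0.982, matching the reported F1
```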

How to use

>>> from transformers import pipeline
>>> classifier = pipeline(model="krupper/text-complexity-classification")
>>> classifier("Bei Kleinkindern unter 2 Jahren liegen nur begrenzte Erfahrungen zur Pharmakokinetik vor.")
[{'label': 'special_language', 'score': 0.999923825263977}]
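
To classify several texts at once, the same pipeline can be called with a list of strings. The following is a minimal sketch, not part of the original card: `classify_texts`, `load_classifier`, and `main` are hypothetical helper names, and the second example sentence is made up for illustration. It assumes the pipeline returns one dict with `label` and `score` per input text, as in the output above.

```python
from typing import Callable, Dict, List, Tuple

MODEL_ID = "krupper/text-complexity-classification"


def load_classifier():
    """Load the pipeline as in the snippet above (downloads the model on first use)."""
    # Imported lazily so classify_texts can be used and tested without transformers.
    from transformers import pipeline

    return pipeline(model=MODEL_ID)


def classify_texts(
    classifier: Callable[[List[str]], List[Dict]],
    texts: List[str],
) -> List[Tuple[str, str, float]]:
    """Pair each input text with its predicted label and score."""
    predictions = classifier(texts)  # assumed: one {'label': ..., 'score': ...} per text
    return [(text, pred["label"], pred["score"]) for text, pred in zip(texts, predictions)]


def main() -> None:
    classifier = load_classifier()
    texts = [
        "Bei Kleinkindern unter 2 Jahren liegen nur begrenzte Erfahrungen zur Pharmakokinetik vor.",
        "Das ist ein Haus.",  # illustrative sentence, not from the original card
    ]
    for text, label, score in classify_texts(classifier, texts):
        print(f"{label} ({score:.3f}): {text}")
```

Calling `main()` prints one line per text. Inputs longer than the model's maximum sequence length may need to be shortened or truncated first.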

DOI: https://doi.org/10.57967/hf/0131

Model size: 110M params (Safetensors; tensor types: I64, F32)