Important note The inference in HuggingFace won't work, because there is a missing pre-processing step.