File size: 787 Bytes
2c38587
 
99f7dc0
1ba06fa
 
2c38587
99f7dc0
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
9e94a82
 
 
64afbc8
 
1ba06fa
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
---
license: cc-by-4.0
base_model: benjamin/wtp-canine-s-12l
language:
- de
---

# Where's the Point? A `wtpsplit` model for Historical German

This repository hosts a model that was trained using the awesome [`wtpsplit`](https://github.com/bminixhofer/wtpsplit) library.

The model was trained on OCR'ed German books from 1819 to 1944 extracted from the [NCC](https://huggingface.co/datasets/NbAiLab/NCC) corpus.

Please note: the model is constantly updated and work in progress.

# Usage

The following snippet shows how to use this model:

```python
from wtpsplit import WtP

model = WtP(model_name_or_model="wtp-canine-s-de-hist-12l",
            hub_prefix="stefan-it")

model.split("Das ist einer schöner Tag... Wie geht es dir?",
            lang_code="de", style="digibok")
```