jordimas's picture
Add contact and fixes
5fe9078
|
raw
history blame
1 kB
metadata
language:
  - ca
tags:
  - punctuation prediction
  - punctuation
datasets: softcatala/Europarl-catalan
widget:
  - text: >-
      Ara tenim ratolins de quatre mesos que no són diabètics tot i que solien
      ser-ho va afegir.
    example_title: Catalan
metrics:
  - f1

This model predicts the punctuation of Catalan language.

The model restores the following punctuation markers: "." "," "?" "-" ":"

Based on the work https://github.com/oliverguhr/fullstop-deep-punctuation-prediction

Results

The performance differs for the single punctuation markers as hyphens and colons, in many cases, are optional and can be substituted by either a comma or a full stop. The model achieves the following F1 scores for Catalan language:

Label CA
0 0.99
. 0.93
? 0.76
, 0.82
: 0.64
- 0.89
macro average 0.84

Contact

Jordi Mas jmas@softcatala.org