aiautomationlab/wtwm-gpt2-based-mentions-detector

philippgawlik committed on Dec 1, 2022

Commit 4f3726a (1 parent: 9dfb86d)

Update

Files changed (1)

README.md +15 -8

@@ -1,25 +1,32 @@
 ---
-license: mit
 language:
   - de
 tags:
   - text-classication
 metrics:
   - precision
 widget:
-- text: "Ein Blick auf die Karte der Ertragslagen von Windkraftanlagen im Bayerischen Windatlas zeigt klar, dass gerade das südliche Allgäu  Windkraftanlagen wenig Sinn machen. Leider fehlt im Artikel ain Verweis auf deises leicht nachprüfbares Faktum."
-  example_classification: "Enthält Ansprache an Redaktion"
 ---
-# WTWM gpt2 based mentions detector
 This is a model for the task of classifying whether or not an article's comment addresses the moderation team/authors of the media house that published the article. In this prototype stage, the media houses are Bayerischer Rundfunk and Mitteldeutscher Rundfunk.
 This classification task is implemented as a binary classification into:
-* label 0: the comment holds no mention
-* label 1: the comment addresses the moderation team/authors of the media house
-For this task, we decided to use [german-gpt2](https://huggingface.co/dbmdz/german-gpt2) by MDZ of Bayerische Staatsbibliothek as a foundation model.
-**Our fine-tuned model is still a work in progress. More information to come.**

 ---
 language:
   - de
 tags:
   - text-classication
+license: mit
 metrics:
   - precision
+  - recall
+  - f1
 widget:
+- text: "Im Artikel wird leider nicht erwähnt, inwieweit und ob dadurch Natur zerstört werden muss."
 ---
+# German news title gen
+Please note that this model originates from the ["What's there, what's missing"](https://interaktiv.br.de/ai-detect-newsroom-mentions-in-comments/) collaboration of the [AI & Automation Lab of Bayerischer Rundfunk](https://www.br.de/extra/ai-automation-lab/index.html) and [Mitteldeutscher Rundfunk](https://www.mdr.de/) as well as [ida](https://idalab.de/). The collaboration took place during the [JournalismAI fellowship '22](https://www.lse.ac.uk/media-and-communications/polis/JournalismAI/Fellowship-Programme). The model presented is part of the documentation of the half-year project. The related technical framework can be found on [GitHub](https://github.com/br-data/wtwm-topic-modelling).
+## The task
 This is a model for the task of classifying whether or not an article's comment addresses the moderation team/authors of the media house that published the article. In this prototype stage, the media houses are Bayerischer Rundfunk and Mitteldeutscher Rundfunk.
 This classification task is implemented as a binary classification into:
+label 0: the comment holds no mention
+label 1: the comment addresses the moderation team/authors of the media house
+We decided to use [german-gpt2](https://huggingface.co/dbmdz/german-gpt2) by MDZ of Bayerische Staatsbibliothek as the foundation model.
+**This model is still a work in progress and might be updated in the future.**
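
The binary classification described in the README could be consumed roughly as sketched below. This is a minimal illustration and not part of the commit. It assumes the checkpoint loads through the `transformers` `text-classification` pipeline and reports the default `LABEL_0` / `LABEL_1` label names, neither of which the README confirms. The helper names (`label_to_id`, `describe`, `classify_comments`, `load_classifier`) are hypothetical.

```python
from typing import Callable, Dict, List

# Repository name as shown at the top of this commit page.
MODEL_ID = "aiautomationlab/wtwm-gpt2-based-mentions-detector"

# Label meanings as stated in the README.
LABEL_MEANINGS: Dict[int, str] = {
    0: "the comment holds no mention",
    1: "the comment addresses the moderation team/authors of the media house",
}


def label_to_id(raw_label: str) -> int:
    """Map a pipeline label such as 'LABEL_1' (or plain '1') to its integer id.

    Assumption: the checkpoint uses the default 'LABEL_<id>' naming.
    """
    return int(raw_label.rsplit("_", 1)[-1])


def describe(prediction: Dict[str, object]) -> str:
    """Turn one pipeline prediction into a readable line."""
    label_id = label_to_id(str(prediction["label"]))
    score = float(prediction["score"])
    return f"label {label_id}: {LABEL_MEANINGS[label_id]} (score {score:.2f})"


def classify_comments(
    comments: List[str],
    classifier: Callable[[List[str]], List[Dict[str, object]]],
) -> List[str]:
    """Run any classifier callable over the comments and describe each result."""
    return [describe(prediction) for prediction in classifier(comments)]


def load_classifier(model_id: str = MODEL_ID):
    """Build a Hugging Face pipeline; downloads the checkpoint on first use."""
    from transformers import pipeline  # imported lazily, only needed here

    return pipeline("text-classification", model=model_id)
```

With `transformers` installed and network access, `classify_comments([comment], load_classifier())` would print-ready describe each comment, for instance the widget text added in this commit. Passing the classifier in as an argument keeps the label handling testable without downloading the model.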