ondfa committed
Commit e1fa83b
1 parent: 6714bd8

Update README.md

Files changed (1)
  1. README.md +11 -3
README.md CHANGED
@@ -1,3 +1,11 @@
+---
+tags:
+- cs
+- bert
+- Transformers
+- Tensorflow
+---
+
 # CZERT
 This repository keeps the trained Czert-B model for the paper [Czert – Czech BERT-like Model for Language Representation
 ](https://arxiv.org/abs/2103.13031)
@@ -39,14 +47,14 @@ We evaluate our model on two sentence level tasks:
 
 
 <!-- tokenizer = BertTokenizerFast.from_pretrained(CZERT_MODEL_PATH, strip_accents=False)
-model = TFAlbertForSequenceClassification.from_pretrained(CZERT_MODEL_PATH, num_labels=1)
+\tmodel = TFAlbertForSequenceClassification.from_pretrained(CZERT_MODEL_PATH, num_labels=1)
 
 or
 
 self.tokenizer = BertTokenizerFast.from_pretrained(CZERT_MODEL_PATH, strip_accents=False)
 self.model_encoder = AutoModelForSequenceClassification.from_pretrained(CZERT_MODEL_PATH, from_tf=True)
 -->
-
+\t
 ### Document Level Tasks
 We evaluate our model on one document-level task:
 * Multi-label Document Classification.
@@ -102,7 +110,7 @@ Comparison of F1 score achieved using pre-trained CZERT-A, CZERT-B, mBERT, Pavlo
 
 | | mBERT | Pavlov | Albert-random | Czert-A | Czert-B | dep-based | gold-dep |
 |:------:|:----------:|:----------:|:-------------:|:----------:|:----------:|:---------:|:--------:|
-| span | 78.547 ± 0.110 | 79.333 ± 0.080 | 51.365 ± 0.423 | 72.254 ± 0.172 | **81.861 ± 0.102** | \- | \- |
+| span | 78.547 ± 0.110 | 79.333 ± 0.080 | 51.365 ± 0.423 | 72.254 ± 0.172 | **81.861 ± 0.102** | \\- | \\- |
 | syntax | 90.226 ± 0.224 | 90.492 ± 0.040 | 80.747 ± 0.131 | 80.319 ± 0.054 | **91.462 ± 0.062** | 85.19 | 89.52 |
 
 SRL results – the dep columns are evaluated with labelled F1 from the CoNLL 2009 evaluation script; the other columns are evaluated with the same span F1 score used for the NER evaluation. For more information, see [the paper](https://arxiv.org/abs/2103.13031).
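
For context, the commented-out snippet touched by this commit corresponds to the usual `transformers` loading pattern sketched below. This is a minimal, hedged example rather than the authors' exact code: it assumes `transformers` and `torch` are installed, and that `CZERT_MODEL_PATH` points to a locally downloaded Czert-B checkpoint (the `./czert-b` path is a hypothetical placeholder).

```python
# Minimal sketch of the README's commented-out loading pattern (assumptions noted above).
from transformers import AutoModelForSequenceClassification, BertTokenizerFast

CZERT_MODEL_PATH = "./czert-b"  # hypothetical placeholder for a local Czert-B checkpoint

# strip_accents=False keeps Czech diacritics intact during tokenization.
tokenizer = BertTokenizerFast.from_pretrained(CZERT_MODEL_PATH, strip_accents=False)

# from_tf=True converts the TensorFlow checkpoint to PyTorch weights at load time,
# as in the README's second variant. The sequence-classification head is randomly
# initialized here and only becomes meaningful after fine-tuning.
model = AutoModelForSequenceClassification.from_pretrained(CZERT_MODEL_PATH, from_tf=True)

inputs = tokenizer("Toto je ukázková česká věta.", return_tensors="pt")  # sample Czech input
logits = model(**inputs).logits
print(logits.shape)  # (1, num_labels)
```

The `from_tf=True` route mirrors the README's PyTorch variant; the `TFAlbertForSequenceClassification` line changed by this commit appears to target the ALBERT-based Czert-A variant instead.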