vesteinn commited on
Commit
f436cbb
1 Parent(s): 4e65856

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +29 -1
README.md CHANGED
@@ -14,7 +14,7 @@ license: agpl-3.0
14
 
15
  # IceBERT
16
 
17
- IceBERT was trained with fairseq using the RoBERTa-base architecture. The training data used is shown in the table below.
18
 
19
  | Dataset | Size | Tokens |
20
  |------------------------------------------------------|---------|--------|
@@ -26,3 +26,31 @@ IceBERT was trained with fairseq using the RoBERTa-base architecture. The traini
26
  | Data from the medical library of Landspitali | 33 MB | 5.2M |
27
  | Student theses from Icelandic universities (Skemman) | 2.2 GB | 367M |
28
  | Total | 15.8 GB | 2,664M |
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
14
 
15
  # IceBERT
16
 
17
+ This model was trained with fairseq using the RoBERTa-base architecture. It is one of many models we have trained for Icelandic, see the paper referenced below for further details. The training data used is shown in the table below.
18
 
19
  | Dataset | Size | Tokens |
20
  |------------------------------------------------------|---------|--------|
 
26
  | Data from the medical library of Landspitali | 33 MB | 5.2M |
27
  | Student theses from Icelandic universities (Skemman) | 2.2 GB | 367M |
28
  | Total | 15.8 GB | 2,664M |
29
+
30
+ ## Scitation
31
+
32
+ The model is described in this paper [https://arxiv.org/abs/2201.05601](https://arxiv.org/abs/2201.05601). Please cite the paper if you make use of the model.
33
+
34
+ ```
35
+ @article{DBLP:journals/corr/abs-2201-05601,
36
+ author = {V{\'{e}}steinn Sn{\ae}bjarnarson and
37
+ Haukur Barri S{\'{\i}}monarson and
38
+ P{\'{e}}tur Orri Ragnarsson and
39
+ Svanhv{\'{\i}}t Lilja Ing{\'{o}}lfsd{\'{o}}ttir and
40
+ Haukur P{\'{a}}ll J{\'{o}}nsson and
41
+ Vilhj{\'{a}}lmur {\TH}orsteinsson and
42
+ Hafsteinn Einarsson},
43
+ title = {A Warm Start and a Clean Crawled Corpus - {A} Recipe for Good Language
44
+ Models},
45
+ journal = {CoRR},
46
+ volume = {abs/2201.05601},
47
+ year = {2022},
48
+ url = {https://arxiv.org/abs/2201.05601},
49
+ eprinttype = {arXiv},
50
+ eprint = {2201.05601},
51
+ timestamp = {Thu, 20 Jan 2022 14:21:35 +0100},
52
+ biburl = {https://dblp.org/rec/journals/corr/abs-2201-05601.bib},
53
+ bibsource = {dblp computer science bibliography, https://dblp.org}
54
+ }
55
+ ```
56
+