Update README.md
Browse files
README.md
CHANGED
@@ -19,4 +19,27 @@ This autoregressive model belongs to a series of rather small language models tr
|
|
19 |
| Attention heads | 8 | 8 | 16 | 16 |
|
20 |
| Embedding size | 128 | 128 | 256 | 256 |
|
21 |
| Context size | 128 | 128 | 256 | 256 |
|
22 |
-
| Vocab size | 16k | 16k | 16k | 16k |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
19 |
| Attention heads | 8 | 8 | 16 | 16 |
|
20 |
| Embedding size | 128 | 128 | 256 | 256 |
|
21 |
| Context size | 128 | 128 | 256 | 256 |
|
22 |
+
| Vocab size | 16k | 16k | 16k | 16k |
|
23 |
+
|
24 |
+
|
25 |
+
If you use this model in your research, please cite the following publication:
|
26 |
+
|
27 |
+
```
|
28 |
+
@inproceedings{bunzeck-zarriess-2024-fifty,
|
29 |
+
title = "Fifty shapes of {BL}i{MP}: syntactic learning curves in language models are not uniform, but sometimes unruly",
|
30 |
+
author = "Bunzeck, Bastian and
|
31 |
+
Zarrie{\ss}, Sina",
|
32 |
+
editor = "Qiu, Amy and
|
33 |
+
Noble, Bill and
|
34 |
+
Pagmar, David and
|
35 |
+
Maraev, Vladislav and
|
36 |
+
Ilinykh, Nikolai",
|
37 |
+
booktitle = "Proceedings of the 2024 CLASP Conference on Multimodality and Interaction in Language Learning",
|
38 |
+
month = oct,
|
39 |
+
year = "2024",
|
40 |
+
address = "Gothenburg, Sweden",
|
41 |
+
publisher = "Association for Computational Linguistics",
|
42 |
+
url = "https://aclanthology.org/2024.clasp-1.7",
|
43 |
+
pages = "39--55",
|
44 |
+
}
|
45 |
+
```
|