bbunzeck commited on
Commit
31b1001
1 Parent(s): b18b303

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +24 -1
README.md CHANGED
@@ -19,4 +19,27 @@ This autoregressive model belongs to a series of rather small language models tr
19
  | Attention heads | 8 | 8 | 16 | 16 |
20
  | Embedding size | 128 | 128 | 256 | 256 |
21
  | Context size | 128 | 128 | 256 | 256 |
22
- | Vocab size | 16k | 16k | 16k | 16k |
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
19
  | Attention heads | 8 | 8 | 16 | 16 |
20
  | Embedding size | 128 | 128 | 256 | 256 |
21
  | Context size | 128 | 128 | 256 | 256 |
22
+ | Vocab size | 16k | 16k | 16k | 16k |
23
+
24
+
25
+ If you use this model in your research, please cite the following publication:
26
+
27
+ ```
28
+ @inproceedings{bunzeck-zarriess-2024-fifty,
29
+ title = "Fifty shapes of {BL}i{MP}: syntactic learning curves in language models are not uniform, but sometimes unruly",
30
+ author = "Bunzeck, Bastian and
31
+ Zarrie{\ss}, Sina",
32
+ editor = "Qiu, Amy and
33
+ Noble, Bill and
34
+ Pagmar, David and
35
+ Maraev, Vladislav and
36
+ Ilinykh, Nikolai",
37
+ booktitle = "Proceedings of the 2024 CLASP Conference on Multimodality and Interaction in Language Learning",
38
+ month = oct,
39
+ year = "2024",
40
+ address = "Gothenburg, Sweden",
41
+ publisher = "Association for Computational Linguistics",
42
+ url = "https://aclanthology.org/2024.clasp-1.7",
43
+ pages = "39--55",
44
+ }
45
+ ```