filipignijic commited on
Commit
c6d13fb
1 Parent(s): 62762fd

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +24 -0
README.md ADDED
@@ -0,0 +1,24 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ datasets:
3
+ - roneneldan/TinyStories
4
+ metrics:
5
+ - babylm
6
+ ---
7
+ Basemodel: GPT-Neo
8
+
9
+ Configs:
10
+ Vocab size: 10,000
11
+ Hidden size: 512
12
+ Max position embeddings: 512
13
+ Number of layers: 2
14
+ Number of heads: 4
15
+ Window size: 256
16
+ Intermediate-size: 1024
17
+
18
+ Results:
19
+ - Task: glue
20
+ Score: 58.95
21
+ Confidence Interval: [58.27, 59.7]
22
+ - Task: blimp
23
+ Score: 57.82
24
+ Confidence Interval: [56.97, 58.68]