Text Generation
Transformers
Safetensors
English
gla
bailin28 commited on
Commit
af49087
·
verified ·
1 Parent(s): 4b9828d

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +10 -0
README.md ADDED
@@ -0,0 +1,10 @@
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ datasets:
4
+ - cerebras/SlimPajama-627B
5
+ language:
6
+ - en
7
+ ---
8
+
9
+
10
+ This checkpoint of the 1.3B GLA model used in the paper [Gated Linear Attention](https://arxiv.org/abs/2312.06635). See the model and loading script in this [repo](https://github.com/berlino/gated_linear_attention).