arxiv:2401.13660
Junxiong Wang
JunxiongWang
AI & ML interests
Attention Free Model / Subquadratic Language Models
Organizations
Papers
2
models
13
JunxiongWang/TestModel
Updated
JunxiongWang/MambaByte_Code
Text Generation
•
Updated
•
14
•
1
JunxiongWang/MambaByte_PG19_972M
Text Generation
•
Updated
•
90
JunxiongWang/MambaByte_Books
Text Generation
•
Updated
•
1
JunxiongWang/MambaByte_Stories
Text Generation
•
Updated
•
10
JunxiongWang/MambaByte_Arxiv
Text Generation
•
Updated
•
16
•
3
JunxiongWang/MambaByte_PG19_353M
Text Generation
•
Updated
•
11
JunxiongWang/BiGS_512_MNLI
Text Classification
•
Updated
•
9
•
1
JunxiongWang/BiGS_128_MNLI
Text Classification
•
Updated
•
10
JunxiongWang/BiGS_4096
Fill-Mask
•
Updated
•
11
•
2