arxiv:2401.13660
Junxiong Wang
JunxiongWang
AI & ML interests
Attention Free Model / Subquadratic Language Models
Papers: 2
Models: 12
JunxiongWang/MambaByte_Code • Text Generation • 12
JunxiongWang/MambaByte_PG19_972M • Text Generation • 4
JunxiongWang/MambaByte_Books • Text Generation • 3
JunxiongWang/MambaByte_Stories • Text Generation • 16
JunxiongWang/MambaByte_Arxiv • Text Generation • 14 • 2
JunxiongWang/MambaByte_PG19_353M • Text Generation • 2
JunxiongWang/BiGS_512_MNLI • Text Classification • 3 • 1
JunxiongWang/BiGS_128_MNLI • Text Classification • 2
JunxiongWang/BiGS_4096 • Fill-Mask • 3 • 2
JunxiongWang/BiGS_1024 • Fill-Mask • 2
Datasets: none public yet