Junxiong Wang PRO
JunxiongWang
AI & ML interests
Attention Free Model / Subquadratic Language Models
Recent Activity
published
a model
9 days ago
JunxiongWang/open_instruct_dev
published
a model
9 days ago
JunxiongWang/mamba_0_875_sft
published
a model
9 days ago
JunxiongWang/llama3_mamba_0_5_sft
Organizations
Collections
8
models
6
JunxiongWang/MambaByte_Stories
Text Generation
•
Updated
•
43
•
1
JunxiongWang/MambaByte_Arxiv
Text Generation
•
Updated
•
25
•
3
JunxiongWang/MambaByte_PG19_353M
Text Generation
•
Updated
•
53
JunxiongWang/MambaByte_Books
Text Generation
•
Updated
•
37
•
2
JunxiongWang/MambaByte_Code
Text Generation
•
Updated
•
21
•
2
JunxiongWang/MambaByte_PG19_972M
Text Generation
•
Updated
•
31
datasets
12
JunxiongWang/qwen1b_it_math
Viewer
•
Updated
•
19.1M
•
36
JunxiongWang/test_math
Viewer
•
Updated
•
89.1k
•
186
JunxiongWang/FineMathV4
Viewer
•
Updated
•
6.7M
•
90
JunxiongWang/model_revision_max_4_closest_and_random
Viewer
•
Updated
•
530k
•
118
JunxiongWang/sftdatasetv4
Viewer
•
Updated
•
4.96M
•
184
JunxiongWang/sftdatasetv3
Viewer
•
Updated
•
12.4M
•
480
JunxiongWang/sftdatasetv2
Viewer
•
Updated
•
11.8M
•
153
JunxiongWang/sftdataset
Viewer
•
Updated
•
11M
•
464
•
2
JunxiongWang/llama3-ultrafeedback-armorm
Viewer
•
Updated
•
61.8k
•
268
•
1
JunxiongWang/gemma2_sftdataset
Viewer
•
Updated
•
11M
•
169