Directly distill from Llama, the finetune in DPO
Junxiong Wang
JunxiongWang
AI & ML interests
Attention Free Model / Subquadratic Language Models
Recent Activity
updated
a model
2 days ago
togethercomputer/MambaInLlama3B
updated
a model
6 days ago
togethercomputer/MambaInLlama1B
updated
a dataset
10 days ago
JunxiongWang/model_revision_max_4_closest_and_random
Organizations
Collections
7
models
6
JunxiongWang/BiGS_512_MNLI
Text Classification
•
Updated
•
5
•
1
JunxiongWang/BiGS_128_MNLI
Text Classification
•
Updated
•
8
JunxiongWang/BiGS_4096
Fill-Mask
•
Updated
•
10
•
3
JunxiongWang/BiGS_1024
Fill-Mask
•
Updated
•
6
JunxiongWang/BiGS_512
Fill-Mask
•
Updated
•
5
•
1
JunxiongWang/BiGS_128
Fill-Mask
•
Updated
•
3
datasets
5
JunxiongWang/model_revision_max_4_closest_and_random
Viewer
•
Updated
•
530k
•
42
JunxiongWang/sftdatasetv3
Viewer
•
Updated
•
12.4M
•
537
JunxiongWang/sftdataset
Viewer
•
Updated
•
11M
•
72
•
2
JunxiongWang/llama3-ultrafeedback-armorm
Viewer
•
Updated
•
61.8k
•
74
•
1
JunxiongWang/testdataset
Viewer
•
Updated
•
1M
•
83