Directly distill from Llama, the finetune in DPO
Junxiong Wang
JunxiongWang
AI & ML interests
Attention Free Model / Subquadratic Language Models
Recent Activity
updated
a model
5 days ago
JunxiongWang/Llama3.2-Mamba-3B-dpo
updated
a model
5 days ago
JunxiongWang/Llama3.2-Mamba-3B-distill
updated
a model
5 days ago
JunxiongWang/Llama3.2-Mamba2-3B-distill