Yikang Shen
YikangS
AI & ML interests
None yet
Organizations
YikangS's activity
When can we have the training code as illustrated in the paper.
11
#5 opened 23 days ago
by
Shamane
why not include Qwen1.5-MoE-A2.7B in the table?
1
#4 opened 24 days ago
by
J22
Dataset?
3
#1 opened about 1 month ago
by
0xbitches
Adding `safetensors` variant of this model
#1 opened 8 months ago
by
SFconvertbot
Adding `safetensors` variant of this model
#1 opened 8 months ago
by
SFconvertbot