When can we have the training code as illustrated in the paper.
11
#5 opened 2 months ago
by
Shamane
![](https://cdn-avatars.huggingface.co/v1/production/uploads/654aa1d86167ff03f70e32f9/9ewJj75jtgxSN7cg7xFT5.jpeg)
why not include Qwen1.5-MoE-A2.7B in the table?
1
#4 opened 2 months ago
by
J22
how to use it, any quick start guide
2
#3 opened 2 months ago
by
XavierShawn
Question about MoA
#2 opened 2 months ago
by
TechxGenus
![](https://cdn-avatars.huggingface.co/v1/production/uploads/65097423e64ee37323bd2def/PTEwbfafNI88gdX1VVmIn.jpeg)
Dataset?
3
#1 opened 2 months ago
by
0xbitches