song
triloon
·
AI & ML interests
None yet
Recent Activity
upvoted
an
article
6 days ago
Improving Hugging Face Training Efficiency Through Packing with Flash Attention
new activity
11 months ago
1bitLLM/bitnet_b1_58-xl:Can we use this code to train model?