song
triloon
ยท
AI & ML interests
None yet
Recent Activity
upvoted
an
article
4 days ago
Improving Hugging Face Training Efficiency Through Packing with Flash Attention
new activity
11 months ago
1bitLLM/bitnet_b1_58-xl:Can we use this code to train model?
Organizations
models
None public yet
datasets
None public yet