Mike Smith

Smith42

AI & ML interests

AI applied to big observational datasets

Recent Activity

Organizations

UniverseTBD, Aspia Space, Major TOM, Multimodal Universe

Smith42's activity

liked a Space 4 days ago
New activity in Smith42/astroPT 7 months ago

2100M Param Model

#2 opened 7 months ago by RJRoberts
New activity in Smith42/galaxies 7 months ago
reacted to shumingma's post with 🚀 9 months ago
The Era of 1-bit LLMs: Training Tips, Code and FAQ

https://github.com/microsoft/unilm/blob/master/bitnet/The-Era-of-1-bit-LLMs__Training_Tips_Code_FAQ.pdf

We present details and tips for training 1-bit LLMs. We also provide additional experiments and results that were not previously reported, along with responses to questions regarding the paper "The-Era-of-1-bit-LLM". Finally, we include the official PyTorch implementation of BitNet (b1.58 and b1) for future research and development of 1-bit LLMs.
  • 2 replies
updated a Space 9 months ago