Yuxian Gu

t1101675

AI & ML interests

Efficient methods for language models

Recent Activity

Organizations

Conversational AI (CoAI) group from Tsinghua University's profile picture Efficient-Large-Model's profile picture MiniLLM's profile picture Data Selection's profile picture VILA / Molmo's profile picture

t1101675's activity

New activity in MiniLLM/MiniLLM-gpt2-340M 23 days ago
New activity in MiniLLM/SFT-gpt2-120M 23 days ago
New activity in MiniLLM/SFT-gpt2-760M 23 days ago
New activity in Data-Selection/PDS-470M 23 days ago
New activity in Data-Selection/PDS-160M 23 days ago
New activity in Data-Selection/PDS-1B 23 days ago

Add link to code repository

#2 opened 24 days ago by
nielsr
New activity in Data-Selection/PDS-1.7B 23 days ago
New activity in Data-Selection/BSL-1.7B 23 days ago

Add link to code

#2 opened 24 days ago by
nielsr
New activity in MiniLLM/MiniPLM-Mamba-130M 23 days ago
New activity in MiniLLM/MiniPLM-Qwen-1.2B 23 days ago

Add link to code

#1 opened 24 days ago by
nielsr
New activity in MiniLLM/Ref-Pretrain-Qwen-104M 23 days ago

Add link to code

#1 opened 24 days ago by
nielsr
New activity in MiniLLM/Pretrain-Qwen-1.2B 23 days ago

Add link to code

#1 opened 24 days ago by
nielsr
New activity in MiniLLM/Pretrain-Qwen-500M 23 days ago

No changes needed

#1 opened 24 days ago by
nielsr
New activity in MiniLLM/Pretrain-Qwen-200M 23 days ago

Add link to code

#1 opened 24 days ago by
nielsr
New activity in MiniLLM/VanillaKD-Pretrain-Qwen-500M 23 days ago

Add link to code

#1 opened 24 days ago by
nielsr