Xiaoxia Wu
xiaoxiawu123
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 1 month ago
APOLLO: SGD-like Memory, AdamW-level Performance
liked
a model
4 months ago
meta-llama/Llama-3.2-90B-Vision-Instruct
authored
a paper
4 months ago
GRIN: GRadient-INformed MoE