XAttention: Block Sparse Attention with Antidiagonal Scoring Paper • 2503.16428 • Published Mar 2025 • 12
🧠Reasoning datasets Collection Datasets with reasoning traces for math and code released by the community • 20 items • Updated 2 days ago • 118
LServe: Efficient Long-sequence LLM Serving with Unified Sparse Attention Paper • 2502.14866 • Published Feb 20 • 13
DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads Paper • 2410.10819 • Published Oct 14, 2024 • 7
Quest: Query-Aware Sparsity for Efficient Long-Context LLM Inference Paper • 2406.10774 • Published Jun 16, 2024 • 3
QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving Paper • 2405.04532 • Published May 7, 2024
Retrieval Head Mechanistically Explains Long-Context Factuality Paper • 2404.15574 • Published Apr 24, 2024 • 3
InfLLM: Unveiling the Intrinsic Capacity of LLMs for Understanding Extremely Long Sequences with Training-Free Memory Paper • 2402.04617 • Published Feb 7, 2024 • 4
BitDelta: Your Fine-Tune May Only Be Worth One Bit Paper • 2402.10193 • Published Feb 15, 2024 • 22