12 24 17

Huiqiang Jiang

iofu728

https://hqjiang.com/

AI & ML interests

None yet

Recent Activity

authored a paper 3 months ago

Chain-of-Model Learning for Language Model

upvoted a paper 3 months ago

Chain-of-Model Learning for Language Model

authored a paper 3 months ago

RetroInfer: A Vector-Storage Approach for Scalable Long-Context LLM Inference

View all activity

Organizations

authored a paper 3 months ago

Chain-of-Model Learning for Language Model

Paper • 2505.11820 • Published May 17 • 122

upvoted a paper 3 months ago

Chain-of-Model Learning for Language Model

Paper • 2505.11820 • Published May 17 • 122

authored a paper 3 months ago

RetroInfer: A Vector-Storage Approach for Scalable Long-Context LLM Inference

Paper • 2505.02922 • Published May 5 • 28

upvoted a paper 3 months ago

RetroInfer: A Vector-Storage Approach for Scalable Long-Context LLM Inference

Paper • 2505.02922 • Published May 5 • 28

commented a paper 3 months ago

RetroInfer: A Vector-Storage Approach for Scalable Long-Context LLM Inference

Paper • 2505.02922 • Published May 5 • 28 •

liked a model 4 months ago

Qwen/Qwen3-235B-A22B

Text Generation • 235B • Updated 19 days ago • 159k • • 1.03k

upvoted a paper 4 months ago

MMInference: Accelerating Pre-filling for Long-Context VLMs via Modality-Aware Permutation Sparse Attention

Paper • 2504.16083 • Published Apr 22 • 9

commented a paper 4 months ago

MMInference: Accelerating Pre-filling for Long-Context VLMs via Modality-Aware Permutation Sparse Attention

Paper • 2504.16083 • Published Apr 22 • 9 •

liked a model 6 months ago

moonshotai/Moonlight-16B-A3B

Text Generation • 16B • Updated Feb 26 • 11.2k • 94

upvoted a paper 7 months ago

Optimizing Large Language Model Training Using FP4 Quantization

Paper • 2501.17116 • Published Jan 28 • 38

liked a model 7 months ago

Qwen/Qwen2.5-14B-Instruct-1M

Text Generation • 15B • Updated Jan 29 • 15.3k • • 316

upvoted a paper 7 months ago

Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models

Paper • 2501.13629 • Published Jan 23 • 49

liked 2 models 7 months ago

deepseek-ai/DeepSeek-R1-Distill-Qwen-32B

Text Generation • 33B • Updated Feb 24 • 1.1M • • 1.43k

deepseek-ai/DeepSeek-R1

Text Generation • 685B • Updated Mar 27 • 793k • • 12.6k

upvoted a paper 7 months ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8 • 283

updated a dataset 8 months ago

microsoft/SCBench

Viewer • Updated Dec 24, 2024 • 922 • 1.22k • 7

upvoted a paper 8 months ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 373

authored a paper 8 months ago

SCBench: A KV Cache-Centric Analysis of Long-Context Methods

Paper • 2412.10319 • Published Dec 13, 2024 • 11

upvoted a paper 8 months ago

SCBench: A KV Cache-Centric Analysis of Long-Context Methods

Paper • 2412.10319 • Published Dec 13, 2024 • 11

commented a paper 8 months ago

SCBench: A KV Cache-Centric Analysis of Long-Context Methods

Paper • 2412.10319 • Published Dec 13, 2024 • 11 •

Huiqiang Jiang

AI & ML interests

Recent Activity

Organizations

iofu728's activity