Zhang Ruichong's picture

Zhang Ruichong

ZhangRC

·

https://www.zhihu.com/people/triangjyeddriung

Triang-jyed-driung

AI & ML interests

Mathematics (Real analysis, functional analysis, commutative algebra, etc)

Recent Activity

upvoted a paper 2 days ago

Volume estimates for unions of convex sets, and the Kakeya set conjecture in three dimensions

upvoted a paper 3 days ago

Hogwild! Inference: Parallel LLM Generation via Concurrent Attention

liked a dataset 7 days ago

gair-prox/open-web-math-pro

View all activity

Organizations

ZhangRC's activity

upvoted a paper 2 days ago

Volume estimates for unions of convex sets, and the Kakeya set conjecture in three dimensions

Paper • 2502.17655 • Published Feb 24 • 1

upvoted a paper 3 days ago

Hogwild! Inference: Parallel LLM Generation via Concurrent Attention

Paper • 2504.06261 • Published 5 days ago • 86

upvoted 2 collections 13 days ago

RWKV-7 Goose

RWKV-7 Goose related resources. • 53 items • Updated 25 days ago • 1

paper weekly

8 items • Updated 22 days ago • 1

upvoted a paper 17 days ago

Qwen2.5-Omni Technical Report

Paper • 2503.20215 • Published 18 days ago • 134

upvoted a paper 24 days ago

People who frequently use ChatGPT for writing tasks are accurate and robust detectors of AI-generated text

Paper • 2501.15654 • Published Jan 26 • 14

upvoted a collection 25 days ago

RNN

18 items • Updated 25 days ago • 4

upvoted a paper 25 days ago

xLSTM 7B: A Recurrent LLM for Fast and Efficient Inference

Paper • 2503.13427 • Published 27 days ago • 3

upvoted a collection 25 days ago

interesting architecture

14 items • Updated 5 days ago • 1

upvoted a paper 25 days ago

RWKV-7 "Goose" with Expressive Dynamic State Evolution

Paper • 2503.14456 • Published 26 days ago • 137

upvoted a collection 27 days ago

RWKV v7

9 items • Updated 27 days ago • 4

upvoted 2 collections about 1 month ago

QwQ

Qwen with Questions • 6 items • Updated Mar 6 • 93

Qwen2.5

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 46 items • Updated Feb 26 • 587

upvoted a collection about 2 months ago

🪿 RWKV7

RWKV7 models • 12 items • Updated 20 days ago • 5

upvoted a paper 2 months ago

Training Language Models for Social Deduction with Multi-Agent Reinforcement Learning

Paper • 2502.06060 • Published Feb 9 • 36

upvoted 3 papers 4 months ago

Composing Global Optimizers to Reasoning Tasks via Algebraic Objects in Neural Nets

Paper • 2410.01779 • Published Oct 2, 2024 • 2

Training Large Language Models to Reason in a Continuous Latent Space

Paper • 2412.06769 • Published Dec 9, 2024 • 82

Cautious Optimizers: Improving Training with One Line of Code

Paper • 2411.16085 • Published Nov 25, 2024 • 21

upvoted a paper 9 months ago

GoldFinch: High Performance RWKV/Transformer Hybrid with Linear Pre-Fill and Extreme KV-Cache Compression

Paper • 2407.12077 • Published Jul 16, 2024 • 57

upvoted a collection 10 months ago

Qwen2

Qwen2 language models, including pretrained and instruction-tuned models of 5 sizes, including 0.5B, 1.5B, 7B, 57B-A14B, and 72B. • 39 items • Updated Nov 28, 2024 • 361