34

Chong Ruan

Chester111

AI & ML interests

AGI & LLM

Chester111's activity

authored a paper 26 days ago

Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention

Paper • 2502.11089 • Published 28 days ago • 143

authored a paper about 2 months ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 348

New activity in deepseek-ai/DeepSeek-R1 about 2 months ago

Update README.md

#16 opened about 2 months ago by

New activity in deepseek-ai/DeepSeek-R1-Zero about 2 months ago

Update README.md

#12 opened about 2 months ago by

New activity in deepseek-ai/DeepSeek-R1 about 2 months ago

Tag Model as MIT license

#12 opened about 2 months ago by

New activity in deepseek-ai/DeepSeek-R1-Zero about 2 months ago

add library name & auto-tag

#10 opened about 2 months ago by

New activity in deepseek-ai/DeepSeek-R1-Distill-Qwen-32B about 2 months ago

add library tag for better code snippets and tags

#3 opened about 2 months ago by

New activity in deepseek-ai/DeepSeek-R1-Distill-Llama-8B about 2 months ago

add library tag for better code snippets and tags

#1 opened about 2 months ago by

New activity in deepseek-ai/DeepSeek-R1-Distill-Llama-70B about 2 months ago

add library tag for better code snippets and tags

#3 opened about 2 months ago by

New activity in deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B about 2 months ago

add library tag for better code snippets and tags

#1 opened about 2 months ago by

New activity in deepseek-ai/DeepSeek-R1-Distill-Qwen-7B about 2 months ago

add library tag for better code snippets and tags

#1 opened about 2 months ago by

New activity in deepseek-ai/DeepSeek-R1-Distill-Qwen-14B about 2 months ago

add library tag for better code snippets and tags

#1 opened about 2 months ago by

updated a collection about 2 months ago

DeepSeek-R1

8 items • Updated Jan 21 • 576