Rishabh Singh

lulzx

AI & ML interests

Information retrieval

Recent Activity

liked a model about 9 hours ago

kyutai/mimi

liked a Space about 9 hours ago

Bradarr/csm-1b

liked a Space about 9 hours ago

sesame/csm-1b

lulzx's activity

upvoted a paper about 17 hours ago

Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models

Paper • 2503.09573 • Published 2 days ago • 41

upvoted an article 1 day ago

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

Article • 3 days ago • 233

upvoted an article 2 days ago

Open R1: Update #3

Article • and 9 others • 3 days ago • 207

upvoted a collection 8 days ago

Q-Filters

Collection

Pre-computed Q-Filters for efficient KV cache compression. • 15 items • Updated 11 days ago • 6

upvoted a paper 16 days ago

SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution

Paper • 2502.18449 • Published 17 days ago • 68

upvoted an article 16 days ago

FastRTC: The Real-Time Communication Library for Python

Article • 18 days ago • 143

upvoted a paper 22 days ago

Text2World: Benchmarking Large Language Models for Symbolic World Model Generation

Paper • 2502.13092 • Published 24 days ago • 12

upvoted an article 23 days ago

Grok 3 ai : Best AI model now!

Article • 23 days ago • 7

upvoted a paper 25 days ago

MM-RLHF: The Next Step Forward in Multimodal LLM Alignment

Paper • 2502.10391 • Published 28 days ago • 32

upvoted a collection 26 days ago

DeepSeek-R1-abliterated

Collection

7 items • Updated Jan 31 • 93

upvoted a paper 29 days ago

TransMLA: Multi-head Latent Attention Is All You Need

Paper • 2502.07864 • Published about 1 month ago • 47

upvoted a collection 29 days ago

Teuken-7B-v0.4

Collection

OpenGPT-X Teuken 7B models trained on 4 trillion tokens • 4 items • Updated Dec 6, 2024 • 3

upvoted a paper 30 days ago

MetaChain: A Fully-Automated and Zero-Code Framework for LLM Agents

Paper • 2502.05957 • Published Feb 9 • 16

upvoted 3 papers about 1 month ago

The Curse of Depth in Large Language Models

Paper • 2502.05795 • Published Feb 9 • 36

Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling

Paper • 2502.06703 • Published Feb 10 • 142

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8 • 263

upvoted an article about 1 month ago

π0 and π0-FAST: Vision-Language-Action Models for General Robot Control

Article • Feb 4 • 113

upvoted a paper 4 months ago

HtmlRAG: HTML is Better Than Plain Text for Modeling Retrieved Knowledge in RAG Systems

Paper • 2411.02959 • Published Nov 5, 2024 • 68

upvoted an article 5 months ago

Model2Vec: Distill a Small Fast Model from any Sentence Transformer

Article • and 1 other • Oct 14, 2024 • 77

upvoted a paper 5 months ago

RevisEval: Improving LLM-as-a-Judge via Response-Adapted References

Paper • 2410.05193 • Published Oct 7, 2024 • 13