Tang

Pingjie

AI & ML interests

None yet

Recent Activity

upvoted an article 1 day ago

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

liked a Space 14 days ago

vidore/vidore-leaderboard

upvoted an article 15 days ago

Open-source DeepResearch – Freeing our search agents

Organizations

None yet

Pingjie's activity

upvoted an article 1 day ago

Article

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

2 days ago

• 225

liked a Space 14 days ago

120

Vidore Leaderboard

🥇

Display Visual Document Retrieval leaderboard

upvoted an article 15 days ago

Article

Open-source DeepResearch – Freeing our search agents

Feb 4

• 1.16k

upvoted an article 21 days ago

Article

SigLIP 2: A better multilingual vision language encoder

21 days ago

• 134

liked a Space 21 days ago

2.24k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

upvoted a paper 21 days ago

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published 23 days ago • 164

upvoted a collection 22 days ago

Qwen2.5-VL

Collection

Vision-language model series based on Qwen2.5 • 8 items • Updated 18 days ago • 396

upvoted an article 29 days ago

Article

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

Jan 23

• 154

upvoted 2 papers about 1 month ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 203

Chain-of-Retrieval Augmented Generation

Paper • 2501.14342 • Published Jan 24 • 52

upvoted 2 articles about 1 month ago

Article

Open-R1: Update #1

Feb 2

• 295

Article

Open-R1: a fully open reproduction of DeepSeek-R1

Jan 28

• 807

upvoted a paper about 2 months ago

MiniMax-01: Scaling Foundation Models with Lightning Attention

Paper • 2501.08313 • Published Jan 14 • 276

upvoted a paper 2 months ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8 • 263

upvoted a paper 3 months ago

Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters

Paper • 2408.03314 • Published Aug 6, 2024 • 57

liked a Space 3 months ago

535

Scaling test-time compute

📈

Enhance math problem solving by scaling test-time compute

upvoted a collection 3 months ago

ModernBERT

Collection

Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated Dec 19, 2024 • 141

upvoted 3 papers 3 months ago

OmniDocBench: Benchmarking Diverse PDF Document Parsing with Comprehensive Annotations

Paper • 2412.07626 • Published Dec 10, 2024 • 22

LongVU: Spatiotemporal Adaptive Compression for Long Video-Language Understanding

Paper • 2410.17434 • Published Oct 22, 2024 • 28

PaliGemma 2: A Family of Versatile VLMs for Transfer

Paper • 2412.03555 • Published Dec 4, 2024 • 129