9 16 107

Xie

Zhihui

https://zhxie.site/

zhxieml

AI & ML interests

None yet

Recent Activity

liked a dataset about 14 hours ago

LLM360/MegaMath

liked a dataset 1 day ago

nvidia/Scoring-Verifiers

liked a model 12 days ago

deepseek-ai/DeepSeek-V3-0324

View all activity

Organizations

Zhihui's activity

liked a dataset about 14 hours ago

LLM360/MegaMath

Preview • Updated about 15 hours ago • 79 • 15

liked a dataset 1 day ago

nvidia/Scoring-Verifiers

Updated 4 days ago • 10 • 3

liked a model 12 days ago

deepseek-ai/DeepSeek-V3-0324

Text Generation • Updated 9 days ago • 151k • • 2.31k

upvoted a paper 17 days ago

CapArena: Benchmarking and Analyzing Detailed Image Captioning in the LLM Era

Paper • 2503.12329 • Published 20 days ago • 24

liked 2 datasets 23 days ago

nebius/SWE-bench-extra

Viewer • Updated Mar 3 • 6.38k • 8.11k • 41

open-r1/codeforces-cots

Viewer • Updated 8 days ago • 254k • 9.9k • 127

liked a model 24 days ago

RekaAI/reka-flash-3

Updated 23 days ago • 5.27k • 350

liked a dataset 24 days ago

open-r1/codeforces

Viewer • Updated about 20 hours ago • 10k • 1.68k • 28

liked 2 Spaces about 1 month ago

Predict Memory

🧮

Calculate memory usage from model configurations

2.41k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

liked 2 datasets about 1 month ago

HuggingFaceH4/aime_2024

Viewer • Updated Jan 26 • 30 • 24.8k • 23

open-r1/OpenR1-Math-220k

Viewer • Updated Feb 18 • 450k • 47.1k • 535

New activity in Zhihui/CTRL-32B about 2 months ago

Add library_name and pipeline_tag metadata

#1 opened about 2 months ago by

nielsr

liked a dataset about 2 months ago

allenai/RLVR-GSM-MATH-IF-Mixed-Constraints

Viewer • Updated Nov 26, 2024 • 29.9k • 863 • 20

authored a paper about 2 months ago

Teaching Language Models to Critique via Reinforcement Learning

Paper • 2502.03492 • Published Feb 5 • 24

upvoted a paper about 2 months ago

CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction

Paper • 2502.07316 • Published Feb 11 • 47

liked a model about 2 months ago

Zhihui/CTRL-32B

Text Generation • Updated Feb 18 • 92 • 4

updated a model about 2 months ago

Zhihui/CTRL-32B

Text Generation • Updated Feb 18 • 92 • 4

upvoted a collection about 2 months ago

UI Agent

Collection

a collection of algorithmic agents for user interfaces/interactions, program synthesis, and robotics • 341 items • Updated 1 day ago • 50

upvoted a paper about 2 months ago

Teaching Language Models to Critique via Reinforcement Learning

Paper • 2502.03492 • Published Feb 5 • 24