Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
1
Kenneth Moore
kennethmoore-81j
Follow
AI & ML interests
None yet
Recent Activity
reacted
to
black-yt
's
post
with š„
7 days ago
Hey all ā our ResearchClawBench leaderboard just updated š„ We let AI do real science: 40 tasks across 10 disciplines, compared to human papers. Hard example? šļø Glacier mass change ā AI must integrate 233 datasets from 35 teams, 4 methods, reproduce 6542±387 Gt ice loss vs IPCC. No toy problems. Latest leaderboard (2026-06-09) š: Agents: š„ Claude Code 21.5 (50 = match human), $5.3; š„ EvoScientist 18.8, $4.1; š„ Codex CLI 18.4, just $2.0 LLMs+Harness: š„ Claude-Opus-4.8 21.1, $4.0; š„ Claude-Opus-4.7 20.7; š„ MiniMax-M3 19.8, only $0.45; Qwen3.7-Max 18.7, $0.42, 11min š„ Claude still king, but MiniMax/Qwen/DeepSeek are crazy cheap and competitive. Expensive isn't always better. š Code & star: https://github.com/InternScience/ResearchClawBench š Website: https://internscience.github.io/ResearchClawBench-Home/ š¤ Upvote paper: https://huggingface.co/papers/2606.07591
commented
on
a paper
7 days ago
Winner-takes-all for Multivariate Probabilistic Time Series Forecasting
View all activity
Organizations
None yet
models
0
None public yet
datasets
0
None public yet