jiakai's picture

217 809

jiakai

real-jiakai

·

https://blog.gujiakai.top

AI & ML interests

LLM && Smart QA

Recent Activity

liked a model about 11 hours ago

microsoft/MAI-DS-R1

upvoted a collection about 11 hours ago

liked a dataset about 14 hours ago

weaviate/agents

View all activity

Organizations

real-jiakai's activity

upvoted a collection about 11 hours ago

MAI-DS-R1

MAI-DS-R1 is a DeepSeek-R1 reasoning model that has been post-trained by the Microsoft AI team. • 2 items • Updated 1 day ago • 6

upvoted 2 articles about 16 hours ago

Article

Introducing HELMET

3 days ago

• 18

Article

Hugging Face to sell open-source robots thanks to Pollen Robotics acquisition 🤖

5 days ago

• 36

upvoted a paper about 19 hours ago

ColorBench: Can VLMs See and Understand the Colorful World? A Comprehensive Benchmark for Color Perception, Reasoning, and Robustness

Paper • 2504.10514 • Published 8 days ago • 43

upvoted 2 papers 1 day ago

Have we unified image generation and understanding yet? An empirical study of GPT-4o's image generation ability

Paper • 2504.08003 • Published 9 days ago • 43

TextArena

Paper • 2504.11442 • Published 3 days ago • 23

upvoted a paper 3 days ago

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Paper • 2504.10479 • Published 4 days ago • 220

upvoted a collection 3 days ago

InternVL3

34 items • Updated 1 day ago • 49

upvoted an article 3 days ago

Article

4M Models Scanned: Protect AI + Hugging Face 6 Months In

5 days ago

• 24

upvoted a collection 3 days ago

DataDecide

A suite of models, data, and evals over 25 corpora, 14 sizes, and 3 seeds to measure how accurately small experiments predict rankings at large scale. • 358 items • Updated 2 days ago • 11

upvoted a paper 4 days ago

SQL-R1: Training Natural Language to SQL Reasoning Model By Reinforcement Learning

Paper • 2504.08600 • Published 7 days ago • 24

upvoted a collection 4 days ago

GLM-4-0414

GLM-4-0414 series model • 8 items • Updated 4 days ago • 99

upvoted a paper 4 days ago

OLMoTrace: Tracing Language Model Outputs Back to Trillions of Training Tokens

Paper • 2504.07096 • Published 9 days ago • 69

upvoted a collection 4 days ago

Orpheus Multilingual Research Release

Beta Release of multilingual models. • 12 items • Updated 8 days ago • 76

upvoted a collection 5 days ago

Skywork-OR1

Skywork Open Reasoner 1 • 8 items • Updated 5 days ago • 21

upvoted 2 papers 7 days ago

Kimi-VL Technical Report

Paper • 2504.07491 • Published 9 days ago • 113

DeepSeek-R1 Thoughtology: Let's <think> about LLM Reasoning

Paper • 2504.07128 • Published 17 days ago • 79

upvoted an article 8 days ago

Article

Hugging Face and Cloudflare Partner to Make Real-Time Speech and Video Seamless with FastRTC

10 days ago

• 20

upvoted a paper 9 days ago

OmniSVG: A Unified Scalable Vector Graphics Generation Model

Paper • 2504.06263 • Published 10 days ago • 143