Zhimin Zhao PRO

zhiminy

AI & ML interests

LLMOps, MLOps, SE4AI, AI4SE

Recent Activity

New activity about 20 hours ago
RMT-team/babilong
New activity 2 days ago
stemdataset/stem-leaderboard
liked a Space 3 days ago
TIGER-Lab/MEGA-Bench

Organizations

Posts 2

view post
Post
1988
Hey everyone!

Our team just dropped something cool! ๐ŸŽ‰ We've published a new paper on arxiv diving into the foundation model leaderboards across different platforms. We've analyzed the content, operational workflows, and common issues of these leaderboards. From this, we came up with two new concepts: Leaderboard Operations (LBOps) and leaderboard smells.

We also put together an awesome list with nearly 300 of the latest leaderboards, development tools, and publishing organizations. You can check it out here: https://github.com/SAILResearch/awesome-foundation-model-leaderboards

If you find it useful or interesting, give us a follow or drop a comment. We'd love to hear your thoughts and get your support! โœจ

Link to the paper: https://arxiv.org/abs/2407.04065

models

None public yet

datasets

None public yet