King Han

kingh0730

AI & ML interests

Code LLMs

Recent Activity

liked a Space 6 days ago
Manmay/tortoise-tts
liked a Space 6 days ago
hexgrad/Kokoro-TTS
liked a model 6 days ago
sesame/csm-1b
View all activity

Organizations

UC Berkeley's profile picture Live Code Bench's profile picture Skylow, Inc.'s profile picture

kingh0730's activity

upvoted an article 7 months ago
view article
Article

Introducing the LiveCodeBench Leaderboard - Holistic and Contamination-Free Evaluation of Code LLMs

By StringChaos and 6 others β€’
β€’ 15
reacted to clefourrier's post with ❀️πŸ”₯ 12 months ago
view post
Post
4767
Contamination free code evaluations with LiveCodeBench! πŸ–₯️

LiveCodeBench is a new leaderboard, which contains:
- complete code evaluations (on code generation, self repair, code execution, tests)
- my favorite feature: problem selection by publication date πŸ“…

This feature means that you can get model scores averaged only on new problems out of the training data. This means... contamination free code evals! πŸš€

Check it out!

Blog: https://huggingface.co/blog/leaderboard-livecodebench
Leaderboard: livecodebench/leaderboard

Congrats to @StringChaos @minimario @xu3kev @kingh0730 and @FanjiaYan for the super cool leaderboard!
published an article 12 months ago
view article
Article

Introducing the LiveCodeBench Leaderboard - Holistic and Contamination-Free Evaluation of Code LLMs

By StringChaos and 6 others β€’
β€’ 15