Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
10.0
TFLOPS
6
2
2
kyle
PRO
kaikaidai
Follow
inwaves's profile picture
naman5a's profile picture
bergr7f's profile picture
4 followers
ยท
15 following
kaikaidai
AI & ML interests
None yet
Recent Activity
updated
a Space
3 days ago
AtlaAI/judge-arena
new
activity
6 days ago
AtlaAI/judge-arena:
Promotion to get more voters
posted
an
update
18 days ago
๐ Early results on the 8B evaluation model we've been training... @NinaCalvi wrote about the progress we've made this quarter towards training the best 'LLM-as-a-judge' evaluator. We've significantly improved against the baseline and are approaching state-of-the-art evaluation performance with an 8B model. Next up: training Llama-3.1-70B ๐ Here's the full article: https://www.atla-ai.com/post/evaluating-the-evaluator
View all activity
Articles
Judge Arena: Benchmarking LLMs as Evaluators
Nov 19
โข
52
Experimenting with different training objectives for an AI evaluator
Oct 31
โข
2
Organizations
kaikaidai
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
New activity in
AtlaAI/judge-arena
6 days ago
Promotion to get more voters
1
#7 opened 7 days ago by
softclone
New activity in
AtlaAI/judge-arena
19 days ago
About adding a judge model to the leaderboard
1
#6 opened 19 days ago by
wangpf3
New activity in
AtlaAI/judge-arena
24 days ago
Which models do you want to see on here?
12
#2 opened about 1 month ago by
kaikaidai
New activity in
AtlaAI/judge-arena
25 days ago
Apply for community grant: Company project (gpu and storage)
#5 opened 25 days ago by
kaikaidai
New activity in
AtlaAI/judge-arena
30 days ago
What are Meta-Llama-3.1-Instruct "Turbo" models?
3
#4 opened about 1 month ago by
m-ric
New activity in
AtlaAI/judge-arena
about 1 month ago
Push main
#1 opened about 1 month ago by
kaikaidai
Push main
#1 opened about 1 month ago by
kaikaidai