m. bou.'s picture

1 1

m. bou.

ka00ri

·

AI & ML interests

None yet

Recent Activity

liked a Space 10 days ago

AtlaAI/judge-arena

upvoted an article about 1 month ago

Judge Arena: Benchmarking LLMs as Evaluators

authored a paper about 1 year ago

Elo Uncovered: Robustness and Best Practices in Language Model Evaluation

View all activity

Organizations

None yet

ka00ri's activity

upvoted an article about 1 month ago

Article

Judge Arena: Benchmarking LLMs as Evaluators

Nov 19, 2024

• 56