Nice and fully accurate. Excellent job. Thanks!
Peter Kruger PRO
PeterKruger
AI & ML interests
Neural networks (since 1993), LLMs, AI-based financial analysis, LLM Benchmarks
Recent Activity
commented on their article about 18 hours ago
Escape the Benchmark Trap: AutoBench – the Collective-LLM-as-a-Judge System for Evaluating AI models (ASI-Ready!)
new activity 2 days ago
AutoBench/AutoBench_1.0:Comparing with mt-bench
Organizations
PeterKruger's activity

commented on
Escape the Benchmark Trap: AutoBench – the Collective-LLM-as-a-Judge System for Evaluating AI models (ASI-Ready!)
about 18 hours ago
Comparing with mt-bench
#3 opened 2 days ago by PeterKruger


posted an update 2 days ago
Post
397
AutoBench 1.0 is live. The Collective-LLM-as-a-Judge model benchmark
https://huggingface.co/blog/PeterKruger/autobench
Pool LLM bias
#2 opened 2 days ago by PeterKruger

Prompt analysis should be better discussed
#1 opened 2 days ago by PeterKruger


upvoted an article 2 days ago
Article
Escape the Benchmark Trap: AutoBench – the Collective-LLM-as-a-Judge System for Evaluating AI models (ASI-Ready!)
published an article 2 days ago
Article
Escape the Benchmark Trap: AutoBench – the Collective-LLM-as-a-Judge System for Evaluating AI models (ASI-Ready!)