Holmes: Benchmark the Linguistic Competence of Language Models Paper • 2404.18923 • Published Apr 29
JuStRank: Benchmarking LLM Judges for System Ranking Paper • 2412.09569 • Published 10 days ago • 18
Bamba Collection Collection of Bamba models, built on a hybrid Mamba2 architecture and trained on open data • 8 items • Updated 4 days ago • 16