Let's talk about LLM evaluation
imgsys.org -- arena for text-guided image generation
Leaderboard for LLMs on Science Reasoning
Track, rank and evaluate open LLMs' CoT quality
View how beam search decoding works, in detail!
Jailbreak the LLM and privacy guardrails
Vote on the top TTS models!
Realtime Image/Video Gen AI Arena
Track, rank and evaluate open LLMs in Portuguese