Track, rank and evaluate open LLMs' CoT quality
VLMEvalKit Evaluation Results Collection
Track, rank and evaluate open LLMs and chatbots