ZebraLogic: Benchmarking the Logical Reasoning Ability of Language Models By yuchenlin • about 5 hours ago • 15
Are We Ready for Multi-Image Reasoning? Launching VHs: The Visual Haystacks Benchmark! By davidchan • 4 days ago • 2
Taxonomy Completion with Embedding Quantization and an LLM-based Pipeline: A Case Study in Computational Linguistics By dcarpintero • 5 days ago • 3
Create a Diffusers-compatible Dataset for Stable Diffusion Fine-tuning By nroggendorff • 8 days ago • 7
Introducing HelpingAI-Flash: Emotionally Intelligent Conversational AI for All Devices By Abhaykoul • 8 days ago • 2
ZebraLogic: Benchmarking the Logical Reasoning Ability of Language Models By yuchenlin • about 5 hours ago • 15
Are We Ready for Multi-Image Reasoning? Launching VHs: The Visual Haystacks Benchmark! By davidchan • 4 days ago • 2
Taxonomy Completion with Embedding Quantization and an LLM-based Pipeline: A Case Study in Computational Linguistics By dcarpintero • 5 days ago • 3
Create a Diffusers-compatible Dataset for Stable Diffusion Fine-tuning By nroggendorff • 8 days ago • 7
Introducing HelpingAI-Flash: Emotionally Intelligent Conversational AI for All Devices By Abhaykoul • 8 days ago • 2