Are We Ready for Multi-Image Reasoning? Launching VHs: The Visual Haystacks Benchmark! By davidchan • 3 days ago • 2
Taxonomy Completion with Embedding Quantization and an LLM-based Pipeline: A Case Study in Computational Linguistics By dcarpintero • 5 days ago • 3
ZebraLogic: Benchmarking the Logical Reasoning Ability of Language Models By yuchenlin • 6 days ago • 14
Create a Diffusers-compatible Dataset for Stable Diffusion Fine-tuning By nroggendorff • 7 days ago • 7
Introducing HelpingAI-Flash: Emotionally Intelligent Conversational AI for All Devices By Abhaykoul • 7 days ago • 2
Are We Ready for Multi-Image Reasoning? Launching VHs: The Visual Haystacks Benchmark! By davidchan • 3 days ago • 2
Taxonomy Completion with Embedding Quantization and an LLM-based Pipeline: A Case Study in Computational Linguistics By dcarpintero • 5 days ago • 3
ZebraLogic: Benchmarking the Logical Reasoning Ability of Language Models By yuchenlin • 6 days ago • 14
Create a Diffusers-compatible Dataset for Stable Diffusion Fine-tuning By nroggendorff • 7 days ago • 7
Introducing HelpingAI-Flash: Emotionally Intelligent Conversational AI for All Devices By Abhaykoul • 7 days ago • 2