Are We Ready for Multi-Image Reasoning? Launching VHs: The Visual Haystacks Benchmark! By davidchan • about 9 hours ago • 1
Taxonomy Completion with Embedding Quantization and an LLM-based Pipeline: A Case Study in Computational Linguistics By dcarpintero • 1 day ago • 3
ZebraLogic: Benchmarking the Logical Reasoning Ability of Language Models By yuchenlin • 3 days ago • 12
Create a Diffusers-compatible Dataset for Stable Diffusion Fine-tuning By nroggendorff • 4 days ago • 6
Introducing HelpingAI-Flash: Emotionally Intelligent Conversational AI for All Devices By Abhaykoul • 4 days ago • 2
Announcing Finance Commons and the Bad Data Toolbox: Pioneering Open Data and Advanced Document Processing By Pclanglais • 5 days ago • 15
Are We Ready for Multi-Image Reasoning? Launching VHs: The Visual Haystacks Benchmark! By davidchan • about 9 hours ago • 1
Taxonomy Completion with Embedding Quantization and an LLM-based Pipeline: A Case Study in Computational Linguistics By dcarpintero • 1 day ago • 3
ZebraLogic: Benchmarking the Logical Reasoning Ability of Language Models By yuchenlin • 3 days ago • 12
Create a Diffusers-compatible Dataset for Stable Diffusion Fine-tuning By nroggendorff • 4 days ago • 6
Introducing HelpingAI-Flash: Emotionally Intelligent Conversational AI for All Devices By Abhaykoul • 4 days ago • 2
Announcing Finance Commons and the Bad Data Toolbox: Pioneering Open Data and Advanced Document Processing By Pclanglais • 5 days ago • 15