Collection • ViDoRe Benchmark • Benchmark for document retrieval using visual features, introduced in "ColPali: Efficient Document Retrieval with Vision Language Models" • 10 items • Updated Jun 18 • 9
Article • Powerful ASR + diarization + speculative decoding with Hugging Face Inference Endpoints • May 1 • 66
Article • seemore: Implement a Vision Language Model from Scratch • By AviSoori1x • Jun 23 • 61