IndicConformer Collection A collection of ASR models for 22 scheduled languages of India • 24 items • Updated Mar 14 • 16
OWLS: Scaling Laws for Speech Recognition and Translation Collection 🦉 A suite of Whisper-style models from 250M to 18B parameters. Trained on up to 360K hours of data. 16k sampling rate. • 8 items • Updated May 3 • 7
view article Article ColPali: Efficient Document Retrieval with Vision Language Models 👀 By manu • Jul 5, 2024 • 281
LLM Compiler Collection Meta LLM Compiler is a state-of-the-art LLM that builds upon Code Llama with improved performance for code optimization and compiler reasoning. • 4 items • Updated Jun 27, 2024 • 151
impira/layoutlm-document-qa Document Question Answering • 0.1B • Updated Mar 18, 2023 • 18.5k • 1.12k
google/pix2struct-docvqa-base Visual Question Answering • 0.3B • Updated Dec 24, 2023 • 10.7k • 39