microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • Updated 14 days ago • 622k • 1.32k
General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model Paper • 2409.01704 • Published Sep 3, 2024 • 85