LongBench v2: Towards Deeper Understanding and Reasoning on Realistic Long-context Multitasks Paper • 2412.15204 • Published 3 days ago • 27
Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference Paper • 2412.13663 • Published 4 days ago • 93
ModernBERT Collection Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated 3 days ago • 78
Byte Latent Transformer: Patches Scale Better Than Tokens Paper • 2412.09871 • Published 10 days ago • 69
From Words to Numbers: Your Large Language Model Is Secretly A Capable Regressor When Given In-Context Examples Paper • 2404.07544 • Published Apr 11 • 19
Open-Sora Plan: Open-Source Large Video Generation Model Paper • 2412.00131 • Published 24 days ago • 32
Critical Tokens Matter: Token-Level Contrastive Estimation Enhances LLM's Reasoning Capability Paper • 2411.19943 • Published 23 days ago • 55
OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation Paper • 2412.02592 • Published 19 days ago • 20
Evaluating Language Models as Synthetic Data Generators Paper • 2412.03679 • Published 18 days ago • 43
Structured 3D Latents for Scalable and Versatile 3D Generation Paper • 2412.01506 • Published 20 days ago • 42
BlueLM-V-3B: Algorithm and System Co-Design for Multimodal Large Language Models on Mobile Devices Paper • 2411.10640 • Published Nov 16 • 44
SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 15 items • Updated 20 days ago • 195
Document Parsing Unveiled: Techniques, Challenges, and Prospects for Structured Information Extraction Paper • 2410.21169 • Published Oct 28 • 30