Unified Reward Model for Multimodal Understanding and Generation Paper ā¢ 2503.05236 ā¢ Published 27 days ago ā¢ 112
view article Article A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality about 1 month ago ā¢ 71
C4AI Aya Vision Collection Aya Vision is a state-of-the-art family of vision models that brings multimodal capabilities to 23 languages. ā¢ 5 items ā¢ Updated 30 days ago ā¢ 68
CHASE Collection Generate challenging synthetic data to evaluate LLMs ā¢ 5 items ā¢ Updated Feb 21 ā¢ 4
How to Get Your LLM to Generate Challenging Problems for Evaluation Paper ā¢ 2502.14678 ā¢ Published Feb 20 ā¢ 17
MMTEB: Massive Multilingual Text Embedding Benchmark Paper ā¢ 2502.13595 ā¢ Published Feb 19 ā¢ 33
From Tools to Teammates: Evaluating LLMs in Multi-Session Coding Interactions Paper ā¢ 2502.13791 ā¢ Published Feb 19 ā¢ 5
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training Paper ā¢ 2501.17161 ā¢ Published Jan 28 ā¢ 118
The Lessons of Developing Process Reward Models in Mathematical Reasoning Paper ā¢ 2501.07301 ā¢ Published Jan 13 ā¢ 98
METAGENE-1: Metagenomic Foundation Model for Pandemic Monitoring Paper ā¢ 2501.02045 ā¢ Published Jan 3 ā¢ 21
EnerVerse: Envisioning Embodied Future Space for Robotics Manipulation Paper ā¢ 2501.01895 ā¢ Published Jan 3 ā¢ 55
LiveBench: A Challenging, Contamination-Free LLM Benchmark Paper ā¢ 2406.19314 ā¢ Published Jun 27, 2024 ā¢ 23
ProcessBench: Identifying Process Errors in Mathematical Reasoning Paper ā¢ 2412.06559 ā¢ Published Dec 9, 2024 ā¢ 82
PepTune: De Novo Generation of Therapeutic Peptides with Multi-Objective-Guided Discrete Diffusion Paper ā¢ 2412.17780 ā¢ Published Dec 23, 2024 ā¢ 4
Bridging the Data Provenance Gap Across Text, Speech and Video Paper ā¢ 2412.17847 ā¢ Published Dec 19, 2024 ā¢ 9
Falcon3 Collection Falcon3 family of Open Foundation Models is a set of pretrained and instruct LLMs ranging from 1B to 10B parameters. ā¢ 40 items ā¢ Updated Feb 13 ā¢ 83