Fino1: On the Transferability of Reasoning Enhanced LLMs to Finance Paper • 2502.08127 • Published 6 days ago • 46
ARR: Question Answering with Large Language Models via Analyzing, Retrieving, and Reasoning Paper • 2502.04689 • Published 11 days ago • 7
Step Back to Leap Forward: Self-Backtracking for Boosting Reasoning of Language Models Paper • 2502.04404 • Published 12 days ago • 18
Steel-LLM: From Scratch to Open Source -- A Personal Journey in Building a Chinese-Centric LLM Paper • 2502.06635 • Published 7 days ago • 4
Token Assorted: Mixing Latent and Text Tokens for Improved Language Model Reasoning Paper • 2502.03275 • Published 12 days ago • 12
Large Language Model Guided Self-Debugging Code Generation Paper • 2502.02928 • Published 13 days ago • 10
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published 13 days ago • 179
Beyond Prompt Content: Enhancing LLM Performance via Content-Format Integrated Prompt Optimization Paper • 2502.04295 • Published 11 days ago • 11
BOLT: Bootstrap Long Chain-of-Thought in Language Models without Distillation Paper • 2502.03860 • Published 12 days ago • 22
Learning to Generate Unit Tests for Automated Debugging Paper • 2502.01619 • Published 14 days ago • 4
The Jumping Reasoning Curve? Tracking the Evolution of Reasoning Performance in GPT-[n] and o-[n] Models on Multimodal Puzzles Paper • 2502.01081 • Published 15 days ago • 13
MM-IQ: Benchmarking Human-Like Abstraction and Reasoning in Multimodal Models Paper • 2502.00698 • Published 16 days ago • 23
ACECODER: Acing Coder RL via Automated Test-Case Synthesis Paper • 2502.01718 • Published 14 days ago • 27
Can LLMs Maintain Fundamental Abilities under KV Cache Compression? Paper • 2502.01941 • Published 14 days ago • 13