InternLM2.5-StepProver: Advancing Automated Theorem Proving via Expert Iteration on Large-Scale LEAN Problems Paper • 2410.15700 • Published Oct 21, 2024
Euclid: Supercharging Multimodal LLMs with Synthetic High-Fidelity Visual Descriptions Paper • 2412.08737 • Published 22 days ago • 52
InternLM-Math: Open Math Large Language Models Toward Verifiable Reasoning Paper • 2402.06332 • Published Feb 9, 2024 • 18
StackSight: Unveiling WebAssembly through Large Language Models and Neurosymbolic Chain-of-Thought Decompilation Paper • 2406.04568 • Published Jun 7, 2024
Scaling Behavior for Large Language Models regarding Numeral Systems: An Example using Pythia Paper • 2409.17391 • Published Sep 25, 2024
On Retrieval Augmentation and the Limitations of Language Model Training Paper • 2311.09615 • Published Nov 16, 2023 • 1
DeLLMa: A Framework for Decision Making Under Uncertainty with Large Language Models Paper • 2402.02392 • Published Feb 4, 2024 • 5
IsoBench: Benchmarking Multimodal Foundation Models on Isomorphic Representations Paper • 2404.01266 • Published Apr 1, 2024 • 2
LiveBench: A Challenging, Contamination-Free LLM Benchmark Paper • 2406.19314 • Published Jun 27, 2024 • 20
IsoBench: Benchmarking Multimodal Foundation Models on Isomorphic Representations Paper • 2404.01266 • Published Apr 1, 2024 • 2
DeLLMa: A Framework for Decision Making Under Uncertainty with Large Language Models Paper • 2402.02392 • Published Feb 4, 2024 • 5
FACT-GPT: Fact-Checking Augmentation via Claim Matching with LLMs Paper • 2402.05904 • Published Feb 8, 2024
SlimPajama-DC: Understanding Data Combinations for LLM Training Paper • 2309.10818 • Published Sep 19, 2023 • 10
ConFiguRe: Exploring Discourse-level Chinese Figures of Speech Paper • 2209.07678 • Published Sep 16, 2022