Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach Paper • 2502.05171 • Published 12 days ago • 109
LiveBench: A Challenging, Contamination-Free LLM Benchmark Paper • 2406.19314 • Published Jun 27, 2024 • 23
Transformers Can Do Arithmetic with the Right Embeddings Paper • 2405.17399 • Published May 27, 2024 • 52
ODIN: Disentangled Reward Mitigates Hacking in RLHF Paper • 2402.07319 • Published Feb 11, 2024 • 14
Spotting LLMs With Binoculars: Zero-Shot Detection of Machine-Generated Text Paper • 2401.12070 • Published Jan 22, 2024 • 44
Perspectives on the State and Future of Deep Learning -- 2023 Paper • 2312.09323 • Published Dec 7, 2023 • 8
Battle of the Backbones: A Large-Scale Comparison of Pretrained Models across Computer Vision Tasks Paper • 2310.19909 • Published Oct 30, 2023 • 21
Bring Your Own Data! Self-Supervised Evaluation for Large Language Models Paper • 2306.13651 • Published Jun 23, 2023 • 15
InstructZero: Efficient Instruction Optimization for Black-Box Large Language Models Paper • 2306.03082 • Published Jun 5, 2023 • 5
Understanding and Mitigating Copying in Diffusion Models Paper • 2305.20086 • Published May 31, 2023 • 3
Tree-Ring Watermarks: Fingerprints for Diffusion Images that are Invisible and Robust Paper • 2305.20030 • Published May 31, 2023 • 8