Reading list - a ron-wolf Collection

ron-wolf 's Collections

Reading list

updated about 13 hours ago

No More Adam: Learning Rate Scaling at Initialization is All You Need

Paper • 2412.11768 • Published Dec 16, 2024 • 41
TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks

Paper • 2412.14161 • Published Dec 18, 2024 • 50
HiRED: Attention-Guided Token Dropping for Efficient Inference of High-Resolution Vision-Language Models in Resource-Constrained Environments

Paper • 2408.10945 • Published Aug 20, 2024 • 11
PDFTriage: Question Answering over Long, Structured Documents

Paper • 2309.08872 • Published Sep 16, 2023 • 53
Compressed Chain of Thought: Efficient Reasoning Through Dense Representations

Paper • 2412.13171 • Published Dec 17, 2024 • 31
The Matrix Calculus You Need For Deep Learning

Paper • 1802.01528 • Published Feb 5, 2018
A Modern Self-Referential Weight Matrix That Learns to Modify Itself

Paper • 2202.05780 • Published Feb 11, 2022
Recurrent Memory Transformer

Paper • 2207.06881 • Published Jul 14, 2022 • 1
How many words does ChatGPT know? The answer is ChatWords

Paper • 2309.16777 • Published Sep 28, 2023 • 1