Jonathan Mamou

jmamou

Recent Activity

upvoted a paper about 2 months ago

FastDraft: How to Train Your Draft

upvoted an article 3 months ago

Faster Assisted Generation with Dynamic Speculation

new activity 3 months ago

huggingface/documentation-images:Create dynamic_speculation_lookahead/

Articles

Universal Assisted Generation: Faster Decoding with Any Assistant Model

Faster Assisted Generation with Dynamic Speculation

Accelerate StarCoder with 🤗 Optimum Intel on Xeon: Q8/Q4 and Speculative Decoding

jmamou's activity

upvoted a paper about 2 months ago

FastDraft: How to Train Your Draft

Paper • 2411.11055 • Published Nov 17, 2024 • 10

upvoted an article 3 months ago

Faster Assisted Generation with Dynamic Speculation

Article • Oct 8, 2024 • 44

upvoted a paper 5 months ago

RAG Foundry: A Framework for Enhancing LLMs for Retrieval Augmented Generation

Paper • 2408.02545 • Published Aug 5, 2024 • 37

upvoted 2 papers 8 months ago

Accelerating Speculative Decoding using Dynamic Speculation Length

Paper • 2405.04304 • Published May 7, 2024 • 2

Distributed Speculative Inference of Large Language Models

Paper • 2405.14105 • Published May 23, 2024 • 16