Addition is All You Need for Energy-efficient Language Models • arXiv:2410.00907 • Published Oct 1, 2024
Transformers are SSMs: Generalized Models and Efficient Algorithms Through Structured State Space Duality • arXiv:2405.21060 • Published May 31, 2024
Kangaroo: Lossless Self-Speculative Decoding via Double Early Exiting • arXiv:2404.18911 • Published Apr 29, 2024
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits • arXiv:2402.17764 • Published Feb 27, 2024
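For orientation on the 1.58-bit entry above: the BitNet b1.58 paper quantizes each weight matrix to ternary values {-1, 0, +1} using an absmean scale. The NumPy sketch below is illustrative only (the function name is mine, not from the paper's code) and shows just the quantize/dequantize step, not the quantization-aware training the paper relies on.

```python
import numpy as np

def absmean_ternary_quantize(w: np.ndarray, eps: float = 1e-8):
    """Ternary (1.58-bit) weight quantization with an absmean scale,
    as described in the BitNet b1.58 paper: scale by mean |w|, round,
    clip to {-1, 0, +1}."""
    gamma = np.abs(w).mean()                         # absmean scale of the matrix
    w_ternary = np.clip(np.round(w / (gamma + eps)), -1, 1)
    return w_ternary.astype(np.int8), gamma          # ternary weights + scale

# Usage: dequantize as w_ternary * gamma. With ternary weights, matmuls
# reduce to additions/subtractions, which is where the efficiency comes from.
w = np.random.randn(4, 4).astype(np.float32)
q, scale = absmean_ternary_quantize(w)
w_hat = q.astype(np.float32) * scale                 # approximate reconstruction
```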
GPTVQ: The Blessing of Dimensionality for LLM Quantization • arXiv:2402.15319 • Published Feb 23, 2024
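GPTVQ applies multi-dimensional vector quantization to LLM weights. The sketch below shows only generic codebook-based VQ (sub-vectors mapped to their nearest k-means centroid, assuming scikit-learn is available); it deliberately omits GPTVQ's Hessian-aware codebook construction and layer-wise update procedure, so treat it as background, not the paper's method.

```python
import numpy as np
from sklearn.cluster import KMeans

def vq_quantize_weights(w: np.ndarray, dim: int = 2, n_centroids: int = 256):
    """Illustrative vector quantization: split a weight matrix into
    `dim`-dimensional sub-vectors and replace each with its nearest
    codebook centroid. Storage becomes one small index per sub-vector
    plus a shared codebook (here ~4 bits per weight for dim=2, 256 centroids)."""
    flat = w.reshape(-1, dim)                          # sub-vectors
    km = KMeans(n_clusters=n_centroids, n_init=10, random_state=0).fit(flat)
    codebook = km.cluster_centers_                     # (n_centroids, dim)
    indices = km.labels_.astype(np.uint8)              # index per sub-vector
    w_hat = codebook[indices].reshape(w.shape)         # dequantized approximation
    return codebook, indices, w_hat

w = np.random.randn(128, 128).astype(np.float32)
codebook, idx, w_hat = vq_quantize_weights(w)
print("reconstruction MSE:", float(np.mean((w - w_hat) ** 2)))
```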
EAGLE: Speculative Sampling Requires Rethinking Feature Uncertainty • arXiv:2401.15077 • Published Jan 26, 2024
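EAGLE and Kangaroo (listed above) are both refinements of speculative decoding: a cheap draft proposes several tokens, and the full model verifies them. The sketch below shows only the greedy draft-then-verify control flow with placeholder `draft_next` / `target_next` callables (illustrative names, not from either paper); real implementations verify all drafted positions in a single batched forward pass and use a probabilistic acceptance rule when sampling.

```python
from typing import Callable, List

def speculative_decode_greedy(
    draft_next: Callable[[List[int]], int],    # cheap draft model (greedy next token)
    target_next: Callable[[List[int]], int],   # full target model (greedy next token)
    prompt: List[int],
    k: int = 4,
    max_new: int = 32,
) -> List[int]:
    """Greedy draft-then-verify loop: the draft proposes k tokens, the target
    keeps the longest agreeing prefix and supplies its own token at the first
    disagreement, so output matches pure target-model decoding."""
    tokens = list(prompt)
    while len(tokens) - len(prompt) < max_new:
        # 1) Draft k candidate tokens autoregressively with the cheap model.
        draft, ctx = [], list(tokens)
        for _ in range(k):
            t = draft_next(ctx)
            draft.append(t)
            ctx.append(t)
        # 2) Verify: accept while the target agrees, correct at first mismatch.
        for t in draft:
            expected = target_next(tokens)
            if expected == t:
                tokens.append(t)
            else:
                tokens.append(expected)
                break
        else:
            # All k drafts accepted; the target still emits one bonus token.
            tokens.append(target_next(tokens))
    return tokens

# Toy usage: draft and target agree on a trivial counting "language model".
toy = lambda ctx: (ctx[-1] + 1) % 50
print(speculative_decode_greedy(toy, toy, prompt=[0], k=4, max_new=10))
```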
FP6-LLM: Efficiently Serving Large Language Models Through FP6-Centric Algorithm-System Co-Design • arXiv:2401.14112 • Published Jan 25, 2024
The Impact of Reasoning Step Length on Large Language Models • arXiv:2401.04925 • Published Jan 10, 2024