FasterDecoding

community

https://github.com/FasterDecoding

Activity Feed Request to join this org

AI & ML interests

Making model inference more efficient by model-system codesign.

Recent Activity

Gsunshine authored a paper about 2 months ago

Representation Fréchet Loss for Visual Generation

tianlecai authored a paper 10 months ago

FutureX: An Advanced Live Benchmark for LLM Agents in Future Prediction

tianlecai authored a paper about 2 years ago

SnapKV: LLM Knows What You are Looking for Before Generation

View all activity

Organization Card

Community About org cards

Think deeper, decode faster

models 8

FasterDecoding/BitDelta_Mistral_combo

Updated Feb 14, 2024

FasterDecoding/medusa-1.0-vicuna-13b-v1.5

Text Generation • Updated Jan 25, 2024 • 6 • 1

FasterDecoding/medusa-1.0-vicuna-33b-v1.3

Text Generation • Updated Dec 18, 2023

FasterDecoding/medusa-1.0-zephyr-7b-beta

Text Generation • Updated Dec 18, 2023 • 344 • 1

FasterDecoding/medusa-v1.0-vicuna-7b-v1.5

Text Generation • Updated Oct 29, 2023 • 415

FasterDecoding/medusa-vicuna-33b-v1.3

Updated Sep 11, 2023 • 12 • 4

FasterDecoding/medusa-vicuna-13b-v1.3

Updated Sep 11, 2023 • 17 • 5

FasterDecoding/medusa-vicuna-7b-v1.3

Updated Sep 11, 2023 • 229 • 17

datasets 0

None public yet