KV cache: compressed_kv or full key/value states?

#1
by House-99 - opened

From the tech report, at inference time the KV cache should hold compressed_kv. But in modeling_deepseek.py, I notice that the key and value states are still cached like in Llama-2. Something is wrong, right?
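For context, here is a minimal sketch of the difference in question: what a Llama-2-style cache stores per token versus what an MLA-style cache would store according to the tech report. The dimensions (`d_model`, `n_heads`, `d_head`, `d_c`) are illustrative, not DeepSeek-V2's actual hyperparameters:

```python
import torch

d_model, n_heads, d_head, d_c = 4096, 32, 128, 512

h = torch.randn(1, d_model)  # hidden state of the current token

# Llama-2-style cache: full per-head keys and values.
W_K = torch.randn(d_model, n_heads * d_head)
W_V = torch.randn(d_model, n_heads * d_head)
cached_kv = torch.cat([h @ W_K, h @ W_V], dim=-1)  # 2 * n_heads * d_head values per token

# MLA-style cache per the tech report: only the shared low-rank latent.
W_DKV = torch.randn(d_model, d_c)  # down-projection to the latent
compressed_kv = h @ W_DKV          # d_c values per token
# Per-head keys/values are re-expanded from compressed_kv by up-projections,
# which the report notes can be absorbed into the query/output projections.

print(cached_kv.numel(), "vs", compressed_kv.numel())  # 8192 vs 512
```

The released modeling_deepseek.py computes compressed_kv but then up-projects it and caches the resulting full key/value states, which matches the observation above.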


Does the file modeling_deepseek.py merely contain executable code to accompany the open-source weights, without an actual implementation of compressed KV? I've also noticed that the training implementation of DeepseekV2MoE lacks support for expert parallelism (EP).

DeepSeek org

Thank you for your interest in our work. We are aware of the challenges of implementing KV compression in the current open-source code and are actively working on it. The HuggingFace code is not as efficient as we would like, so we are developing a new open-source implementation based on vLLM for better performance. The vLLM code, including KV compression, will be released once it is ready.
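(Not the official implementation.) For readers wondering how attention can run directly over the cached latent, below is a minimal sketch of the weight-absorption idea from the tech report, assuming illustrative shapes and ignoring the decoupled RoPE key that MLA also caches:

```python
import torch

d_head, n_heads, d_c, T = 128, 32, 512, 16

q = torch.randn(n_heads, d_head)          # query heads for the new token
latents = torch.randn(T, d_c)             # cached compressed_kv for T past tokens
W_UK = torch.randn(n_heads, d_c, d_head)  # per-head key up-projection
W_UV = torch.randn(n_heads, d_c, d_head)  # per-head value up-projection

# Absorb W_UK into the query so scores are computed against the latents
# directly, without ever materializing per-head keys.
q_abs = torch.einsum("hd,hcd->hc", q, W_UK)          # (n_heads, d_c)
scores = torch.einsum("hc,tc->ht", q_abs, latents) / d_head**0.5
probs = scores.softmax(dim=-1)

# Attend over the latents first, then expand through W_UV
# (which can in turn be absorbed into the output projection).
ctx_latent = torch.einsum("ht,tc->hc", probs, latents)
out = torch.einsum("hc,hcd->hd", ctx_latent, W_UV)   # (n_heads, d_head)
```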

> Does the file modeling_deepseek.py merely contain executable code to accompany the open-source weights, without an actual implementation of compressed KV? I've also noticed that the training implementation of DeepseekV2MoE lacks support for expert parallelism (EP).

Did DeepseekV2 open-source their training implementation? I haven't found the link yet.

> Did DeepseekV2 open-source their training implementation? I haven't found the link yet.

Apologies for the confusion, I meant the implementation in modeling_deepseek.py.
