Ragged attention supported in vLLM

#18

by patrickvonplaten - opened 18 days ago

←

Mistral AI_ org 18 days ago

18 days ago

Will you add interleaved_sliding_window to hf config.json as well? Are we going to use this parameter going forward?

patrickvonplaten changed pull request status to merged 5 days ago

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment