Visual Question Answering
Transformers
TensorBoard
Safetensors
internvl_chat
feature-extraction
custom_code

how to realize streaming output

#18
by qixiang1111 - opened

want to calculate time to first token, can streaming output support?

截屏2024-05-12 11.00.27.png

This comment has been hidden

Sign up or log in to comment