本身就支持流式输出设置 stream = True即可model.chat(tokenizer, messages, stream=True)
· Sign up or log in to comment