Example of using local LLMs

#2
by russellsparadox - opened

In the code you use requests.post(API_URL, headers=headers, json=payload, stream=True) to get the streaming. What if I just have GPT-J-6B that I downloaded locally. Is there a way to get streaming?

Sign up or log in to comment