Only the first token is received, not the entire response.

#84
by venkteshm - opened

@ysharma
The following code returns only the first token of the response.
Am I missing something basic here? Could you please look into it?

from gradio_client import Client

client = Client("https://ysharma-explore-llamav2-with-tgi.hf.space/")
result = client.predict(
    "Capital of India",  # str in 'Message' Textbox component
    api_name="/chat"
)
print(result)

Result returned: ' The'
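
A possible explanation, offered as an assumption and not confirmed in this thread: if the `/chat` endpoint streams its reply as a generator, `predict()` in some `gradio_client` versions may hand back only the first streamed update, which would match the `' The'` above. A minimal sketch of a workaround is to use `client.submit()` instead, wait for the job to finish, and take the last entry of `job.outputs()`. The helper names `last_streamed_output` and `ask`, the polling interval, and the timeout are hypothetical and only for illustration.

```python
import time


def last_streamed_output(job, poll_interval=0.1, timeout=120.0):
    """Wait for a streaming job to finish and return its final update.

    `job` is expected to behave like a gradio_client Job: `done()` reports
    completion and `outputs()` lists every update received so far.
    """
    deadline = time.monotonic() + timeout
    while not job.done():
        if time.monotonic() > deadline:
            raise TimeoutError("job did not finish within the timeout")
        time.sleep(poll_interval)
    outputs = job.outputs()
    # Assumption: each update holds the text generated so far,
    # so the last update is the complete response.
    return outputs[-1] if outputs else None


def ask(prompt):
    """Send `prompt` to the Space from this thread and return the full reply."""
    # Imported here so the helper above can be used without gradio_client.
    from gradio_client import Client

    client = Client("https://ysharma-explore-llamav2-with-tgi.hf.space/")
    # submit() returns a Job immediately instead of blocking like predict().
    job = client.submit(prompt, api_name="/chat")
    return last_streamed_output(job)
```

With this in place, `print(ask("Capital of India"))` would stand in for the `predict()` call above. Whether it returns the whole answer depends on the installed `gradio_client` version and on how the Space yields its output.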
