with torch.no_grad(): outputs = model.generate(**inputs, **gen_kwargs) outputs = outputs[:, inputs['input_ids'].shape[1]:] print(tokenizer.decode(outputs[0]))
It only output the description of the image. How can I get the bounding box?
Thanks in advace!
· Sign up or log in to comment