Avoid printing incomplete bytes

by fikavec - opened May 16, 2023

base: refs/heads/main

←

from: refs/pr/1

Discussion Files changed

-9

fikavec

May 16, 2023

This comment has been hidden

fikavec

May 16, 2023

for token in generator:
# detokenize function may return incomplete bytes (not for printing), because one letter can consist of several tokens - see what we can make with it there: https://abetlen.github.io/llama-cpp-python/#__codelineno-0-673
# errors="ignore" - just simple trick to avoid exception
token_str = model.detokenize([token]).decode("utf-8", errors="ignore")

IlyaGusev

Owner May 16, 2023

Fixed here: https://github.com/IlyaGusev/rulm/commit/5b31a0b5d6c0b564366025f42bc50ffc3b2c0b89

fikavec changed pull request status to closed May 18, 2023

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment