gradio
similarities
sentencepiece
textgen
markdown
PyPDF2
python-docx
pandas
protobuf
cpm-kernels
loguru