indexify-extractor-sdk
sentencepiece
pdf2image
marker-pdf
easyocr
pypdf
PyMuPDF
accelerate
bitsandbytes
peft
transformers
numpy
pydantic
pydantic-settings
unstructured[pdf]