langchain openai streamlit pinecone-client chromadb unstructured pdf2image pytesseract tiktoken pymupdf tabulate sentence-transformers llama-cpp-python huggingface-hub python-docx altair<5