openai
streamlit
PyPDF2
azure-identity
streamlit-option-menu
streamlit-navigation-bar
streamlit-modal
docx2txt
pdfplumber
pdfminer.six
langchain
langchain_community
faiss-gpu
faiss-cpu
tiktoken
pdf2image
llama-index
pypdf
numpy
pymupdf
diff-match-patch
PorterStemmer
base