sentence_transformers
gradio
openai
langchain
langchain-community
pypdf
unstructured
pinecone-client
docx2txt
InstructorEmbedding