PyPDF2 langchain openai docsearch tiktoken texts pinecone-client