mistralai requests beautifulsoup4 docx2txt python-docx textract openpyxl==3.0.10 sentence-transformers