openai
scikit-learn
numpy
gradio
python-docx
PyPDF2