Spaces:
Sleeping
Sleeping
Upload 3 files
Browse files- app.py +7 -59
- resume.txt +55 -0
app.py
CHANGED
|
@@ -3,65 +3,9 @@ from langchain.llms import OpenAI
|
|
| 3 |
import streamlit as st
|
| 4 |
|
| 5 |
# Function to load OpenAI model and get responses
|
| 6 |
-
def get_openai_response(question):
|
| 7 |
# Concatenate resume details with the user's input
|
| 8 |
-
|
| 9 |
-
Rochester, NY | (720)-254-2220 | dm2399@rit.edu
|
| 10 |
-
GitHub - https://github.com/deepmehta922000
|
| 11 |
-
LinkedIn - https://www.linkedin.com/in/deep-mehta-038ba91aa
|
| 12 |
-
|
| 13 |
-
SUMMARY
|
| 14 |
-
Data Scientist proficient in Python, Java, R, and SQL, specializing in machine learning, deep learning, and algorithm optimization. Accomplished in developing and fine-tuning algorithms for data analysis. Eager to contribute expertise in driving data-driven decisions and solutions in a dynamic and challenging environment.
|
| 15 |
-
|
| 16 |
-
EDUCATION
|
| 17 |
-
Rochester Institute of Technology, Rochester | Master Data Science (GPA: 3.9/4.00) Expected May 2024
|
| 18 |
-
Courses: Foundation Data Science, Advanced Statistics, Neural Networks, Relation Databases, Non-Relational Databases, Business Analytics and Intelligence
|
| 19 |
-
|
| 20 |
-
SKILLS
|
| 21 |
-
Programming Languages: Python, R, SQL, Java, C, MongoDB, XML
|
| 22 |
-
Python Libraries: Pandas, NumPy, Keras, NLTK, SciPy, Matplotlib, Seaborn, TensorFlow, PyTorch
|
| 23 |
-
Additional Tools: Tableau, PowerBI, RStudio, Git, AWS, Excel, Matlab, Minitab, JmpPro, OpenAI API
|
| 24 |
-
Hard Skills: Data Analysis, Statistics, Database, Machine Learning, Deep Learning, Data Visualization, Quantitative Analysis, Big Data
|
| 25 |
-
|
| 26 |
-
EXPERIENCE
|
| 27 |
-
Lead Data Science Intern | S.A.K.E.C June 2021 β Aug 2021
|
| 28 |
-
- Spearheaded the development of personalized course recommendations, reduced MAE by 12%, and increased precision by 15%.
|
| 29 |
-
- Led a cross-functional team of developers in the implementation of collaborative, content-based, neural collaborative, and TF-IDF-enhanced NLP filtering algorithms reducing response time by 30%.
|
| 30 |
-
- Employed Python libraries like Pandas and NumPy to preprocess and merge 7 datasets, reducing data discrepancies by 10%.
|
| 31 |
-
|
| 32 |
-
Artificial Intelligence Intern | Digital Infrared Thermography Oct 2020 β June 2022
|
| 33 |
-
- Developed diagnostic software using digital infrared thermography, image processing, and Neural Networks to detect breast.
|
| 34 |
-
- Collaborated with researchers and medical professionals to implement algorithms like SVM, CNN, and Decision trees, resulting in a 92% accuracy.
|
| 35 |
-
- Optimized image processing in MATLAB, reducing preprocessing time by 20% and enhancing overall image quality for comprehensive analysis.
|
| 36 |
-
|
| 37 |
-
PROJECTS
|
| 38 |
-
Diabetes Readmission Prediction with Machine Learning β Python June 2023
|
| 39 |
-
- Engineered a diabetes patient readmission prediction model with an impressive precision score of 0.91 using sci-kit-learn, and XGBoost.
|
| 40 |
-
- Trained classification models β Decision Trees, Randon Forest, KNN, and Logistic Regression using k-fold cross-validation to get the best model.
|
| 41 |
-
- Utilized the GridSearch CV to find the best hyperparameters and tweak the model accordingly to give a 7% boost to the precision score.
|
| 42 |
-
|
| 43 |
-
Data Analytics Consulting Virtual Intern | KPMGβ Python July 2023
|
| 44 |
-
- Assessed and successfully resolved more than a dozen data quality issues, leading to an improvement in data accuracy and completeness.
|
| 45 |
-
- Demonstrated proficiency in utilizing data analytics tools, particularly Python and SQL enabling data-driven decision-making.
|
| 46 |
-
- Designed an interactive dashboard using Tableau to showcase key performance metrics and data insights for stakeholders.
|
| 47 |
-
|
| 48 |
-
British Airways: Customer Feedback Analysis & Sentiment Analysis β Python March 2023
|
| 49 |
-
- Orchestrated end-to-end data scraping and cleaning pipelines, ensuring 100% data reliability and consistency.
|
| 50 |
-
- Applied advanced NLP techniques like sentiment analysis, topic modeling, and text classification, extracting insights from customer feedback.
|
| 51 |
-
- Leveraged tools and technology including Beautiful Soup, Scrapy, spaCy, seaborn, and sci-kit-learn, reducing project timelines by 20%.
|
| 52 |
-
|
| 53 |
-
Liver Disease Prediction with Logistic Regression β Python Dec 2022
|
| 54 |
-
- Employed Logistic Regression and evaluated the model on key performance metrics like precision (0.82), recall (0.94), and F1-measure (0.88).
|
| 55 |
-
- Conducted thorough data preprocessing, applied advanced data imputation methods for handling missing values reducing data redundancy by a factor of 2.
|
| 56 |
-
- Implemented under-sampling to address class imbalance, resulting in a more balanced dataset.
|
| 57 |
-
|
| 58 |
-
PUBLICATIONS
|
| 59 |
-
Applied Intelligence for Medical Diagnosing June 2022
|
| 60 |
-
- Nikumbh, D. D., Sayyad, S., Mehta, D. V., Joshi, R. R., Dubey, K. S., & Matta, D. K. (2022). Applied Intelligence for Medical Diagnosing. In Handbook of Research on Applied Intelligence for Health and Clinical Informatics (pp. 44-79). IGI Global.
|
| 61 |
-
|
| 62 |
-
|
| 63 |
-
"""
|
| 64 |
-
input_prompt = f"{resume_details}\n\nUser Question: {question}"
|
| 65 |
|
| 66 |
llm = OpenAI(model_name="text-davinci-003", temperature=0.5)
|
| 67 |
response = llm(input_prompt)
|
|
@@ -71,6 +15,10 @@ Applied Intelligence for Medical Diagnosing June 2022
|
|
| 71 |
st.set_page_config(page_title="Q&A Demo")
|
| 72 |
st.header("Chatbot")
|
| 73 |
|
|
|
|
|
|
|
|
|
|
|
|
|
| 74 |
# Initialize history list
|
| 75 |
history = []
|
| 76 |
|
|
@@ -78,7 +26,7 @@ history = []
|
|
| 78 |
input_question = st.text_input("Input: ", key="input")
|
| 79 |
|
| 80 |
# Get response for the current question
|
| 81 |
-
response = get_openai_response(input_question)
|
| 82 |
|
| 83 |
# Button to ask the question
|
| 84 |
submit = st.button("Ask the question")
|
|
|
|
| 3 |
import streamlit as st
|
| 4 |
|
| 5 |
# Function to load OpenAI model and get responses
|
| 6 |
+
def get_openai_response(question, resume_summary):
|
| 7 |
# Concatenate resume details with the user's input
|
| 8 |
+
input_prompt = f"{resume_summary}\n\nUser Question: {question}"
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 9 |
|
| 10 |
llm = OpenAI(model_name="text-davinci-003", temperature=0.5)
|
| 11 |
response = llm(input_prompt)
|
|
|
|
| 15 |
st.set_page_config(page_title="Q&A Demo")
|
| 16 |
st.header("Chatbot")
|
| 17 |
|
| 18 |
+
# Read resume information from the text file
|
| 19 |
+
with open("resume.txt", "r", encoding="utf-8") as file:
|
| 20 |
+
resume_summary = file.read()
|
| 21 |
+
|
| 22 |
# Initialize history list
|
| 23 |
history = []
|
| 24 |
|
|
|
|
| 26 |
input_question = st.text_input("Input: ", key="input")
|
| 27 |
|
| 28 |
# Get response for the current question
|
| 29 |
+
response = get_openai_response(input_question, resume_summary)
|
| 30 |
|
| 31 |
# Button to ask the question
|
| 32 |
submit = st.button("Ask the question")
|
resume.txt
ADDED
|
@@ -0,0 +1,55 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
DEEP MEHTA
|
| 2 |
+
Rochester, NY | (720)-254-2220 | dm2399@rit.edu
|
| 3 |
+
GitHub - https://github.com/deepmehta922000
|
| 4 |
+
LinkedIn - https://www.linkedin.com/in/deep-mehta-038ba91aa
|
| 5 |
+
|
| 6 |
+
SUMMARY
|
| 7 |
+
Data Scientist proficient in Python, Java, R, and SQL, specializing in machine learning, deep learning, and algorithm optimization. Accomplished in developing and fine-tuning algorithms for data analysis. Eager to contribute expertise in driving data-driven decisions and solutions in a dynamic and challenging environment.
|
| 8 |
+
|
| 9 |
+
EDUCATION
|
| 10 |
+
Rochester Institute of Technology, Rochester | Master Data Science (GPA: 3.9/4.00) Expected May 2024
|
| 11 |
+
Courses: Foundation Data Science, Advanced Statistics, Neural Networks, Relation Databases, Non-Relational Databases, Business Analytics and Intelligence
|
| 12 |
+
|
| 13 |
+
SKILLS
|
| 14 |
+
Programming Languages: Python, R, SQL, Java, C, MongoDB, XML
|
| 15 |
+
Python Libraries: Pandas, NumPy, Keras, NLTK, SciPy, Matplotlib, Seaborn, TensorFlow, PyTorch
|
| 16 |
+
Additional Tools: Tableau, PowerBI, RStudio, Git, AWS, Excel, Matlab, Minitab, JmpPro, OpenAI API
|
| 17 |
+
Hard Skills: Data Analysis, Statistics, Database, Machine Learning, Deep Learning, Data Visualization, Quantitative Analysis, Big Data
|
| 18 |
+
|
| 19 |
+
EXPERIENCE
|
| 20 |
+
Lead Data Science Intern | S.A.K.E.C June 2021 β Aug 2021
|
| 21 |
+
- Spearheaded the development of personalized course recommendations, reduced MAE by 12%, and increased precision by 15%.
|
| 22 |
+
- Led a cross-functional team of developers in the implementation of collaborative, content-based, neural collaborative, and TF-IDF-enhanced NLP filtering algorithms reducing response time by 30%.
|
| 23 |
+
- Employed Python libraries like Pandas and NumPy to preprocess and merge 7 datasets, reducing data discrepancies by 10%.
|
| 24 |
+
|
| 25 |
+
Artificial Intelligence Intern | Digital Infrared Thermography Oct 2020 β June 2022
|
| 26 |
+
- Developed diagnostic software using digital infrared thermography, image processing, and Neural Networks to detect breast.
|
| 27 |
+
- Collaborated with researchers and medical professionals to implement algorithms like SVM, CNN, and Decision trees, resulting in a 92% accuracy.
|
| 28 |
+
- Optimized image processing in MATLAB, reducing preprocessing time by 20% and enhancing overall image quality for comprehensive analysis.
|
| 29 |
+
|
| 30 |
+
PROJECTS
|
| 31 |
+
Diabetes Readmission Prediction with Machine Learning β Python June 2023
|
| 32 |
+
- Engineered a diabetes patient readmission prediction model with an impressive precision score of 0.91 using sci-kit-learn, and XGBoost.
|
| 33 |
+
- Trained classification models β Decision Trees, Randon Forest, KNN, and Logistic Regression using k-fold cross-validation to get the best model.
|
| 34 |
+
- Utilized the GridSearch CV to find the best hyperparameters and tweak the model accordingly to give a 7% boost to the precision score.
|
| 35 |
+
|
| 36 |
+
Data Analytics Consulting Virtual Intern | KPMGβ Python July 2023
|
| 37 |
+
- Assessed and successfully resolved more than a dozen data quality issues, leading to an improvement in data accuracy and completeness.
|
| 38 |
+
- Demonstrated proficiency in utilizing data analytics tools, particularly Python and SQL enabling data-driven decision-making.
|
| 39 |
+
- Designed an interactive dashboard using Tableau to showcase key performance metrics and data insights for stakeholders.
|
| 40 |
+
|
| 41 |
+
British Airways: Customer Feedback Analysis & Sentiment Analysis β Python March 2023
|
| 42 |
+
- Orchestrated end-to-end data scraping and cleaning pipelines, ensuring 100% data reliability and consistency.
|
| 43 |
+
- Applied advanced NLP techniques like sentiment analysis, topic modeling, and text classification, extracting insights from customer feedback.
|
| 44 |
+
- Leveraged tools and technology including Beautiful Soup, Scrapy, spaCy, seaborn, and sci-kit-learn, reducing project timelines by 20%.
|
| 45 |
+
|
| 46 |
+
Liver Disease Prediction with Logistic Regression β Python Dec 2022
|
| 47 |
+
- Employed Logistic Regression and evaluated the model on key performance metrics like precision (0.82), recall (0.94), and F1-measure (0.88).
|
| 48 |
+
- Conducted thorough data preprocessing, applied advanced data imputation methods for handling missing values reducing data redundancy by a factor of 2.
|
| 49 |
+
- Implemented under-sampling to address class imbalance, resulting in a more balanced dataset.
|
| 50 |
+
|
| 51 |
+
PUBLICATIONS
|
| 52 |
+
Applied Intelligence for Medical Diagnosing June 2022
|
| 53 |
+
- Nikumbh, D. D., Sayyad, S., Mehta, D. V., Joshi, R. R., Dubey, K. S., & Matta, D. K. (2022). Applied Intelligence for Medical Diagnosing. In Handbook of Research on Applied Intelligence for Health and Clinical Informatics (pp. 44-79). IGI Global.
|
| 54 |
+
|
| 55 |
+
Link - https://www.igi-global.com/chapter/applied-intelligence-for-medical-diagnosing/288808
|