deepm09 commited on
Commit
aa2e9d1
Β·
1 Parent(s): e51fab8

Upload 3 files

Browse files
Files changed (2) hide show
  1. app.py +7 -59
  2. resume.txt +55 -0
app.py CHANGED
@@ -3,65 +3,9 @@ from langchain.llms import OpenAI
3
  import streamlit as st
4
 
5
  # Function to load OpenAI model and get responses
6
- def get_openai_response(question):
7
  # Concatenate resume details with the user's input
8
- resume_details = """ DEEP MEHTA
9
- Rochester, NY | (720)-254-2220 | dm2399@rit.edu
10
- GitHub - https://github.com/deepmehta922000
11
- LinkedIn - https://www.linkedin.com/in/deep-mehta-038ba91aa
12
-
13
- SUMMARY
14
- Data Scientist proficient in Python, Java, R, and SQL, specializing in machine learning, deep learning, and algorithm optimization. Accomplished in developing and fine-tuning algorithms for data analysis. Eager to contribute expertise in driving data-driven decisions and solutions in a dynamic and challenging environment.
15
-
16
- EDUCATION
17
- Rochester Institute of Technology, Rochester | Master Data Science (GPA: 3.9/4.00) Expected May 2024
18
- Courses: Foundation Data Science, Advanced Statistics, Neural Networks, Relation Databases, Non-Relational Databases, Business Analytics and Intelligence
19
-
20
- SKILLS
21
- Programming Languages: Python, R, SQL, Java, C, MongoDB, XML
22
- Python Libraries: Pandas, NumPy, Keras, NLTK, SciPy, Matplotlib, Seaborn, TensorFlow, PyTorch
23
- Additional Tools: Tableau, PowerBI, RStudio, Git, AWS, Excel, Matlab, Minitab, JmpPro, OpenAI API
24
- Hard Skills: Data Analysis, Statistics, Database, Machine Learning, Deep Learning, Data Visualization, Quantitative Analysis, Big Data
25
-
26
- EXPERIENCE
27
- Lead Data Science Intern | S.A.K.E.C June 2021 – Aug 2021
28
- - Spearheaded the development of personalized course recommendations, reduced MAE by 12%, and increased precision by 15%.
29
- - Led a cross-functional team of developers in the implementation of collaborative, content-based, neural collaborative, and TF-IDF-enhanced NLP filtering algorithms reducing response time by 30%.
30
- - Employed Python libraries like Pandas and NumPy to preprocess and merge 7 datasets, reducing data discrepancies by 10%.
31
-
32
- Artificial Intelligence Intern | Digital Infrared Thermography Oct 2020 – June 2022
33
- - Developed diagnostic software using digital infrared thermography, image processing, and Neural Networks to detect breast.
34
- - Collaborated with researchers and medical professionals to implement algorithms like SVM, CNN, and Decision trees, resulting in a 92% accuracy.
35
- - Optimized image processing in MATLAB, reducing preprocessing time by 20% and enhancing overall image quality for comprehensive analysis.
36
-
37
- PROJECTS
38
- Diabetes Readmission Prediction with Machine Learning – Python June 2023
39
- - Engineered a diabetes patient readmission prediction model with an impressive precision score of 0.91 using sci-kit-learn, and XGBoost.
40
- - Trained classification models – Decision Trees, Randon Forest, KNN, and Logistic Regression using k-fold cross-validation to get the best model.
41
- - Utilized the GridSearch CV to find the best hyperparameters and tweak the model accordingly to give a 7% boost to the precision score.
42
-
43
- Data Analytics Consulting Virtual Intern | KPMG– Python July 2023
44
- - Assessed and successfully resolved more than a dozen data quality issues, leading to an improvement in data accuracy and completeness.
45
- - Demonstrated proficiency in utilizing data analytics tools, particularly Python and SQL enabling data-driven decision-making.
46
- - Designed an interactive dashboard using Tableau to showcase key performance metrics and data insights for stakeholders.
47
-
48
- British Airways: Customer Feedback Analysis & Sentiment Analysis – Python March 2023
49
- - Orchestrated end-to-end data scraping and cleaning pipelines, ensuring 100% data reliability and consistency.
50
- - Applied advanced NLP techniques like sentiment analysis, topic modeling, and text classification, extracting insights from customer feedback.
51
- - Leveraged tools and technology including Beautiful Soup, Scrapy, spaCy, seaborn, and sci-kit-learn, reducing project timelines by 20%.
52
-
53
- Liver Disease Prediction with Logistic Regression – Python Dec 2022
54
- - Employed Logistic Regression and evaluated the model on key performance metrics like precision (0.82), recall (0.94), and F1-measure (0.88).
55
- - Conducted thorough data preprocessing, applied advanced data imputation methods for handling missing values reducing data redundancy by a factor of 2.
56
- - Implemented under-sampling to address class imbalance, resulting in a more balanced dataset.
57
-
58
- PUBLICATIONS
59
- Applied Intelligence for Medical Diagnosing June 2022
60
- - Nikumbh, D. D., Sayyad, S., Mehta, D. V., Joshi, R. R., Dubey, K. S., & Matta, D. K. (2022). Applied Intelligence for Medical Diagnosing. In Handbook of Research on Applied Intelligence for Health and Clinical Informatics (pp. 44-79). IGI Global.
61
-
62
-
63
- """
64
- input_prompt = f"{resume_details}\n\nUser Question: {question}"
65
 
66
  llm = OpenAI(model_name="text-davinci-003", temperature=0.5)
67
  response = llm(input_prompt)
@@ -71,6 +15,10 @@ Applied Intelligence for Medical Diagnosing June 2022
71
  st.set_page_config(page_title="Q&A Demo")
72
  st.header("Chatbot")
73
 
 
 
 
 
74
  # Initialize history list
75
  history = []
76
 
@@ -78,7 +26,7 @@ history = []
78
  input_question = st.text_input("Input: ", key="input")
79
 
80
  # Get response for the current question
81
- response = get_openai_response(input_question)
82
 
83
  # Button to ask the question
84
  submit = st.button("Ask the question")
 
3
  import streamlit as st
4
 
5
  # Function to load OpenAI model and get responses
6
+ def get_openai_response(question, resume_summary):
7
  # Concatenate resume details with the user's input
8
+ input_prompt = f"{resume_summary}\n\nUser Question: {question}"
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
9
 
10
  llm = OpenAI(model_name="text-davinci-003", temperature=0.5)
11
  response = llm(input_prompt)
 
15
  st.set_page_config(page_title="Q&A Demo")
16
  st.header("Chatbot")
17
 
18
+ # Read resume information from the text file
19
+ with open("resume.txt", "r", encoding="utf-8") as file:
20
+ resume_summary = file.read()
21
+
22
  # Initialize history list
23
  history = []
24
 
 
26
  input_question = st.text_input("Input: ", key="input")
27
 
28
  # Get response for the current question
29
+ response = get_openai_response(input_question, resume_summary)
30
 
31
  # Button to ask the question
32
  submit = st.button("Ask the question")
resume.txt ADDED
@@ -0,0 +1,55 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ DEEP MEHTA
2
+ Rochester, NY | (720)-254-2220 | dm2399@rit.edu
3
+ GitHub - https://github.com/deepmehta922000
4
+ LinkedIn - https://www.linkedin.com/in/deep-mehta-038ba91aa
5
+
6
+ SUMMARY
7
+ Data Scientist proficient in Python, Java, R, and SQL, specializing in machine learning, deep learning, and algorithm optimization. Accomplished in developing and fine-tuning algorithms for data analysis. Eager to contribute expertise in driving data-driven decisions and solutions in a dynamic and challenging environment.
8
+
9
+ EDUCATION
10
+ Rochester Institute of Technology, Rochester | Master Data Science (GPA: 3.9/4.00) Expected May 2024
11
+ Courses: Foundation Data Science, Advanced Statistics, Neural Networks, Relation Databases, Non-Relational Databases, Business Analytics and Intelligence
12
+
13
+ SKILLS
14
+ Programming Languages: Python, R, SQL, Java, C, MongoDB, XML
15
+ Python Libraries: Pandas, NumPy, Keras, NLTK, SciPy, Matplotlib, Seaborn, TensorFlow, PyTorch
16
+ Additional Tools: Tableau, PowerBI, RStudio, Git, AWS, Excel, Matlab, Minitab, JmpPro, OpenAI API
17
+ Hard Skills: Data Analysis, Statistics, Database, Machine Learning, Deep Learning, Data Visualization, Quantitative Analysis, Big Data
18
+
19
+ EXPERIENCE
20
+ Lead Data Science Intern | S.A.K.E.C June 2021 – Aug 2021
21
+ - Spearheaded the development of personalized course recommendations, reduced MAE by 12%, and increased precision by 15%.
22
+ - Led a cross-functional team of developers in the implementation of collaborative, content-based, neural collaborative, and TF-IDF-enhanced NLP filtering algorithms reducing response time by 30%.
23
+ - Employed Python libraries like Pandas and NumPy to preprocess and merge 7 datasets, reducing data discrepancies by 10%.
24
+
25
+ Artificial Intelligence Intern | Digital Infrared Thermography Oct 2020 – June 2022
26
+ - Developed diagnostic software using digital infrared thermography, image processing, and Neural Networks to detect breast.
27
+ - Collaborated with researchers and medical professionals to implement algorithms like SVM, CNN, and Decision trees, resulting in a 92% accuracy.
28
+ - Optimized image processing in MATLAB, reducing preprocessing time by 20% and enhancing overall image quality for comprehensive analysis.
29
+
30
+ PROJECTS
31
+ Diabetes Readmission Prediction with Machine Learning – Python June 2023
32
+ - Engineered a diabetes patient readmission prediction model with an impressive precision score of 0.91 using sci-kit-learn, and XGBoost.
33
+ - Trained classification models – Decision Trees, Randon Forest, KNN, and Logistic Regression using k-fold cross-validation to get the best model.
34
+ - Utilized the GridSearch CV to find the best hyperparameters and tweak the model accordingly to give a 7% boost to the precision score.
35
+
36
+ Data Analytics Consulting Virtual Intern | KPMG– Python July 2023
37
+ - Assessed and successfully resolved more than a dozen data quality issues, leading to an improvement in data accuracy and completeness.
38
+ - Demonstrated proficiency in utilizing data analytics tools, particularly Python and SQL enabling data-driven decision-making.
39
+ - Designed an interactive dashboard using Tableau to showcase key performance metrics and data insights for stakeholders.
40
+
41
+ British Airways: Customer Feedback Analysis & Sentiment Analysis – Python March 2023
42
+ - Orchestrated end-to-end data scraping and cleaning pipelines, ensuring 100% data reliability and consistency.
43
+ - Applied advanced NLP techniques like sentiment analysis, topic modeling, and text classification, extracting insights from customer feedback.
44
+ - Leveraged tools and technology including Beautiful Soup, Scrapy, spaCy, seaborn, and sci-kit-learn, reducing project timelines by 20%.
45
+
46
+ Liver Disease Prediction with Logistic Regression – Python Dec 2022
47
+ - Employed Logistic Regression and evaluated the model on key performance metrics like precision (0.82), recall (0.94), and F1-measure (0.88).
48
+ - Conducted thorough data preprocessing, applied advanced data imputation methods for handling missing values reducing data redundancy by a factor of 2.
49
+ - Implemented under-sampling to address class imbalance, resulting in a more balanced dataset.
50
+
51
+ PUBLICATIONS
52
+ Applied Intelligence for Medical Diagnosing June 2022
53
+ - Nikumbh, D. D., Sayyad, S., Mehta, D. V., Joshi, R. R., Dubey, K. S., & Matta, D. K. (2022). Applied Intelligence for Medical Diagnosing. In Handbook of Research on Applied Intelligence for Health and Clinical Informatics (pp. 44-79). IGI Global.
54
+
55
+ Link - https://www.igi-global.com/chapter/applied-intelligence-for-medical-diagnosing/288808