Luis Rodrigues commited on
Commit
3cd0212
1 Parent(s): 7266825

adds notebook to scrap jobs from job street

Browse files
.gitignore CHANGED
@@ -20,6 +20,7 @@ logs/**
20
  # also keep the example dataset
21
  !data/01_raw/*.csv
22
  !data/02_processed/**
 
23
 
24
  # also keep all .gitkeep files
25
  !.gitkeep
 
20
  # also keep the example dataset
21
  !data/01_raw/*.csv
22
  !data/02_processed/**
23
+ !data/02_processed/**/**
24
 
25
  # also keep all .gitkeep files
26
  !.gitkeep
conf/params.yml CHANGED
@@ -1,22 +1,32 @@
1
  # Data
2
  ingestion_data_dir: ["data", "02_processed"]
3
  ingestion_metadata_dir: ["data", "02_processed", "metadata"]
 
4
 
5
  # Embeddings
6
  embedding_model_name: "sentence-transformers/all-mpnet-base-v2"
7
  embedding_dir: ["data", "03_indexed"]
8
 
9
  # app
10
- APP_TITLE: "ResumeWorth"
11
- APP_DESCRIPTION: |
12
- Discover Your Market Value!
 
 
 
 
 
 
13
 
14
- Unlock the true value of your experience with ResumeWorth.
15
 
16
- Simply enter your job title, upload your resume, and let our advanced AI analyze your professional background.
17
-
18
- In moments, you'll receive an estimated salary, tailored to your unique skills and experience.
19
- # Empower your job search with insights that put you a step ahead.
20
- JOB_TITLES: ["Data Engineer", "Data Scientist", "Data Analyst", "Machine Learning Engineer"]
21
- HOST: "0.0.0.0"
22
- PORT: 7860
 
 
 
 
1
  # Data
2
  ingestion_data_dir: ["data", "02_processed"]
3
  ingestion_metadata_dir: ["data", "02_processed", "metadata"]
4
+ job_titles: ["Data Engineer", "Data Scientist", "Data Analyst", "Machine Learning Engineer"]
5
 
6
  # Embeddings
7
  embedding_model_name: "sentence-transformers/all-mpnet-base-v2"
8
  embedding_dir: ["data", "03_indexed"]
9
 
10
  # app
11
+ app_config:
12
+ host: "0.0.0.0"
13
+ port: 7860
14
+ app_backend:
15
+ min_resume_size: 1000
16
+ app_frontend:
17
+ title: "ResumeWorth"
18
+ description: |
19
+ Discover Your Market Value!
20
 
21
+ Unlock the true value of your experience with ResumeWorth.
22
 
23
+ Simply enter your job title, upload your resume, and let our advanced AI analyze your professional background.
24
+
25
+ In moments, you'll receive an estimated salary, tailored to your unique skills and experience.
26
+ # Empower your job search with insights that put you a step ahead.
27
+ jobs: ["Data Engineer", "Data Scientist", "Data Analyst", "Machine Learning Engineer"]
28
+ messages:
29
+ job_title_not_found: "You must choose one of the roles in the Job Title section!"
30
+ salary_not_found: |
31
+ There is no good match between resume and available job vacancies.
32
+ We recommend that you submit an extended version of your resume.
data/01_raw/data_jobs_scraped_from_jobstreet_2024-04-07.csv ADDED
@@ -0,0 +1,639 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ,job_id,job_title,company,location,category,sub_category,job_type,salary,description
2
+ 0,74727704,Machine Learning Engineer (Artificial Intelligence),Private Advertiser,Kuala Lumpur,Science & Technology,"Mathematics, Statistics & Information Sciences",Full time,"RM 4,000 – RM 6,000 per month","Design, develop, and deploy machine learning models and algorithms for complex and unique datasets, using various techniques such as mathematical modeling, scikit-learn, NLP, CNN, RNN, DL, RL, Transformers, GAN, LLM, RAG
3
+ Collaborate with cross-functional teams to extract insights, identify business opportunities and provide data-driven recommendations
4
+ Stay up-to-date with the latest machine learning and AI techniques and tools
5
+ Communicate complex technical concepts to non-technical stakeholders in an easy-to-understand manner
6
+ Bachelor's degree or higher in Computer Science, Mathematics, Statistics, Actuarial Science, Informatics, Information Science or related fields
7
+ Strong analytical skills and attention to detail
8
+ Participation in Kaggle, Mathematics Olympiad or similar competitions is a plus
9
+ Excellent programming skills in Python, R, Java, or C++
10
+ Familiar with ML frameworks such as Tensorflow, Keras, PyTorch, MLFlow, AutoML, TensorRT, CUDA
11
+ Excellent communication and collaboration skills
12
+ Experience with designing, training, and deploying machine learning models
13
+ Customer centric and committed to deliver the best AI results to customers
14
+ What's your expected monthly basic salary?
15
+
16
+
17
+ Role descriptions
18
+
19
+
20
+ Salary insights
21
+
22
+
23
+ Tools to help you prepare for jobs
24
+
25
+ "
26
+ 1,74850440,Data Scientist,PHENOMENAL UAM Sdn Bhd,Kuala Lumpur,Science & Technology,"Mathematics, Statistics & Information Sciences",Full time,"RM 5,000 – RM 7,500 per month","Utilizing machine learning techniques to analyze and interpret complex data sets.
27
+ Developing AI models and algorithms to solve specific business problems and improve operational efficiency.
28
+ Collaborating with engineers and software developers to integrate AI solutions into our products and services.
29
+ Conducting exploratory data analysis to identify trends, patterns, and insights.
30
+ Designing and implementing experiments to validate model performance and accuracy.
31
+ Communicating findings and insights to stakeholders through visualizations, presentations, and reports.
32
+ Staying abreast of the latest advancements in AI and machine learning technologies, and evaluating their potential applications within our organization.
33
+ Bachelor's degree in Computer Science, Data Science, Statistics, Mathematics, or a related field. Advanced degree (Master's or Ph.D.) preferred.
34
+ Proven experience (X years) working as a Data Scientist or AI Engineer, with a strong focus on developing and implementing machine learning algorithms and AI models.
35
+ Proficiency in programming languages such as Python, R, or Java, along with experience using libraries and frameworks such as TensorFlow, PyTorch, or scikit-learn.
36
+ Solid understanding of statistical analysis, hypothesis testing, and experimental design.
37
+ Experience with data visualization tools such as Matplotlib, Seaborn, or Tableau.
38
+ Strong problem-solving skills and the ability to translate business requirements into analytical solutions.
39
+ Excellent communication and collaboration skills, with the ability to effectively convey technical concepts to both technical and non-technical stakeholders.
40
+ Familiarity with cloud computing platforms (e.g., AWS, Azure, Google Cloud) and big data technologies (e.g., Hadoop, Spark) is a plus.
41
+ Passion for innovation and a keen interest in staying updated with the latest trends and advancements in AI and machine learning."
42
+ 2,74946043,Data Engineer Lead,KK Group Of Companies,Bandar Tasik Selatan,Information & Communication Technology,Engineering - Software,Full time,"RM 7,000 – RM 8,000 per month","Making large and/or complex data more accessible, understandable and usable
43
+ Transforming, improving and integrating data, depending on the business requirements
44
+ Delivering the data in a useful and appealing way to users
45
+ Work with noisy, dirty, and unstructured data, data cleansing, scraping unstructured data, and converting into structured data
46
+ Build and maintain data pipelines, ETL, databases and applications to acquire, store and manage data
47
+ Choose, design, test and implement data software and solutions for projects and enterprise
48
+ Selecting and integrating any Big Data tools and frameworks required to provide requested capabilities
49
+ Monitoring performance and advising necessary infrastructure changes to optimize performance and cost
50
+ Architect, develop and implement end-to-end automated data flows for data and analytical needs
51
+ Perform data profiling, data quality assessment, data cleansing and data transformation
52
+ Recommend appropriate solutions architecture that is highly scalable based on business requirement
53
+ Create date tools for data scientists and data visualization team members
54
+ Builds the infrastructure required for optimal extraction, transformation, and loading for data from a wide variety of data sources using SQL, noSQL and cloud (AWS / Azure / Google / Alibaba / DigitalOcean) technologies
55
+ Benchmark, evaluate and improve the scalability, robustness, efficiency and performance of big data platforms and applications
56
+ Engage in the delivery of analytics assignments for business users involving, amongst others; data preparation, data visualization, creating business insights etc
57
+ Contribute and facilitate workshops/interviews bridging the gap between data analytics and business outcomes
58
+ Provide strategic and tactical advice to business units by translating analytical solutions into out-of-the-box execution recommendations addressing business objectives and impacting bottom line results
59
+ Combine your technical expertise with the ability to architect, deploy and oversee integrated analytics/business intelligence/technology solutions
60
+ Build case for analytics through need analysis and business case (quantify likely benefits)
61
+ Consult with leaders of business teams and manage expectations, meet project and deliverable timelines, ensure accurate and timely completion of deliverables
62
+ Collaborate with several other teams and roles across the organization on requirements gathering and solution deployment
63
+ Make analytical product, service or process decisions likely to impact multiple groups of employees and/or customers (internal or external)
64
+ Manage technology solution portfolios, and identifying new technology approaches to solve business problems with a strong focus on leveraging enterprise data
65
+ Contribute to the building and management of the analytics infrastructure
66
+ Work with management team on staffing needs, strategic technology trend identification, monitoring project progress and completion, issue resolution and technical solutions related to data, data flow and advising on algorithm deployment
67
+ Collaborate with analytics practitioners, business analysts and methodologists to design and oversee the development of working prototypes of technical solutions
68
+ Technical support and lead for the Data Engineer team
69
+ Work with project teams to assist with data-related technical issues and support their data infrastructure needs
70
+ What's your expected monthly basic salary?
71
+ How many years' experience do you have as a Data Engineer?
72
+ How many years' experience do you have as a Tableau Specialist?
73
+ How many years' experience do you have as a Power BI Developer?
74
+ Have you completed an Amazon Web Services Developer certification?
75
+ Have you completed a Cloudera Certified Professional Data Engineer certification?
76
+ How many years' experience do you have as an ETL Data Warehouse Programmer?
77
+ How much notice are you required to give your current employer?"
78
+ 3,74915330,Data Engineer (Python),Inter Island Manpower Pte Ltd,Kuala Lumpur,Information & Communication Technology,Engineering - Software,Full time,"RM 6,000 – RM 7,000 per month","Bachelor's or Master's degree in Computer Science, Engineering, or related fields.
79
+ 1-3 years as a Data Engineer with expertise in data engineering concepts.
80
+ Hands-on experience with Azure or AWS cloud services, Python/Java/Scala, and data processing frameworks.
81
+ Strong knowledge of data modeling, database technologies, and problem-solving skills.
82
+ Excellent communication, collaboration, and adaptability to fast-paced environments.
83
+ Design, develop, and maintain data pipelines for diverse sources into data lake/warehouse.
84
+ Collaborate with data scientists for analysis and reporting needs.
85
+ Research and evaluate data sources for property development.
86
+ Develop web-scraping scripts and automate pipeline deployment.
87
+ Utilize Azure, AWS, and relevant tech for data solutions.
88
+ Integrate data between systems/platforms, monitor, and optimize pipelines.
89
+ Ensure data security, compliance, and quality while staying updated on data tech trends.
90
+ How many years' experience do you have as a Data Engineer?
91
+ Which of the following programming languages are you experienced in?"
92
+ 4,74891040,Principal Data Engineer,Agensi Pekerjaan Hays (Malaysia) Sdn Bhd,Petaling,Information & Communication Technology,Database Development & Administration,Full time,$10000.0 - $14000.0 p.m.,"Take ownership of the end-to-end data solution development and operation, including modelling, ETL, scripting, visualisation, testing, deployment, etc.
93
+ Drive maturity of data infrastructure and architecture by designing, implementing, and maintaining the ingestion, ETL, storage, consumption, archival and disposal process and structure for product and organisation.
94
+ Build data products, including APIs, visualisation tools, analytics and machine learning platforms.
95
+ Utilise ETL to provide actionable insights and intelligence, serving both product offerings and internal data consumption for business functions.
96
+ Coach and mentor the junior team members to develop and improve both technical and non-technical capabilities.
97
+ At least 3 years of experience in a Data Engineering role.
98
+ Strong command of SQL, Python, Data Modelling, Data Warehousing, ETL/ELT.
99
+ Strong foundation in cloud data services such as AWS Redshift, Azure Synapse, BigQuery, Snowflakes, etc.
100
+ A competitive salary based on performance and output.
101
+ Excellent career support and leadership training.
102
+ Exposure to the growing fintech industry and room to grow at an elevated rate."
103
+ 5,74923865,Data Analyst,MONKEYCLOUD MEDIA SDN. BHD.,Petaling,Information & Communication Technology,Help Desk & IT Support,Full time,"RM 3,000 – RM 4,500 per month","Collect and compile data from different sources, including databases, and spreadsheets.
104
+ Filtering and cleaning data to ensure accuracy.
105
+ Analyze data to identify trends, patterns, and anomalies, and provide actionable insights to support in various team decision-making.
106
+ Develop and maintain dashboards and reports to track key project performance metrics, including progress and target achievement.
107
+ Conduct ad-hoc data analysis to address specific project management questions or issues.
108
+ Assist in the preparation of presentations and reports for project status updates and executive-level reporting.
109
+ Diploma or Degree in any related field
110
+ Experience in data analysis, business analysis or similar role, with a strong focus on data manipulation and analysis is preferred.
111
+ Familiarity with data analysis tools and concepts, with experience in Power BI being a strong advantage.
112
+ 2-3 years working experience as a Business Intelligence Analyst/Data Analyst or any position that related to data and business analytics with strong statistical background.
113
+ Preferably with prior experience in the FMCG industry.
114
+ Experience in data analysis, business analysis or similar role, with a strong focus on data manipulation and analysis is preferred.
115
+ Proficiency in data analysis tools and software, such as Power BI, Microsoft Excel (Pivot, Macro) or similar tools.
116
+ Strong organizational skills and the ability to manage multiple tasks and priorities simultaneously.
117
+ A collaborative and team-oriented mindset with the ability to work effectively in a cross-functional environment.
118
+ Excellent communication and presentation skills, with the ability to convey complex information clearly and effectively.
119
+ Strong analytical and problem-solving skills, with the ability to translate data into actionable insights.
120
+ Monday to Friday
121
+ : 8.30a.m to 5.30p.m 
122
+ How many years' experience do you have as a data analyst?
123
+ How many years' experience do you have as a Power BI Consultant?
124
+ How would you rate your Mandarin language skills?
125
+ Which of the following data analytics tools are you experienced with?
126
+ Which of the following Relational Database Management Systems (RDBMS) are you experienced with?"
127
+ 6,74879920,Data Analyst (Intern),MONT ROYALE COLLEGE,Sri Hartamas,Science & Technology,"Mathematics, Statistics & Information Sciences",Contract/Temp,RM 600 per month,"Collect, organize, and analyze data from multiple sources to identify trends, patterns, and insights.
128
+ Develop and maintain databases and data systems necessary for data analysis and reporting.
129
+ Generate regular reports and dashboards to communicate key findings and metrics to stakeholders.
130
+ Collaborate with departments to understand their data needs and provide analytical support to inform strategic initiatives.
131
+ Conduct thorough data quality assessments and implement solutions to address any discrepancies or issues.
132
+ Stay updated on industry trends and best practices in data analysis and visualization techniques.
133
+ Bachelor's degree in Computer Science, Statistics, Mathematics, Economics, or related field. (Master's degree preferred)
134
+ Proven experience working as a data analyst or in a similar role.
135
+ Proficiency in data analysis tools and software such as Excel, SQL, Python, R, or Tableau.
136
+ Strong analytical and problem-solving skills with the ability to translate complex data into actionable insights.
137
+ Excellent communication and presentation skills, with the ability to effectively convey technical information to non-technical stakeholders.
138
+ Detail-oriented with a commitment to accuracy and data integrity.
139
+ Ability to work independently and collaboratively in a fast-paced environment.
140
+ What's your expected monthly basic salary?"
141
+ 7,74866336,Manager - Data Engineer,Elken Sdn Bhd,Petaling,Science & Technology,"Mathematics, Statistics & Information Sciences",Full time,"RM 8,000 – RM 12,000 per month","Provide database operation support including monitoring, performance tuning, backup and restoration, disaster recovery, troubleshooting, replication setup and capacity management.
142
+ Responsible for installing, configuration, upgrading and maintaining multiple environments including on premise and cloud databases.
143
+ Provide recommendation for future direction of database technology and infrastructure setup.
144
+ Deploy database objects and data patches when required by development team.
145
+ Assist developers with complex query tuning and best practice suggestion.
146
+ Database environment setup for Development, UAT and Production.
147
+ Database migration for various platform.
148
+ Build automation script for database tasks.
149
+ Collaborate with Data Analysts, and other stakeholders to understand data requirements and provide data solutions to meet their needs.
150
+ Bachelor’s Degree in Computer Science or equivalent in related field.
151
+ At least 3 years working experience in related field.
152
+ Possess professional database certifications such as Microsoft Certified, Oracle Certified will be added advantage.
153
+ Experience with RDBMS Administration such as MS SQL, My SQL MariaDB, PostgresSQL and Oracle.
154
+ Experience with AWS RDS Cloud database environment administration.
155
+ Experience with Windows Server and Linux environments.
156
+ Experience with Powershell scripting, Linux command line.
157
+ Able to converse in 
158
+ Mandarin
159
+  as needed to support oversea team like Hong Kong, China and Taiwan.
160
+ What's your expected monthly basic salary?
161
+ Which of the following Relational Database Management Systems (RDBMS) are you experienced with?"
162
+ 8,74789886,Junior Data Analyst,Oriental Fortune Trading Malaysia Sdn Bhd,Petaling,Information & Communication Technology,Business/Systems Analysts,Full time,"RM 3,000 – RM 4,300 per month","Data Collection and Cleaning:
163
+ Collect and compile data from various sources, ensuring data accuracy and completeness. Perform data cleaning and transformation as necessary to prepare it for analysis.
164
+ Data Analysis:
165
+ Assist in conducting data analysis using statistical methods, data visualization tools, and data analysis software. Summarize findings in clear and concise reports.
166
+ Data Visualization:
167
+ Create visually appealing and informative charts, graphs, and dashboards to present data insights to stakeholders. Use tools such as Tableau, Power BI, or Excel for visualization.
168
+ Report Generation:
169
+ Prepare regular and ad-hoc reports to communicate key metrics and findings to the team and management. Ensure reports are accurate and delivered on time.
170
+ Data Quality Assurance:
171
+ Help maintain data quality standards by identifying and resolving data discrepancies or inconsistencies. Collaborate with other team members to establish data quality protocols.
172
+ Collaboration:
173
+ Work closely with cross-functional teams to understand data requirements and deliver actionable insights.
174
+ Research and Learning:
175
+ Stay up-to-date with industry trends and best practices in data analysis. Continuously expand your knowledge and skills in data analytics techniques and tools.
176
+ Bachelor's degree in a relevant field such as Finance, Data Science, Statistics, Mathematics, Computer Science, or a related discipline.
177
+ Candidate with 1-2 years quantitative analysis or finance background are preferred;
178
+ fresh graduates are welcome as well.
179
+ Required language(s):
180
+ Fluent in English & Mandarin
181
+ are preferred because the role requires candidates to interact with Mandarin-speaking clients.
182
+ Strong analytical skills and a passion for working with data.
183
+ Proficiency in data analysis tools and programming languages such as Excel, SQL, Python, or R.
184
+ Familiarity with data visualization tools (e.g., Tableau, Power BI) is a plus.
185
+ Excellent communication skills, with the ability to translate complex data into understandable insights.
186
+ Detail-oriented and capable of maintaining data accuracy and integrity.
187
+ Highly organized and able to manage multiple tasks and deadlines effectively.
188
+ Team player with a collaborative mindset and the ability to work well in a diverse team environment.
189
+ Eagerness to learn and adapt to new tools, techniques, and technologies in the field of data analysis.
190
+ Possess own car.
191
+ What's your expected monthly basic salary?
192
+ How many years' experience do you have as a data analyst?
193
+ Which of the following languages are you fluent in?"
194
+ 9,74703677,Junior Data Analyst,Private Advertiser,Bangsar South,Information & Communication Technology,Business/Systems Analysts,Full time,"RM 3,500 – RM 4,300 per month","Analyze and manage data related to E-sports events, focusing on Key Performance Indicators (KPIs).
195
+ Create daily reports highlighting discrepancies or errors identified during the analysis process.
196
+ Chart and conduct in-depth analyses of team performance in E-sports events.
197
+ Real-time monitoring and analysis of data, including the production of charts and performance evaluations.
198
+ Maintain daily event reports with important metrics for overall performance.
199
+ Collaborate with team members, ensuring smooth operations and providing support.
200
+ Fluency in Mandarin preferred for client interaction.
201
+ Strong analytical and critical thinking skills.
202
+ Ability to work independently under pressure and manage multiple projects with tight schedules.
203
+ Flexibility for shifts and overnight work.
204
+ Passion for E-sports is a plus.
205
+ Fresh graduates are welcome to apply.
206
+ 13 months’ Salary
207
+ Quarterly and annual bonuses
208
+ Monthly meal, shift, and parking allowances
209
+ Monthly team gatherings & birthday celebrations
210
+ Unlimited snacks & beverages
211
+ Team building activities & outings
212
+ Medical insurance
213
+ 14 days of annual leave
214
+ Birthday leave
215
+ 8 hours of work per day
216
+ EPF, SOCSO, EIS contributions
217
+ Which of the following languages are you fluent in?"
218
+ 10,74602585,Data Analyst/Scientist,PERSOLKELLY Workforce Solutions Malaysia Sdn Bhd,Kulim District,Science & Technology,"Mathematics, Statistics & Information Sciences",Contract/Temp,"RM 3,500 – RM 5,000 per month","What's your expected monthly basic salary?
219
+ How many years' experience do you have as a data analyst?
220
+ How many years' experience do you have using SQL queries?"
221
+ 11,74586835,Data Engineer,Michael Page International (Malaysia) Sdn Bhd,Kuala Lumpur,Information & Communication Technology,Database Development & Administration,Full time,$36k - $64k p.a. + Bonus + Other benefits,"Designing and implementing Data pipelines and ETL processes from various sources into our Data Warehouse
222
+ Ensure that data is cleaned, standardized, and properly structured to minimize data inconsistencies and gaps
223
+ Create and maintain data pipelines to facilitate the smooth flow of data throughout the organization
224
+ Work closely with cross-functional teams to identify data integration needs and implement solutions that enable seamless data movement
225
+ Establish data quality standards and processes to monitor and maintain the accuracy and integrity of data within Big Query and other data systems
226
+ Implement data validation techniques to identify and rectify invalid or incomplete data entries
227
+ Design and maintain data schemas to ensure consistency and efficiency in data storage and retrieval
228
+ Collaborate with Tech and Business teams to refine data models and adapt to changing business needs
229
+ Drive data-centric decision-making within the organization
230
+ Collaborate with stakeholders to foster a data-driven culture by promoting data literacy and usage
231
+ Bachelor's degree in Computer Science, Engineering or a related field
232
+ Proven experience as a Data Engineer or similar role, with a track record of implementing successful data solutions
233
+ Proficiency in ETL (Extract, Transform, Load) processes and data integration techniques
234
+ Strong knowledge of database systems and data warehousing concepts
235
+ Experience with Google Cloud Platform (GCP) services"
236
+ 12,74513015,Senior Data Scientist (Full Stack),Ambition Group Malaysia Sdn Bhd,Petaling,Consulting & Strategy,Other,Full time,Attractive Remuneration Package,"Utilize technical proficiency for data gathering, cleaning, and preprocessing from varied sources.
237
+ Develop, train, and assess machine learning models for solving business problems like predictive analytics and natural language processing.
238
+ Conduct statistical analysis, hypothesis testing, and data-driven experimentation; stay updated on emerging trends in data science and machine learning
239
+ Degree/Master in Computer Science, Mathematics, Statistics.
240
+ Experienced in all aspects of the data science lifecycle, including problem formulation, data collection and preprocessing, exploratory data analysis, feature engineering, model selection and training, model evaluation.
241
+ Familiar working with cloud-based big data platforms (Azure, GCP, AWS)
242
+ A minimum 3 years of hands on experience with Python.
243
+ Excellent communication skills in Bahasa Melayu and English."
244
+ 13,74636692,Data Analyst,Agensi Pekerjaan Talent Recruit Sdn. Bhd.,Kuala Selangor District,Information & Communication Technology,Business/Systems Analysts,Full time,"RM 4,000 – RM 5,000 per month","Analyze and interpret key indicators in e-commerce
245
+ Provide valuable insights and identify trends based on the sales data and suggest recommendations and key action plans 
246
+ Provide competitor analysis 
247
+ Evaluation of the effectiveness of sales and marketing tools and mechanics
248
+ Creating and developing tools for analysis and information processing 
249
+ Presenting the results of reports and graphical modeling of data 
250
+ Work closely with the e-Commerce sales team to increase sales and revenue 
251
+ Develop and maintain data pipelines to ensure data accuracy, integrity, and availability
252
+ Develop and maintain e-Commerce dashboards as per client’s requests
253
+ Providing insight to the e-Commerce sales team to optimize marketing expenses
254
+ Data Review: reviewing and validating data from various sources (e.g. Lazada, Shopee, etc.) to ensure data quality and accuracy
255
+ Diploma or Bachelor’s degree in Data Science, Statistics, Computer Science, or a related field 
256
+ Min
257
+ 1 - 2 years of experience as a Data Analyst/Business Analyst/Category Management preferably in E-Commerce or FMCG industry
258
+ Advanced Excel and data mining skills 
259
+ Proficient in data visualization tools (e.g. Power BI) and database query languages (e.g. SQL) 
260
+ Knowledge of Cloud Computing is an added advantage (e.g. Azure) 
261
+ Familiarity with Azure Data Factory and Azure Synapse Pipelines is an added advantage 
262
+ Ability to work independently and take the lead in projects 
263
+ Strong presentation skills
264
+ Permanent employment
265
+ Working hours : Mon - Fri (9am – 6pm) *
266
+ Saturday half day only when required by Management during certain peak events (very occasional)
267
+ Working location : Puncak Alam
268
+ What's your expected monthly basic salary?"
269
+ 14,74601731,Data Analyst Internship,Air Energy Consulting (Malaysia) Sdn Bhd,Kuala Lumpur,Administration & Office Support,Records Management & Document Control,Contract/Temp,"RM 900 – RM 1,000 per month","Review and update on Bullhorn data for Asia placements.
270
+ Work with the Service Team Australia on updating the Bullhorn data for data accuracy.
271
+ Work closely with the Quality Coordinator on the accuracy and completeness of the data (Client’s PO/WO and Contractors’ employment packages) within the Airswift systems.
272
+ Work with the relevant parties to ensure that any errors are corrected promptly.
273
+ Administrator and investigate client, assignee and consultant surveys.
274
+ Data analysis – accuracy report that was generated for the entire Asia Pacific Region.
275
+ Consolidate data and dashboards for distribution to SVPs and Global Operations as required.
276
+ Manage the Business Continuity plan and Office Safety for the super region and specific country.
277
+ Work with other members of the Regional Commercial Team, ensuring that all requests for review, questions and queries are addressed, and escalating to the relevant people when required.
278
+ The entire process flow of how a manpower/workforce solution provider organisation functions (Sales/BD -> Recruitment -> Onboarding -> Payroll -> Billing).
279
+ Diploma or Degree in Business Administration or Management or equivalent.
280
+ Knowledge of relevant regulatory requirements in the region.
281
+ Excellent technical skills including all Microsoft Office products and understanding of statistics.
282
+ Ability to problem solve with high attention to detail.
283
+ Excellent written and verbal communication and interpersonal skills.
284
+ Must have a sense of urgency in completing tasks.
285
+ Client service orientated with good teamwork skills.
286
+ Ownership and willingness to accept responsibility.
287
+ Virtual Onboarding Events.
288
+ Team-driven environment fostering a supportive culture with a strong emphasis on work-life balance.
289
+ Cheers! where your peers can nominate you for outstanding performance.
290
+ Coaching and Training are provided.
291
+ Growth
292
+ - In life and business, one must grow to flourish and achieve high ambitions. Growth requires change, challenge, risk and sacrifice - we will always choose growth.
293
+ Safety
294
+ - We create a safe and inclusive environment to protect the physical and psychological well-being of everyone we work with.
295
+ Excellence
296
+ - We deliver, holding ourselves accountable for results. Our customers see excellence in everything we do.
297
+ Integrity
298
+ - We are ethical, open, honest and authentic. People trust us to do the right thing for the right reason.
299
+ Which of the following statements best describes your right to work in Malaysia?"
300
+ 15,74360323,Data Scientist,TALENTBANK SDN BHD,Kuala Lumpur,Science & Technology,"Mathematics, Statistics & Information Sciences",Full time,"RM 6,800 – RM 10,200 per month","Which of the following programming languages are you experienced in?
301
+ What's your expected monthly basic salary?
302
+ Which of the following data analytics tools are you experienced with?
303
+ How many years' experience do you have as a data scientist?"
304
+ 16,74433737,Data Engineer,Michael Page International (Malaysia) Sdn Bhd,Selangor,Sales,Other,Full time,$96k - $97k p.a.,"Attractive remuneration package
305
+ Be part of a globally recognised organisation
306
+ Enhance your development skills"
307
+ 17,74323540,Data Scientist,AGENSI PEKERJAAN LEWIS TALENT CONSULTING SDN. BHD.,Petaling,Science & Technology,"Mathematics, Statistics & Information Sciences",Full time,"RM 6,000 – RM 8,500 per month","Understanding the data information, investigating and preprocessing vast datasets to unveil significant features and insights.
308
+ Design and deploy predictive models employing advanced statistical and machine learning techniques to address complex business issues.
309
+ Generate compelling visualizations to illustrate complex analyses and insights to both technical and non-technical audiences.
310
+ Engage closely with multidisciplinary teams to grasp business objectives and partake in data-driven projects.
311
+ Keep abreast of the latest data science advancements and enhance the organization's data science methodologies.
312
+ At least possess a Master's or Ph.D. in Data Science, Statistics, Computer Science, or related fields.
313
+ Demonstrated experience as a Data Scientist, supported by a portfolio of successful data science projects.
314
+ Expertise in Python or R, including familiarity with relevant libraries and frameworks.
315
+ Solid foundation and experience in applying machine learning algorithms to practical problems.
316
+ Strong interpersonal and communication capabilities for effective teamwork with cross-functional groups.
317
+ Commitment to staying informed about industry trends, data science techniques, and tools. Engage in ongoing training and development programs as necessary.
318
+ Expertise in analyzing extensive, complex datasets to extract valuable insights.
319
+ Proficiency in applying statistical and machine learning methods to solve real-world challenges.
320
+ Competence in programming languages like Python or R for data analysis and modeling.
321
+ Strong ability in creating visualizations with tools such as Tableau, Power BI, or similar platforms.
322
+ Creative problem-solving aptitude for tackling complex business issues with data-driven solutions.
323
+ Exceptional ability to articulate complex findings and insights clearly.
324
+ How many years' experience do you have as a data scientist?
325
+ Which of the following programming languages are you experienced in?
326
+ Which of the following data analytics tools are you experienced with?
327
+ How would you rate your Mandarin language skills?
328
+ Which of the following languages are you fluent in?"
329
+ 18,73981733,Big Data Engineer - Cloud,Private Advertiser,Kuala Lumpur,Information & Communication Technology,Engineering - Software,Contract/Temp,"RM 9,000 – RM 13,000 per month","Which of the following statements best describes your right to work in Malaysia?
330
+ What's your expected monthly basic salary?
331
+
332
+
333
+ Role descriptions
334
+
335
+
336
+ Salary insights
337
+
338
+
339
+ Tools to help you prepare for jobs
340
+
341
+ "
342
+ 19,74324430,Data Scientist Assistant Manager,AGENSI PEKERJAAN LEWIS TALENT CONSULTING SDN. BHD.,Petaling,Science & Technology,"Mathematics, Statistics & Information Sciences",Full time,"RM 10,000 – RM 14,000 per month","Support the oversight of data science projects to ensure their timely and precise execution, delivering advanced analytics and insights. Address project hurdles collaboratively and refine project outcomes.
343
+ Assist the Data Scientist Manager in mentoring and directing the data science team. Offer guidance on project priorities, technical challenges, and career growth.
344
+ Engage with your team to craft and execute predictive models leveraging sophisticated statistical and machine learning techniques to tackle complex business issues.
345
+ Aid in the exploration and preprocessing of large datasets to unearth valuable features and insights.
346
+ Collaborate effectively with multidisciplinary teams to grasp business goals, align data science projects, and articulate insights to non-technical audiences.
347
+ Drive the enhancement of data science methodologies and processes. Keep abreast of industry standards and evolving trends.
348
+ Holding a minimum of Master's or Ph.D. in Data Science, Statistics, Computer Science, or any other related fields.
349
+ Demonstrable experience as a Data Scientist, including roles in leadership or management.
350
+ High proficiency in languages such as Python or R, along with familiarity with pertinent libraries and frameworks.
351
+ A thorough grasp and application experience of machine learning algorithms in practical scenarios.
352
+ Excellent interpersonal and communication skills for effective teamwork and cross-functional collaboration.
353
+ A commitment to continuous learning, staying updated with data science trends, techniques, and tools, and participating in relevant training and development programs.
354
+ Proven leadership ability to mentor and inspire a team of data scientists.
355
+ Exceptional project management capabilities, with a knack for prioritizing, resource allocation, and deadline adherence.
356
+ Skilled in dissecting large, intricate datasets to extract significant insights.
357
+ Proficiency in utilizing advanced statistical and machine learning methods to address real-world challenges.
358
+ Competent in programming languages like Python or R for data analysis and modeling.
359
+ Strong communication skills, capable of explaining complex findings to a broad audience range.
360
+ How many years' experience do you have as an Assistant Manager?
361
+ Which of the following programming languages are you experienced in?
362
+ Which of the following data analytics tools are you experienced with?
363
+ Which of the following languages are you fluent in?
364
+ How would you rate your Mandarin language skills?"
365
+ 20,74488251,Data Engineer/Data Cloud Engineer/Salesforce,LIEGE SOLUTIONS SDN. BHD.,Kuala Lumpur,Information & Communication Technology,Engineering - Software,Full time,"RM 8,000 – RM 12,000 per month","Develop, build, and implement Salesforce Data Cloud instances in conjunction with Salesforces (Sales/Service, Salesforce Marketing Cloud, etc.)
366
+ Develop, oversee, and maintain key data model, segmentation, architecture, code documentation, and technical specifications for designated workstreams.
367
+ Lead the development and physical design of solutions, guide the development team, and ensure the system's architectural integrity.
368
+ Implement Salesforce Data Cloud, emphasizing data modelling, performance optimization, and integrations.
369
+ Function as a technical solution specialist within the Salesforce Data Cloud ecosystem, including components like Data streams, Unified Data Model, Transformations/Insights, Identity Resolution, Segmentations, etc. 
370
+ Drive end-to-end project execution, encompassing analysis, documentation, solution development, testing, and performance, to meet client expectations.
371
+ Identify technical design and data model risks, confirm their validity, and devise strategies to mitigate them.
372
+ Actively participate in technical design and data model workshops alongside Engineering Teams and business stakeholders.
373
+ Clearly translate and communicate technical requirements and solutions to the business using story-based narratives, and present strategies and recommendations to executives.
374
+ Ensure that technical documentation is consistently updated through active participation in technical workshops.
375
+ Lead sessions to understand client needs in technical marketing and Salesforce Data Cloud requirements, and act as a crucial technical resource throughout the implementation process.
376
+ Assess client requirements, propose effective solutions, and communicate the capabilities and limitations of Salesforce Data Cloud, Marketing Cloud Engage, Personalization, Intelligence, Sales, Service, and other relevant SF ecosystem platforms.
377
+ Oversee and coordinate testing activities, including System Integration Testing (SIT) and User Acceptance Testing (UAT), in line with the project's scope.
378
+ Manage workstreams in large-scale enterprise implementations, focusing on Salesforce Data Cloud architecture and data models.
379
+ Bachelor's Degree.
380
+ Experience with at least 1 end-to-end Salesforce Data Cloud implementation.
381
+ Experience with at least 1 Salesforce cross-cloud integrations.
382
+ 5+ years in data model and architecture in the Salesforce ecosystem.
383
+ 5+ years in consulting, project implementation for global based client, or project management using Agile methodologies.
384
+ 5+ years in Customer Data Modelling best practices.
385
+ Proficiency in moderately complex SQL queries, SOQL, Python.
386
+ Experience in integrating with cloud-based data warehouses/data lakes (e.g. Snowflake, Databricks, etc.).
387
+ Effective communication to both technical and non-technical audiences.
388
+ Experience with Tableu, Data Analytic Platform (e.g. PowerBI).
389
+ Experience with Segmentation strategy.
390
+ Experience with Marketing Cloud Personalization.
391
+ Experience with Adobe, Marketo, Oracle or other enterprise marketing automation or CDP platforms.
392
+ Front-end coding skills (HTML, CSS/SASS, Typescript, React, XML, JSON).
393
+ Salesforce certifications (Salesforce Data Cloud Consultant , and Salesforce Certifications - Marketing Cloud, Admin, etc.).
394
+ A supportive, friendly and professional team environment;
395
+ Stability to develop your career with an established and growth focused company; 
396
+ A range of benefits as part of our Employee Benefit Program;
397
+ Which of the following statements best describes your right to work in Malaysia?
398
+ What's your expected monthly basic salary?
399
+ How many years' experience do you have as a Salesforce Developer?
400
+ How many years' experience do you have as a Salesforce Architect?
401
+ How many years' experience do you have as a Salesforce Marketing Cloud Specialist?
402
+ How many years' experience do you have as a Salesforce Engineer?"
403
+ 21,74354434,Data Analyst,EMART24 HOLDINGS SDN. BHD.,Petaling,Information & Communication Technology,Business/Systems Analysts,Full time,"RM 6,800 – RM 7,000 per month","Develop and maintain dashboards and reports that provide insights into key performance indicators (KPIs) across all business functions.
404
+ Partner with IT team to leverage technology infrastructure and tools to optimize data collection, storage, and analysis processes.
405
+ Conduct ad-hoc analysis to answer business questions and identify opportunities for growth and improvement.
406
+ Collaborate with cross-functional teams to identify, analyze and recommend data driven solutions to business problems.
407
+ Develop and maintain data pipelines to ensure data is accurate, consistent, and available in a timely manner.
408
+ Identify and recommend BI tools that can help to improve data visibility and analysis across the organization.
409
+ Train and support end-users on the use of BI tools and dashboards.
410
+ Provide recommendations to stakeholders on how to use data to drive growth and improve bottom-line results.
411
+ Continuously monitor the performance of the business and make recommendations for improvement based on data analysis.
412
+ Maintain a high level of data quality, accuracy, and consistency.
413
+ Bachelor's degree in Actuarial Science, Computer Science, Information Technology, or a related field.
414
+ Proven experience (3-5 years) in a business intelligence or data analyst role, preferably in a retail or eCommerce environment.
415
+ Experience using BI tools such as Power BI, Tableau, Metabase, or Google Data Studio.
416
+ Proficient in SQL and/or Python, and data visualization tools.
417
+ Strong analytical and problem-solving skills with the ability to work with complex data sets.
418
+ Ability to work collaboratively and communicate effectively with cross-functional teams.
419
+ Knowledge of retail or eCommerce industry and data trends.
420
+ Excellent communication skills, both written and verbal, with the ability to communicate complex ideas in a simple and clear manner.
421
+ Ability to prioritize and manage multiple projects simultaneously.
422
+ Proficient in both English and Bahasa Malaysia.
423
+ Which of the following statements best describes your right to work in Malaysia?
424
+ What's your expected monthly basic salary?
425
+ How many years' experience do you have as a data analyst?
426
+ Which of the following programming languages are you experienced in?
427
+ How many years' experience do you have using SQL queries?
428
+ Which of the following Relational Database Management Systems (RDBMS) are you experienced with?
429
+ How would you rate your English language skills?
430
+ Which of the following languages are you fluent in?"
431
+ 22,74366437,Data Analyst,Asia Careers Group SDN BHD,Kuala Lumpur,Science & Technology,"Mathematics, Statistics & Information Sciences",Full time,"RM 3,700 – RM 5,500 per month","Make large and complex data more
432
+ accessible, understandable and usable
433
+ within Tableau
434
+ Transform, improve and integrate
435
+ graduate outcomes and average salary data
436
+ Combining data result
437
+ sets across multiple sources using Excel and or other databased platforms
438
+ Maintain and manage existing data collection tools
439
+ , analytics tools and reporting platforms
440
+ Maintain existing reports and develop new report where necessary to support business needs.
441
+ Delivering the data in a useful and appealing way to clients
442
+ Excellent analytical skills
443
+ Ability to work with large amounts of information and see the ‘bigger picture’
444
+ Comfortable with
445
+ juggling facts, figures, and number crunching
446
+ Strong communication skills
447
+ , both written and oral and
448
+ able to work well with other teams
449
+ Proficient in interacting with computer softwares
450
+ Critical thinking
451
+ : able to look at numbers, trends, and data and come to new conclusions based on the findings
452
+ Attention to detail
453
+ Excellent maths skills
454
+ Working to deadlines
455
+ Independent
456
+ Drive data collection
457
+ and input data collection with
458
+ preprogrammed scripts
459
+ To
460
+ collate and merge data sets
461
+ to enable reporting within Tableau
462
+ Analyse current and past trends
463
+ of key performance indicators, highlighting trends and analysing causes of unexpected variances
464
+ Develop models and analysis to
465
+ support strategic initiatives
466
+ Create presentations that provide insightful analysis,
467
+ identify and propose actions for effective decisions
468
+ Create infographics and generate insight
469
+ for both clients and for use in marketing and promotion
470
+ To
471
+ perform ad hoc requests
472
+ for data on a day-to-day basis
473
+ Minimum 1 years working experience in data analytics and visualisation
474
+ Must be proficient in Power BI,
475
+ Tableau, Excel
476
+ . Coding skills (Python / NodeJS) and graphic designing skills (Adobe Illustrator) are an advantage but not essential
477
+ Experience creating, integrating dashboards to present data in a meaningful and interactive manner
478
+ Good team worker
479
+ and able to multitask & meet tight deadlines
480
+ Results oriented and creative
481
+ Required language(s):
482
+ Proficient in English
483
+ , other foreign languages a plus.
484
+ Full time employment
485
+ Company provided computer
486
+ Exposure to fun and challenging problems in an agile environment
487
+ Transparent company culture and openness to new ideas
488
+ Training courses in your areas of interest
489
+ Which of the following languages are you fluent in?
490
+ Are you willing to undergo a pre-employment background check?
491
+ How much notice are you required to give your current employer?"
492
+ 23,74354364,Data Analyst,PERSOLKELLY Workforce Solutions Malaysia Sdn Bhd,Kuala Lumpur,Consulting & Strategy,Analysts,Contract/Temp,"RM 5,500 – RM 6,500 per month","Provide quantitative analytics support to the AML Transformation & Governance team to enhance risk assessment of ongoing current and emerging AML centric risks in core areas; business, client, operations & system controls within the international business entities. 
493
+ Develop and enhance quantitative/qualitative risk assessment model to proactively identify risks enhancement opportunities in areas that are not clearly defined by legislative AML guidance 
494
+ Provide data extraction & quantitative validation for targeted AML risks or ongoing AML issues identified 
495
+ Provide other ad hoc data extraction and validation requests from time to time and assist in the maintenance of data library for all IBG entities 
496
+ Provide periodic quantitative entities’ risk assessment report as part of AML governance update to the AML IBG management 
497
+ Provide collaboration support with the AML data team on entity level business intelligence related request from regional AML teams. 
498
+ Bachelor degree or above majoring in computer science/engineering, statistics, Mathematics, finance and similar fields; Candidate with experience in AML risk modelling and data analytics are preferred 
499
+ At least three years' AML/risk control/fraud management working experience in FI, PI, consulting firms or similar industries 
500
+ Proficient in data handling coding (e.g., SQL) and at least one professional analytics tool such as SAS, python etc. 
501
+ Familiar with at least one popular BI tool, such as Power BI, Tableau.
502
+ Strong ability in logical thinking and problem solving 
503
+ Able to plan & work independently on required deliverables and deadlines 
504
+ Able to work under pressure and result-oriented 
505
+ Highly flexible and detail-oriented with the ability to multi-task and change directions based on demand
506
+ How many years' experience do you have as a data analyst?
507
+ Which of the following languages are you fluent in?
508
+ How many years' experience do you have as a business intelligence (BI) analyst?"
509
+ 24,74323206,Senior Data Analyst,AGENSI PEKERJAAN LEWIS TALENT CONSULTING SDN. BHD.,Petaling,Information & Communication Technology,Business/Systems Analysts,Full time,"RM 5,000 – RM 7,000 per month","Lead sophisticated ETL processes to aggregate data from various sources into our data warehouse.
510
+ Perform comprehensive data analysis to uncover trends, patterns, and insights, aiding in addressing business challenges and uncovering opportunities.
511
+ Pioneer the development of complex data visualizations, charts, and dashboards with tools like Tableau and Power BI.
512
+ Ensure data integrity through rigorous quality control, identifying and correcting discrepancies.
513
+ Work closely with different departments to align data insights with business objectives, advising on strategic actions.
514
+ Mentor junior data analysts, promoting a culture of continuous learning and professional growth.
515
+ At least possess a Bachelor’s or Master’s in Data Science, Mathematics, Statistics, Computer Science, or any equivalent fields.
516
+ Technical Proficiency in data analysis (Excel, Python, SQL) and visualization tools (Tableau, Power BI) are required.
517
+ Significant experience in a senior data analyst capacity, evidenced by impactful contributions.
518
+ A solid understanding of Machine Learning, AI, and Data Warehousing principles.
519
+ Leadership Experience: Demonstrated ability to lead and develop a team of data analysts.
520
+ Exceptional analytical skills for dissecting complex datasets and performing advanced statistical analyses to derive actionable insights.
521
+ Demonstrated leadership with the capability to effectively collaborate with both technical and non-technical teams.
522
+ Creative problem-solving skills, offering innovative solutions to business issues.
523
+ Excellent communication, able to simplify complex data findings for a varied audience.
524
+ How many years' experience do you have using SQL queries?
525
+ How would you rate your Mandarin language skills?
526
+ Which of the following languages are you fluent in?
527
+ Which of the following data visualisation tools are you experienced with?
528
+ Which of the following data analytics tools are you experienced with?"
529
+ 25,74653378,Data Analyst (E-Commerce),AGENSI PEKERJAAN JS STAFFING SERVICES SDN BHD,Kuala Selangor District,Information & Communication Technology,Database Development & Administration,Full time,"RM 4,000 – RM 4,500 per month","Analyze and interpret key indicators in e-commerce such as sales performance, SKU sales, traffic, conversion rates, buying behavior, pricing, etc. 
530
+ Provide valuable insights and identify trends based on the sales data and suggest recommendations and key action plans 
531
+ Provide competitor analysis 
532
+ Evaluation of the effectiveness of sales and marketing tools and mechanics 
533
+ Creating and developing tools for analysis and information processing 
534
+ Presenting the results of reports and graphical modeling of data 
535
+ Work closely with the E-Commerce sales team to increase sales and revenue 
536
+ Develop and maintain data pipelines to ensure data accuracy, integrity, and availability. 
537
+ Develop and maintain E-Commerce dashboards as per client’s requests. 
538
+ Providing insight to the E-Commerce sales team to optimize marketing expenses. 
539
+ Data Review: reviewing and validating data from various sources (e.g. Lazada, Shopee, etc.) to ensure data quality and accuracy. 
540
+ Candidates must possess at least a Bachelor’s degree in Data Science, Statistics, Computer Science, or a related field 
541
+ Min. 1-2 years of experience as a Data Analyst/Business Analyst/Category Management preferably in E-Commerce or FMCG role 
542
+ Advanced Excel and data mining skills 
543
+ Proficient in data visualization tools (e.g. Power BI) and database query languages (e.g. SQL) 
544
+ Knowledge of Cloud Computing is an added advantage (e.g. Azure) 
545
+ Familiarity with Azure Data Factory and Azure Synapse Pipelines is an added advantage 
546
+ Ability to work independently and take the lead in projects 
547
+ Strong presentation skills 
548
+ Which of the following statements best describes your right to work in Malaysia?
549
+ What's your expected monthly basic salary?"
550
+ 26,74552169,Senior Business Intelligence Developer / Data Engineer,CITY FACILITIES MANAGEMENT SDN. BHD.,Kuala Lumpur,Science & Technology,"Mathematics, Statistics & Information Sciences",Full time,"RM 8,000 – RM 12,000 per month","Translation of business requirements into accurate and interactive data visualizations and analytics that enable management and other stakeholders to make informed decisions.
551
+ Drive data mining & data science initiatives
552
+ Review reports to determine basic characteristics, such as origin and report flow, format, frequency, distribution, and purpose or function of report.
553
+ May prepare and issue instructions concerning generation, completion, and distribution of reports according to new or revised practices, procedures, or policies of reports management.
554
+ Examine and evaluate purpose and content of business reports to develop new, or improve existing, format, use, and control.
555
+ Recommend the establishment of new or modified reporting methods and procedures to improve report content and completeness of information.
556
+ Produce associated documentation including functional specifications, testing plans and support documentation.
557
+ Understand and comprehend new system changes as they affect data, reports, and reporting systems.
558
+ Confer with team members originating, handling, processing, or receiving reports to identify problems and to gather suggestions for improvements.
559
+ Manage internal and external stakeholders.
560
+ Provide support and coaching to application support staff and associated team members.
561
+ Provides assistance to support existing infrastructure where appropriate..
562
+ Participate in application validation and QA efforts as they pertain to reporting, data, metrics, and report creation and execution.
563
+ Provides assistance to support existing infrastructure where appropriate.
564
+ Develop the appropriate strategies and plans to ensure a successful implementation that conforms to the company agreed SLA’s.
565
+ A relevant advanced technical degree.
566
+ A mastery of Business Intelligence concepts and the ability to articulate these concepts to technical and non-technical audiences.
567
+ At least 5+ years’ experience in Business Intelligence
568
+ Demonstrable strong data analysis and data mining skills (T-SQL, DAX and MDX).
569
+ Strong data visualisation skills using SSRS & Power BI.
570
+ Exposure to Snowflake on Azure platform and Matillion tools is desirable.
571
+ Demonstrable experience with Business Intelligence tools in the Microsoft BI stack (SSAS and SSIS) or Matillion.
572
+ Strong communication skills with the ability to collaborate and contribute to a team positively.
573
+ Strong analytical skills with the ability to translate business needs into technical solutions.
574
+ The ideal candidate will have experience working in agile projects.
575
+ High cognitive ability to problem solving
576
+ Own and display City values, respect and value others and work as one team. (mandatory all positions)
577
+ Maintain a professional and positive attitude at all times and act with integrity (lead by example).
578
+ High level numerical and analytical skills
579
+ Demonstrated attention to detail and accuracy
580
+ Maintain a high standard of delivery on all work and meet deadlines
581
+ Display high level of motivation and drive at all times.
582
+ Remain calm and organised at all times.
583
+ Ability to develop rapport to ensure effective working relationship with a broad range of managers and team members.
584
+ Demonstrate initiative and confidentiality across the business.
585
+ Treat others with respect and be conscientious and courteous.
586
+ Good time management, planning and organising ability.
587
+ What's your expected monthly basic salary?
588
+
589
+
590
+ Role descriptions
591
+
592
+
593
+ Salary insights
594
+
595
+
596
+ Tools to help you prepare for jobs
597
+
598
+ "
599
+ 27,74976890,Risk Operation Data Analyst ( Online Payment),ALIPAY MALAYSIA SDN. BHD.,Kuala Lumpur,Banking & Financial Services,Analysis & Reporting,Full time,"RM 10,000 – RM 15,000 per month","What's your expected monthly basic salary?
600
+ Which of the following languages are you fluent in?
601
+ How many years' experience do you have as an Operational Risk Analyst?"
602
+ 28,74434092,Data Analyst - Internal Audit,Michael Page International (Malaysia) Sdn Bhd,Petaling,"Manufacturing, Transport & Logistics","Warehousing, Storage & Distribution",Full time,$102k - $103k p.a.,"Gain understanding of the company's financial, commercial and operational processes.
603
+ Perform a hybrid role of conducting risk-based financial and operational audits with a core function of data analytics and statistical analysis of existing data sets
604
+ Query and mine large data sets to discover transaction patterns, examine data and filter for targeted information
605
+ Prepare and execute data analysis and interpret the results based on requirements to support the Express audit department and the Data Analytics team
606
+ Prepare data reports and clear data visualizations for stakeholders
607
+ Develop tools, audit approach and audit programs in strong cooperation with the Express audit team to support the internal audit work
608
+ Develop automation for data analysis to increase audit efficiency
609
+ Present key insights using advanced visualization tools, and highlight the data analytics capability to the IA team globally
610
+ Present and discuss findings/data analysis results with management and create high quality audit reports and working papers in line with Internal Audit standards
611
+ Perform requested follow-up procedures as necessary
612
+ Openness to conduct audits across various segments of the company's business, including Operations (OPS), Customs, Finance, and Commercial
613
+ Build and maintain solid relationships with key stakeholders, especially in the business IT area
614
+ Manage own development in the professional area proactively
615
+ Knowledge and experience in MS Office and the following desirable applications:
616
+ Database query languages (e.g. SQL)
617
+ Power BI, Tableau, Qlik Sense, KNIME
618
+ Programming and coding (e.g. Python, R, C/C++/C#, Visual Basic)
619
+ Cloud computing, Azure, AWS, RPA
620
+ Process mining, e.g. Celonis, DISCO, Signavio
621
+ Excellent English language skills
622
+ Convincing communicator and negotiator
623
+ Analytical and conceptual abilities
624
+ Analytical problem solving and proactive in finding solutions
625
+ High personal responsibility and high degree of integrity
626
+ Ability to work under pressure in a demanding environment
627
+ Rapid comprehension
628
+ Willingness to travel up to 30% of the time (or more if business needs dictate)
629
+ Database query languages (e.g. SQL)
630
+ Power BI, Tableau, Qlik Sense, KNIME
631
+ Programming and coding (e.g. Python, R, C/C++/C#, Visual Basic)
632
+ Cloud computing, Azure, AWS, RPA
633
+ Process mining, e.g. Celonis, DISCO, Signavio
634
+ Minimum two years of experience in data analytics, preferably within the company
635
+ Minimum bachelor's degree in information technology, computer science, engineering, business administration or comparable qualification/background
636
+ First professional experience, preferably within the company
637
+ Attractive remuneration package
638
+ Be part of a globally recognised organisation
639
+ Enhance your analytical skills"
data/01_raw/data_scientist_jobstreet_scraped_v2 2.csv CHANGED
@@ -1,4 +1,4 @@
1
- ,job_id,job_title,company,descriptions,location,category,subcategory,type,salary
2
  0,72761527,Data Engineer,ANHSIN TECHNOLOGY SDN BHD,"Design, develop, and maintain scalable and robust data architecture.
3
  Create and manage databases, data processing systems, and data integration solutions.
4
  Develop and implement efficient ETL processes for data ingestion and transformation.
 
1
+ ,job_id,job_title,company,description,location,category,subcategory,type,salary
2
  0,72761527,Data Engineer,ANHSIN TECHNOLOGY SDN BHD,"Design, develop, and maintain scalable and robust data architecture.
3
  Create and manage databases, data processing systems, and data integration solutions.
4
  Develop and implement efficient ETL processes for data ingestion and transformation.
data/02_processed/73981733.json ADDED
@@ -0,0 +1 @@
 
 
1
+ {"id": 73981733, "location": "kuala lumpur", "company": "private advertiser", "job_title": "big data engineer - cloud", "salary": "RM\u00a09,000 \u2013 RM\u00a013,000 per month", "description": "Which of the following statements best describes your right to work in Malaysia?\nWhat's your expected monthly basic salary?\n\n\nRole descriptions\n\n\nSalary insights\n\n\nTools to help you prepare for jobs\n\n"}
data/02_processed/74323206.json ADDED
@@ -0,0 +1 @@
 
 
1
+ {"id": 74323206, "location": "petaling", "company": "agensi pekerjaan lewis talent consulting sdn. bhd.", "job_title": "senior data analyst", "salary": "RM\u00a05,000 \u2013 RM\u00a07,000 per month", "description": "Lead sophisticated ETL processes to aggregate data from various sources into our data warehouse.\nPerform comprehensive data analysis to uncover trends, patterns, and insights, aiding in addressing business challenges and uncovering opportunities.\nPioneer the development of complex data visualizations, charts, and dashboards with tools like Tableau and Power BI.\nEnsure data integrity through rigorous quality control, identifying and correcting discrepancies.\nWork closely with different departments to align data insights with business objectives, advising on strategic actions.\nMentor junior data analysts, promoting a culture of continuous learning and professional growth.\nAt least possess a Bachelor\u2019s or Master\u2019s in Data Science, Mathematics, Statistics, Computer Science, or any equivalent fields.\nTechnical Proficiency in data analysis (Excel, Python, SQL) and visualization tools (Tableau, Power BI) are required.\nSignificant experience in a senior data analyst capacity, evidenced by impactful contributions.\nA solid understanding of Machine Learning, AI, and Data Warehousing principles.\nLeadership Experience: Demonstrated ability to lead and develop a team of data analysts.\nExceptional analytical skills for dissecting complex datasets and performing advanced statistical analyses to derive actionable insights.\nDemonstrated leadership with the capability to effectively collaborate with both technical and non-technical teams.\nCreative problem-solving skills, offering innovative solutions to business issues.\nExcellent communication, able to simplify complex data findings for a varied audience.\nHow many years' experience do you have using SQL queries?\nHow would you rate your Mandarin language skills?\nWhich of the following languages are you fluent in?\nWhich of the following data visualisation tools are you experienced with?\nWhich of the following data analytics tools are you experienced with?"}
data/02_processed/74323540.json ADDED
@@ -0,0 +1 @@
 
 
1
+ {"id": 74323540, "location": "petaling", "company": "agensi pekerjaan lewis talent consulting sdn. bhd.", "job_title": "data scientist", "salary": "RM\u00a06,000 \u2013 RM\u00a08,500 per month", "description": "Understanding the data information, investigating and preprocessing vast datasets to unveil significant features and insights.\nDesign and deploy predictive models employing advanced statistical and machine learning techniques to address complex business issues.\nGenerate compelling visualizations to illustrate complex analyses and insights to both technical and non-technical audiences.\nEngage closely with multidisciplinary teams to grasp business objectives and partake in data-driven projects.\nKeep abreast of the latest data science advancements and enhance the organization's data science methodologies.\nAt least possess a Master's or Ph.D. in Data Science, Statistics, Computer Science, or related fields.\nDemonstrated experience as a Data Scientist, supported by a portfolio of successful data science projects.\nExpertise in Python or R, including familiarity with relevant libraries and frameworks.\nSolid foundation and experience in applying machine learning algorithms to practical problems.\nStrong interpersonal and communication capabilities for effective teamwork with cross-functional groups.\nCommitment to staying informed about industry trends, data science techniques, and tools. Engage in ongoing training and development programs as necessary.\nExpertise in analyzing extensive, complex datasets to extract valuable insights.\nProficiency in applying statistical and machine learning methods to solve real-world challenges.\nCompetence in programming languages like Python or R for data analysis and modeling.\nStrong ability in creating visualizations with tools such as Tableau, Power BI, or similar platforms.\nCreative problem-solving aptitude for tackling complex business issues with data-driven solutions.\nExceptional ability to articulate complex findings and insights clearly.\nHow many years' experience do you have as a data scientist?\nWhich of the following programming languages are you experienced in?\nWhich of the following data analytics tools are you experienced with?\nHow would you rate your Mandarin language skills?\nWhich of the following languages are you fluent in?"}
data/02_processed/74324430.json ADDED
@@ -0,0 +1 @@
 
 
1
+ {"id": 74324430, "location": "petaling", "company": "agensi pekerjaan lewis talent consulting sdn. bhd.", "job_title": "data scientist assistant manager", "salary": "RM\u00a010,000 \u2013 RM\u00a014,000 per month", "description": "Support the oversight of data science projects to ensure their timely and precise execution, delivering advanced analytics and insights. Address project hurdles collaboratively and refine project outcomes.\nAssist the Data Scientist Manager in mentoring and directing the data science team. Offer guidance on project priorities, technical challenges, and career growth.\nEngage with your team to craft and execute predictive models leveraging sophisticated statistical and machine learning techniques to tackle complex business issues.\nAid in the exploration and preprocessing of large datasets to unearth valuable features and insights.\nCollaborate effectively with multidisciplinary teams to grasp business goals, align data science projects, and articulate insights to non-technical audiences.\nDrive the enhancement of data science methodologies and processes. Keep abreast of industry standards and evolving trends.\nHolding a minimum of Master's or Ph.D. in Data Science, Statistics, Computer Science, or any other related fields.\nDemonstrable experience as a Data Scientist, including roles in leadership or management.\nHigh proficiency in languages such as Python or R, along with familiarity with pertinent libraries and frameworks.\nA thorough grasp and application experience of machine learning algorithms in practical scenarios.\nExcellent interpersonal and communication skills for effective teamwork and cross-functional collaboration.\nA commitment to continuous learning, staying updated with data science trends, techniques, and tools, and participating in relevant training and development programs.\nProven leadership ability to mentor and inspire a team of data scientists.\nExceptional project management capabilities, with a knack for prioritizing, resource allocation, and deadline adherence.\nSkilled in dissecting large, intricate datasets to extract significant insights.\nProficiency in utilizing advanced statistical and machine learning methods to address real-world challenges.\nCompetent in programming languages like Python or R for data analysis and modeling.\nStrong communication skills, capable of explaining complex findings to a broad audience range.\nHow many years' experience do you have as an Assistant Manager?\nWhich of the following programming languages are you experienced in?\nWhich of the following data analytics tools are you experienced with?\nWhich of the following languages are you fluent in?\nHow would you rate your Mandarin language skills?"}
data/02_processed/74354364.json ADDED
@@ -0,0 +1 @@
 
 
1
+ {"id": 74354364, "location": "kuala lumpur", "company": "persolkelly workforce solutions malaysia sdn bhd", "job_title": "data analyst", "salary": "RM\u00a05,500 \u2013 RM\u00a06,500 per month", "description": "Provide quantitative analytics support to the AML Transformation & Governance team to enhance risk assessment of ongoing current and emerging AML centric risks in core areas; business, client, operations & system controls within the international business entities.\u00a0\nDevelop and enhance quantitative/qualitative risk assessment model to proactively identify risks enhancement opportunities in areas that are not clearly defined by legislative AML guidance\u00a0\nProvide data extraction & quantitative validation for targeted AML risks or ongoing AML issues identified\u00a0\nProvide other ad hoc data extraction and validation requests from time to time and assist in the maintenance of data library for all IBG entities\u00a0\nProvide periodic quantitative entities\u2019 risk assessment report as part of AML governance update to the AML IBG management\u00a0\nProvide collaboration support with the AML data team on entity level business intelligence related request from regional AML teams.\u00a0\nBachelor degree or above majoring in computer science/engineering, statistics, Mathematics, finance and similar fields; Candidate with experience in AML risk modelling and data analytics are preferred\u00a0\nAt least three years' AML/risk control/fraud management working experience in FI, PI, consulting firms or similar industries\u00a0\nProficient in data handling coding (e.g., SQL) and at least one professional analytics tool such as SAS, python etc.\u00a0\nFamiliar with at least one popular BI tool, such as Power BI, Tableau.\nStrong ability in logical thinking and problem solving\u00a0\nAble to plan & work independently on required deliverables and deadlines\u00a0\nAble to work under pressure and result-oriented\u00a0\nHighly flexible and detail-oriented with the ability to multi-task and change directions based on demand\nHow many years' experience do you have as a data analyst?\nWhich of the following languages are you fluent in?\nHow many years' experience do you have as a business intelligence (BI) analyst?"}
data/02_processed/74354434.json ADDED
@@ -0,0 +1 @@
 
 
1
+ {"id": 74354434, "location": "petaling", "company": "emart24 holdings sdn. bhd.", "job_title": "data analyst", "salary": "RM\u00a06,800 \u2013 RM\u00a07,000 per month", "description": "Develop and maintain dashboards and reports that provide insights into key performance indicators (KPIs) across all business functions.\nPartner with IT team to leverage technology infrastructure and tools to optimize data collection, storage, and analysis processes.\nConduct ad-hoc analysis to answer business questions and identify opportunities for growth and improvement.\nCollaborate with cross-functional teams to identify, analyze and recommend data driven solutions to business problems.\nDevelop and maintain data pipelines to ensure data is accurate, consistent, and available in a timely manner.\nIdentify and recommend BI tools that can help to improve data visibility and analysis across the organization.\nTrain and support end-users on the use of BI tools and dashboards.\nProvide recommendations to stakeholders on how to use data to drive growth and improve bottom-line results.\nContinuously monitor the performance of the business and make recommendations for improvement based on data analysis.\nMaintain a high level of data quality, accuracy, and consistency.\nBachelor's degree in Actuarial Science, Computer Science, Information Technology, or a related field.\nProven experience (3-5 years) in a business intelligence or data analyst role, preferably in a retail or eCommerce environment.\nExperience using BI tools such as Power BI, Tableau, Metabase, or Google Data Studio.\nProficient in SQL and/or Python, and data visualization tools.\nStrong analytical and problem-solving skills with the ability to work with complex data sets.\nAbility to work collaboratively and communicate effectively with cross-functional teams.\nKnowledge of retail or eCommerce industry and data trends.\nExcellent communication skills, both written and verbal, with the ability to communicate complex ideas in a simple and clear manner.\nAbility to prioritize and manage multiple projects simultaneously.\nProficient in both English and Bahasa Malaysia.\nWhich of the following statements best describes your right to work in Malaysia?\nWhat's your expected monthly basic salary?\nHow many years' experience do you have as a data analyst?\nWhich of the following programming languages are you experienced in?\nHow many years' experience do you have using SQL queries?\nWhich of the following Relational Database Management Systems (RDBMS) are you experienced with?\nHow would you rate your English language skills?\nWhich of the following languages are you fluent in?"}
data/02_processed/74360323.json ADDED
@@ -0,0 +1 @@
 
 
1
+ {"id": 74360323, "location": "kuala lumpur", "company": "talentbank sdn bhd", "job_title": "data scientist", "salary": "RM\u00a06,800 \u2013 RM\u00a010,200 per month", "description": "Which of the following programming languages are you experienced in?\nWhat's your expected monthly basic salary?\nWhich of the following data analytics tools are you experienced with?\nHow many years' experience do you have as a data scientist?"}
data/02_processed/74366437.json ADDED
@@ -0,0 +1 @@
 
 
1
+ {"id": 74366437, "location": "kuala lumpur", "company": "asia careers group sdn bhd", "job_title": "data analyst", "salary": "RM\u00a03,700 \u2013 RM\u00a05,500 per month", "description": "Make large and complex data more \naccessible, understandable and usable\n within Tableau\nTransform, improve and integrate\n graduate outcomes and average salary data\nCombining data result\n sets across multiple sources using Excel and or other databased platforms\nMaintain and manage existing data collection tools\n, analytics tools and reporting platforms\nMaintain existing reports and develop new report where necessary to support business needs. \nDelivering the data in a useful and appealing way to clients\nExcellent analytical skills\nAbility to work with large amounts of information and see the \u2018bigger picture\u2019\nComfortable with \njuggling facts, figures, and number crunching\nStrong communication skills\n, both written and oral and \nable to work well with other teams\nProficient in interacting with computer softwares\nCritical thinking\n: able to look at numbers, trends, and data and come to new conclusions based on the findings\nAttention to detail\nExcellent maths skills\nWorking to deadlines\nIndependent\nDrive data collection\n and input data collection with \npreprogrammed scripts\nTo\n collate and merge data sets\n to enable reporting within Tableau\nAnalyse current and past trends\n of key performance indicators, highlighting trends and analysing causes of unexpected variances\nDevelop models and analysis to \nsupport strategic initiatives\nCreate presentations that provide insightful analysis, \nidentify and propose actions for effective decisions\nCreate infographics and generate insight\n for both clients and for use in marketing and promotion\nTo \nperform ad hoc requests\n for data on a day-to-day basis\nMinimum 1 years working experience in data analytics and visualisation\nMust be proficient in Power BI, \nTableau, Excel\n. Coding skills (Python / NodeJS) and graphic designing skills (Adobe Illustrator) are an advantage but not essential\nExperience creating, integrating dashboards to present data in a meaningful and interactive manner\nGood team worker\n and able to multitask & meet tight deadlines\nResults oriented and creative\nRequired language(s): \nProficient in English\n, other foreign languages a plus.\nFull time employment\nCompany provided computer\nExposure to fun and challenging problems in an agile environment\nTransparent company culture and openness to new ideas\nTraining courses in your areas of interest\nWhich of the following languages are you fluent in?\nAre you willing to undergo a pre-employment background check?\nHow much notice are you required to give your current employer?"}
data/02_processed/74433737.json ADDED
@@ -0,0 +1 @@
 
 
1
+ {"id": 74433737, "location": "selangor", "company": "michael page international (malaysia) sdn bhd", "job_title": "data engineer", "salary": "$96k - $97k p.a.", "description": "Attractive remuneration package\nBe part of a globally recognised organisation\nEnhance your development skills"}
data/02_processed/74434092.json ADDED
@@ -0,0 +1 @@
 
 
1
+ {"id": 74434092, "location": "petaling", "company": "michael page international (malaysia) sdn bhd", "job_title": "data analyst - internal audit", "salary": "$102k - $103k p.a.", "description": "Gain understanding of the company's financial, commercial and operational processes.\nPerform a hybrid role of conducting risk-based financial and operational audits with a core function of data analytics and statistical analysis of existing data sets\nQuery and mine large data sets to discover transaction patterns, examine data and filter for targeted information\nPrepare and execute data analysis and interpret the results based on requirements to support the Express audit department and the Data Analytics team\nPrepare data reports and clear data visualizations for stakeholders\nDevelop tools, audit approach and audit programs in strong cooperation with the Express audit team to support the internal audit work\nDevelop automation for data analysis to increase audit efficiency\nPresent key insights using advanced visualization tools, and highlight the data analytics capability to the IA team globally\nPresent and discuss findings/data analysis results with management and create high quality audit reports and working papers in line with Internal Audit standards\nPerform requested follow-up procedures as necessary\nOpenness to conduct audits across various segments of the company's business, including Operations (OPS), Customs, Finance, and Commercial\nBuild and maintain solid relationships with key stakeholders, especially in the business IT area\nManage own development in the professional area proactively\nKnowledge and experience in MS Office and the following desirable applications: \nDatabase query languages (e.g. SQL)\nPower BI, Tableau, Qlik Sense, KNIME\nProgramming and coding (e.g. Python, R, C/C++/C#, Visual Basic)\nCloud computing, Azure, AWS, RPA\nProcess mining, e.g. Celonis, DISCO, Signavio\nExcellent English language skills\nConvincing communicator and negotiator\nAnalytical and conceptual abilities\nAnalytical problem solving and proactive in finding solutions\nHigh personal responsibility and high degree of integrity\nAbility to work under pressure in a demanding environment\nRapid comprehension\nWillingness to travel up to 30% of the time (or more if business needs dictate)\nDatabase query languages (e.g. SQL)\nPower BI, Tableau, Qlik Sense, KNIME\nProgramming and coding (e.g. Python, R, C/C++/C#, Visual Basic)\nCloud computing, Azure, AWS, RPA\nProcess mining, e.g. Celonis, DISCO, Signavio\nMinimum two years of experience in data analytics, preferably within the company\nMinimum bachelor's degree in information technology, computer science, engineering, business administration or comparable qualification/background\nFirst professional experience, preferably within the company\nAttractive remuneration package\nBe part of a globally recognised organisation\nEnhance your analytical skills"}
data/02_processed/74488251.json ADDED
@@ -0,0 +1 @@
 
 
1
+ {"id": 74488251, "location": "kuala lumpur", "company": "liege solutions sdn. bhd.", "job_title": "data engineer/data cloud engineer/salesforce", "salary": "RM\u00a08,000 \u2013 RM\u00a012,000 per month", "description": "Develop, build, and implement Salesforce Data Cloud instances in conjunction with Salesforces (Sales/Service, Salesforce Marketing Cloud, etc.)\nDevelop, oversee, and maintain key data model, segmentation, architecture, code documentation, and technical specifications for designated workstreams.\nLead the development and physical design of solutions, guide the development team, and ensure the system's architectural integrity.\nImplement Salesforce Data Cloud, emphasizing data modelling, performance optimization, and integrations.\nFunction as a technical solution specialist within the Salesforce Data Cloud ecosystem, including components like Data streams, Unified Data Model, Transformations/Insights, Identity Resolution, Segmentations, etc.\u00a0\nDrive end-to-end project execution, encompassing analysis, documentation, solution development, testing, and performance, to meet client expectations.\nIdentify technical design and data model risks, confirm their validity, and devise strategies to mitigate them.\nActively participate in technical design and data model workshops alongside Engineering Teams and business stakeholders.\nClearly translate and communicate technical requirements and solutions to the business using story-based narratives, and present strategies and recommendations to executives.\nEnsure that technical documentation is consistently updated through active participation in technical workshops.\nLead sessions to understand client needs in technical marketing and Salesforce Data Cloud requirements, and act as a crucial technical resource throughout the implementation process.\nAssess client requirements, propose effective solutions, and communicate the capabilities and limitations of Salesforce Data Cloud, Marketing Cloud Engage, Personalization, Intelligence, Sales, Service, and other relevant SF ecosystem platforms.\nOversee and coordinate testing activities, including System Integration Testing (SIT) and User Acceptance Testing (UAT), in line with the project's scope.\nManage workstreams in large-scale enterprise implementations, focusing on Salesforce Data Cloud architecture and data models.\nBachelor's Degree.\nExperience with at least 1 end-to-end Salesforce Data Cloud implementation.\nExperience with at least 1 Salesforce cross-cloud integrations.\n5+ years in data model and architecture in the Salesforce ecosystem.\n5+ years in consulting, project implementation for global based client, or project management using Agile methodologies.\n5+ years in Customer Data Modelling best practices.\nProficiency in moderately complex SQL queries, SOQL, Python.\nExperience in integrating with cloud-based data warehouses/data lakes (e.g. Snowflake, Databricks, etc.).\nEffective communication to both technical and non-technical audiences.\nExperience with Tableu, Data Analytic Platform (e.g. PowerBI).\nExperience with Segmentation strategy.\nExperience with Marketing Cloud Personalization.\nExperience with Adobe, Marketo, Oracle or other enterprise marketing automation or CDP platforms.\nFront-end coding skills (HTML, CSS/SASS, Typescript, React, XML, JSON).\nSalesforce certifications (Salesforce Data Cloud Consultant , and Salesforce Certifications - Marketing Cloud, Admin, etc.).\nA supportive, friendly and professional team environment;\nStability to develop your career with an established and growth focused company;\u00a0\nA range of benefits as part of our Employee Benefit Program;\nWhich of the following statements best describes your right to work in Malaysia?\nWhat's your expected monthly basic salary?\nHow many years' experience do you have as a Salesforce Developer?\nHow many years' experience do you have as a Salesforce Architect?\nHow many years' experience do you have as a Salesforce Marketing Cloud Specialist?\nHow many years' experience do you have as a Salesforce Engineer?"}
data/02_processed/74513015.json ADDED
@@ -0,0 +1 @@
 
 
1
+ {"id": 74513015, "location": "petaling", "company": "ambition group malaysia sdn bhd", "job_title": "senior data scientist (full stack)", "salary": "Attractive Remuneration Package", "description": "Utilize technical proficiency for data gathering, cleaning, and preprocessing from varied sources.\nDevelop, train, and assess machine learning models for solving business problems like predictive analytics and natural language processing.\nConduct statistical analysis, hypothesis testing, and data-driven experimentation; stay updated on emerging trends in data science and machine learning\nDegree/Master in Computer Science, Mathematics, Statistics.\nExperienced in all aspects of the data science lifecycle, including problem formulation, data collection and preprocessing, exploratory data analysis, feature engineering, model selection and training, model evaluation.\nFamiliar working with cloud-based big data platforms (Azure, GCP, AWS)\nA minimum 3 years of hands on experience with Python.\nExcellent communication skills in Bahasa Melayu and English."}
data/02_processed/74552169.json ADDED
@@ -0,0 +1 @@
 
 
1
+ {"id": 74552169, "location": "kuala lumpur", "company": "city facilities management sdn. bhd.", "job_title": "senior business intelligence developer / data engineer", "salary": "RM\u00a08,000 \u2013 RM\u00a012,000 per month", "description": "Translation of business requirements into accurate and interactive data visualizations and analytics that enable management and other stakeholders to make informed decisions.\nDrive data mining & data science initiatives\nReview reports to determine basic characteristics, such as origin and report flow, format, frequency, distribution, and purpose or function of report.\nMay prepare and issue instructions concerning generation, completion, and distribution of reports according to new or revised practices, procedures, or policies of reports management.\nExamine and evaluate purpose and content of business reports to develop new, or improve existing, format, use, and control.\nRecommend the establishment of new or modified reporting methods and procedures to improve report content and completeness of information.\nProduce associated documentation including functional specifications, testing plans and support documentation.\nUnderstand and comprehend new system changes as they affect data, reports, and reporting systems.\nConfer with team members originating, handling, processing, or receiving reports to identify problems and to gather suggestions for improvements.\nManage internal and external stakeholders.\nProvide support and coaching to application support staff and associated team members.\nProvides assistance to support existing infrastructure where appropriate..\nParticipate in application validation and QA efforts as they pertain to reporting, data, metrics, and report creation and execution.\nProvides assistance to support existing infrastructure where appropriate.\nDevelop the appropriate strategies and plans to ensure a successful implementation that conforms to the company agreed SLA\u2019s.\nA relevant advanced technical degree.\nA mastery of Business Intelligence concepts and the ability to articulate these concepts to technical and non-technical audiences.\nAt least 5+ years\u2019 experience in Business Intelligence\nDemonstrable strong data analysis and data mining skills (T-SQL, DAX and MDX).\nStrong data visualisation skills using SSRS & Power BI.\nExposure to Snowflake on Azure platform and Matillion tools is desirable.\nDemonstrable experience with Business Intelligence tools in the Microsoft BI stack (SSAS and SSIS) or Matillion.\nStrong communication skills with the ability to collaborate and contribute to a team positively.\nStrong analytical skills with the ability to translate business needs into technical solutions.\nThe ideal candidate will have experience working in agile projects.\nHigh cognitive ability to problem solving\nOwn and display City values, respect and value others and work as one team. (mandatory all positions)\nMaintain a professional and positive attitude at all times and act with integrity (lead by example).\nHigh level numerical and analytical skills\nDemonstrated attention to detail and accuracy\nMaintain a high standard of delivery on all work and meet deadlines\nDisplay high level of motivation and drive at all times.\nRemain calm and organised at all times.\nAbility to develop rapport to ensure effective working relationship with a broad range of managers and team members.\nDemonstrate initiative and confidentiality across the business.\nTreat others with respect and be conscientious and courteous.\nGood time management, planning and organising ability.\nWhat's your expected monthly basic salary?\n\n\nRole descriptions\n\n\nSalary insights\n\n\nTools to help you prepare for jobs\n\n"}
data/02_processed/74586835.json ADDED
@@ -0,0 +1 @@
 
 
1
+ {"id": 74586835, "location": "kuala lumpur", "company": "michael page international (malaysia) sdn bhd", "job_title": "data engineer", "salary": "$36k - $64k p.a. + Bonus + Other benefits", "description": "Designing and implementing Data pipelines and ETL processes from various sources into our Data Warehouse\nEnsure that data is cleaned, standardized, and properly structured to minimize data inconsistencies and gaps\nCreate and maintain data pipelines to facilitate the smooth flow of data throughout the organization\nWork closely with cross-functional teams to identify data integration needs and implement solutions that enable seamless data movement\nEstablish data quality standards and processes to monitor and maintain the accuracy and integrity of data within Big Query and other data systems\nImplement data validation techniques to identify and rectify invalid or incomplete data entries\nDesign and maintain data schemas to ensure consistency and efficiency in data storage and retrieval\nCollaborate with Tech and Business teams to refine data models and adapt to changing business needs\nDrive data-centric decision-making within the organization\nCollaborate with stakeholders to foster a data-driven culture by promoting data literacy and usage\nBachelor's degree in Computer Science, Engineering or a related field\nProven experience as a Data Engineer or similar role, with a track record of implementing successful data solutions\nProficiency in ETL (Extract, Transform, Load) processes and data integration techniques\nStrong knowledge of database systems and data warehousing concepts\nExperience with Google Cloud Platform (GCP) services"}
data/02_processed/74601731.json ADDED
@@ -0,0 +1 @@
 
 
1
+ {"id": 74601731, "location": "kuala lumpur", "company": "air energy consulting (malaysia) sdn bhd", "job_title": "data analyst internship", "salary": "RM\u00a0900 \u2013 RM\u00a01,000 per month", "description": "Review and update on Bullhorn data for Asia placements.\nWork with the Service Team Australia on updating the Bullhorn data for data accuracy.\nWork closely with the Quality Coordinator on the accuracy and completeness of the data (Client\u2019s PO/WO and Contractors\u2019 employment packages) within the Airswift systems.\nWork with the relevant parties to ensure that any errors are corrected promptly.\nAdministrator and investigate client, assignee and consultant surveys.\nData analysis \u2013 accuracy report that was generated for the entire Asia Pacific Region.\nConsolidate data and dashboards for distribution to SVPs and Global Operations as required.\nManage the Business Continuity plan and Office Safety for the super region and specific country.\nWork with other members of the Regional Commercial Team, ensuring that all requests for review, questions and queries are addressed, and escalating to the relevant people when required.\nThe entire process flow of how a manpower/workforce solution provider organisation functions (Sales/BD -> Recruitment -> Onboarding -> Payroll -> Billing).\nDiploma or Degree in Business Administration or Management or equivalent.\nKnowledge of relevant regulatory requirements in the region.\nExcellent technical skills including all Microsoft Office products and understanding of statistics.\nAbility to problem solve with high attention to detail.\nExcellent written and verbal communication and interpersonal skills.\nMust have a sense of urgency in completing tasks.\nClient service orientated with good teamwork skills.\nOwnership and willingness to accept responsibility.\nVirtual Onboarding Events.\nTeam-driven environment fostering a supportive culture with a strong emphasis on work-life balance.\nCheers! where your peers can nominate you for outstanding performance.\nCoaching and Training are provided.\nGrowth\n - In life and business, one must grow to flourish and achieve high ambitions. Growth requires change, challenge, risk and sacrifice - we will always choose growth.\nSafety\n - We create a safe and inclusive environment to protect the physical and psychological well-being of everyone we work with.\nExcellence\n - We deliver, holding ourselves accountable for results. Our customers see excellence in everything we do.\nIntegrity\n - We are ethical, open, honest and authentic. People trust us to do the right thing for the right reason.\nWhich of the following statements best describes your right to work in Malaysia?"}
data/02_processed/74602585.json ADDED
@@ -0,0 +1 @@
 
 
1
+ {"id": 74602585, "location": "kulim district", "company": "persolkelly workforce solutions malaysia sdn bhd", "job_title": "data analyst/scientist", "salary": "RM\u00a03,500 \u2013 RM\u00a05,000 per month", "description": "What's your expected monthly basic salary?\nHow many years' experience do you have as a data analyst?\nHow many years' experience do you have using SQL queries?"}
data/02_processed/74636692.json ADDED
@@ -0,0 +1 @@
 
 
1
+ {"id": 74636692, "location": "kuala selangor district", "company": "agensi pekerjaan talent recruit sdn. bhd.", "job_title": "data analyst", "salary": "RM\u00a04,000 \u2013 RM\u00a05,000 per month", "description": "Analyze and interpret key indicators in e-commerce\nProvide valuable insights and identify trends based on the sales data and suggest recommendations and key action plans\u00a0\nProvide competitor analysis\u00a0\nEvaluation of the effectiveness of sales and marketing tools and mechanics\nCreating and developing tools for analysis and information processing\u00a0\nPresenting the results of reports and graphical modeling of data\u00a0\nWork closely with the e-Commerce sales team to increase sales and revenue\u00a0\nDevelop and maintain data pipelines to ensure data accuracy, integrity, and availability\nDevelop and maintain e-Commerce dashboards as per client\u2019s requests\nProviding insight to the e-Commerce sales team to optimize marketing expenses\nData Review: reviewing and validating data from various sources (e.g. Lazada, Shopee, etc.) to ensure data quality and accuracy\nDiploma or Bachelor\u2019s degree in Data Science, Statistics, Computer Science, or a related field\u00a0\nMin \n1 - 2 years of experience as a Data Analyst/Business Analyst/Category Management preferably in E-Commerce or FMCG industry\nAdvanced Excel and data mining skills\u00a0\nProficient in data visualization tools (e.g. Power BI) and database query languages (e.g. SQL)\u00a0\nKnowledge of Cloud Computing is an added advantage (e.g. Azure)\u00a0\nFamiliarity with Azure Data Factory and Azure Synapse Pipelines is an added advantage\u00a0\nAbility to work independently and take the lead in projects\u00a0\nStrong presentation skills\nPermanent employment\nWorking hours : Mon - Fri (9am \u2013 6pm) *\nSaturday half day only when required by Management during certain peak events (very occasional)\nWorking location : Puncak Alam\nWhat's your expected monthly basic salary?"}
data/02_processed/74653378.json ADDED
@@ -0,0 +1 @@
 
 
1
+ {"id": 74653378, "location": "kuala selangor district", "company": "agensi pekerjaan js staffing services sdn bhd", "job_title": "data analyst (e-commerce)", "salary": "RM\u00a04,000 \u2013 RM\u00a04,500 per month", "description": "Analyze and interpret key indicators in e-commerce such as sales performance, SKU sales, traffic, conversion rates, buying behavior, pricing, etc.\u00a0\nProvide valuable insights and identify trends based on the sales data and suggest recommendations and key action plans\u00a0\nProvide competitor analysis\u00a0\nEvaluation of the effectiveness of sales and marketing tools and mechanics\u00a0\nCreating and developing tools for analysis and information processing\u00a0\nPresenting the results of reports and graphical modeling of data\u00a0\nWork closely with the E-Commerce sales team to increase sales and revenue\u00a0\nDevelop and maintain data pipelines to ensure data accuracy, integrity, and availability.\u00a0\nDevelop and maintain E-Commerce dashboards as per client\u2019s requests.\u00a0\nProviding insight to the E-Commerce sales team to optimize marketing expenses.\u00a0\nData Review: reviewing and validating data from various sources (e.g. Lazada, Shopee, etc.) to ensure data quality and accuracy.\u00a0\nCandidates must possess at least a Bachelor\u2019s degree in Data Science, Statistics, Computer Science, or a related field\u00a0\nMin. 1-2 years of experience as a Data Analyst/Business Analyst/Category Management preferably in E-Commerce or FMCG role\u00a0\nAdvanced Excel and data mining skills\u00a0\nProficient in data visualization tools (e.g. Power BI) and database query languages (e.g. SQL)\u00a0\nKnowledge of Cloud Computing is an added advantage (e.g. Azure)\u00a0\nFamiliarity with Azure Data Factory and Azure Synapse Pipelines is an added advantage\u00a0\nAbility to work independently and take the lead in projects\u00a0\nStrong presentation skills\u00a0\nWhich of the following statements best describes your right to work in Malaysia?\nWhat's your expected monthly basic salary?"}
data/02_processed/74703677.json ADDED
@@ -0,0 +1 @@
 
 
1
+ {"id": 74703677, "location": "bangsar south", "company": "private advertiser", "job_title": "junior data analyst", "salary": "RM\u00a03,500 \u2013 RM\u00a04,300 per month", "description": "Analyze and manage data related to E-sports events, focusing on Key Performance Indicators (KPIs).\nCreate daily reports highlighting discrepancies or errors identified during the analysis process.\nChart and conduct in-depth analyses of team performance in E-sports events.\nReal-time monitoring and analysis of data, including the production of charts and performance evaluations.\nMaintain daily event reports with important metrics for overall performance.\nCollaborate with team members, ensuring smooth operations and providing support.\nFluency in Mandarin preferred for client interaction.\nStrong analytical and critical thinking skills.\nAbility to work independently under pressure and manage multiple projects with tight schedules.\nFlexibility for shifts and overnight work.\nPassion for E-sports is a plus.\nFresh graduates are welcome to apply.\n13 months\u2019 Salary\nQuarterly and annual bonuses\nMonthly meal, shift, and parking allowances\nMonthly team gatherings & birthday celebrations\nUnlimited snacks & beverages\nTeam building activities & outings\nMedical insurance\n14 days of annual leave\nBirthday leave\n8 hours of work per day\nEPF, SOCSO, EIS contributions\nWhich of the following languages are you fluent in?"}
data/02_processed/74727704.json ADDED
@@ -0,0 +1 @@
 
 
1
+ {"id": 74727704, "location": "kuala lumpur", "company": "private advertiser", "job_title": "machine learning engineer (artificial intelligence)", "salary": "RM\u00a04,000 \u2013 RM\u00a06,000 per month", "description": "Design, develop, and deploy machine learning models and algorithms for complex and unique datasets, using various techniques such as mathematical modeling, scikit-learn, NLP, CNN, RNN, DL, RL, Transformers, GAN, LLM, RAG\nCollaborate with cross-functional teams to extract insights, identify business opportunities and provide data-driven recommendations\nStay up-to-date with the latest machine learning and AI techniques and tools\nCommunicate complex technical concepts to non-technical stakeholders in an easy-to-understand manner\nBachelor's degree or higher in Computer Science, Mathematics, Statistics, Actuarial Science, Informatics, Information Science or related fields\nStrong analytical skills and attention to detail\nParticipation in Kaggle, Mathematics Olympiad or similar competitions is a plus\nExcellent programming skills in Python, R, Java, or C++\nFamiliar with ML frameworks such as Tensorflow, Keras, PyTorch, MLFlow, AutoML, TensorRT, CUDA\nExcellent communication and collaboration skills\nExperience with designing, training, and deploying machine learning models\nCustomer centric and committed to deliver the best AI results to customers\nWhat's your expected monthly basic salary?\n\n\nRole descriptions\n\n\nSalary insights\n\n\nTools to help you prepare for jobs\n\n"}
data/02_processed/74789886.json ADDED
@@ -0,0 +1 @@
 
 
1
+ {"id": 74789886, "location": "petaling", "company": "oriental fortune trading malaysia sdn bhd", "job_title": "junior data analyst", "salary": "RM\u00a03,000 \u2013 RM\u00a04,300 per month", "description": "Data Collection and Cleaning:\n Collect and compile data from various sources, ensuring data accuracy and completeness. Perform data cleaning and transformation as necessary to prepare it for analysis.\nData Analysis:\n Assist in conducting data analysis using statistical methods, data visualization tools, and data analysis software. Summarize findings in clear and concise reports.\nData Visualization:\n Create visually appealing and informative charts, graphs, and dashboards to present data insights to stakeholders. Use tools such as Tableau, Power BI, or Excel for visualization.\nReport Generation:\n Prepare regular and ad-hoc reports to communicate key metrics and findings to the team and management. Ensure reports are accurate and delivered on time.\nData Quality Assurance:\n Help maintain data quality standards by identifying and resolving data discrepancies or inconsistencies. Collaborate with other team members to establish data quality protocols.\nCollaboration: \nWork closely with cross-functional teams to understand data requirements and deliver actionable insights.\nResearch and Learning:\n Stay up-to-date with industry trends and best practices in data analysis. Continuously expand your knowledge and skills in data analytics techniques and tools.\nBachelor's degree in a relevant field such as Finance,\u00a0Data Science, Statistics, Mathematics, Computer Science, or a related discipline.\nCandidate with 1-2 years quantitative analysis or finance background are preferred; \nfresh graduates are welcome as well.\nRequired language(s): \nFluent in English & Mandarin\n are preferred because the role requires candidates to interact with Mandarin-speaking clients.\nStrong analytical skills and a passion for working with data.\nProficiency in data analysis tools and programming languages such as Excel, SQL, Python, or R.\nFamiliarity with data visualization tools (e.g., Tableau, Power BI) is a plus.\nExcellent communication skills, with the ability to translate complex data into understandable insights.\nDetail-oriented and capable of maintaining data accuracy and integrity.\nHighly organized and able to manage multiple tasks and deadlines effectively.\nTeam player with a collaborative mindset and the ability to work well in a diverse team environment.\nEagerness to learn and adapt to new tools, techniques, and technologies in the field of data analysis.\nPossess own car.\nWhat's your expected monthly basic salary?\nHow many years' experience do you have as a data analyst?\nWhich of the following languages are you fluent in?"}
data/02_processed/74850440.json ADDED
@@ -0,0 +1 @@
 
 
1
+ {"id": 74850440, "location": "kuala lumpur", "company": "phenomenal uam sdn bhd", "job_title": "data scientist", "salary": "RM\u00a05,000 \u2013 RM\u00a07,500 per month", "description": "Utilizing machine learning techniques to analyze and interpret complex data sets.\nDeveloping AI models and algorithms to solve specific business problems and improve operational efficiency.\nCollaborating with engineers and software developers to integrate AI solutions into our products and services.\nConducting exploratory data analysis to identify trends, patterns, and insights.\nDesigning and implementing experiments to validate model performance and accuracy.\nCommunicating findings and insights to stakeholders through visualizations, presentations, and reports.\nStaying abreast of the latest advancements in AI and machine learning technologies, and evaluating their potential applications within our organization.\nBachelor's degree in Computer Science, Data Science, Statistics, Mathematics, or a related field. Advanced degree (Master's or Ph.D.) preferred.\nProven experience (X years) working as a Data Scientist or AI Engineer, with a strong focus on developing and implementing machine learning algorithms and AI models.\nProficiency in programming languages such as Python, R, or Java, along with experience using libraries and frameworks such as TensorFlow, PyTorch, or scikit-learn.\nSolid understanding of statistical analysis, hypothesis testing, and experimental design.\nExperience with data visualization tools such as Matplotlib, Seaborn, or Tableau.\nStrong problem-solving skills and the ability to translate business requirements into analytical solutions.\nExcellent communication and collaboration skills, with the ability to effectively convey technical concepts to both technical and non-technical stakeholders.\nFamiliarity with cloud computing platforms (e.g., AWS, Azure, Google Cloud) and big data technologies (e.g., Hadoop, Spark) is a plus.\nPassion for innovation and a keen interest in staying updated with the latest trends and advancements in AI and machine learning."}
data/02_processed/74866336.json ADDED
@@ -0,0 +1 @@
 
 
1
+ {"id": 74866336, "location": "petaling", "company": "elken sdn bhd", "job_title": "manager - data engineer", "salary": "RM\u00a08,000 \u2013 RM\u00a012,000 per month", "description": "Provide database operation support including monitoring, performance tuning, backup and restoration, disaster recovery, troubleshooting, replication setup and capacity management.\nResponsible for installing, configuration, upgrading and maintaining multiple environments including on premise and cloud databases.\nProvide recommendation for future direction of database technology and infrastructure setup.\nDeploy database objects and data patches when required by development team.\nAssist developers with complex query tuning and best practice suggestion.\nDatabase environment setup for Development, UAT and Production.\nDatabase migration for various platform.\nBuild automation script for database tasks.\nCollaborate with Data Analysts, and other stakeholders to understand data requirements and provide data solutions to meet their needs.\nBachelor\u2019s Degree in Computer Science or equivalent in related field.\nAt least 3 years working experience in related field.\nPossess professional database certifications such as Microsoft Certified, Oracle Certified will be added advantage.\nExperience with RDBMS Administration such as MS SQL, My SQL MariaDB, PostgresSQL and Oracle.\nExperience with AWS RDS Cloud database environment administration.\nExperience with Windows Server and Linux environments.\nExperience with Powershell scripting, Linux command line.\nAble to converse in\u00a0\nMandarin\n\u00a0as needed to support oversea team like Hong Kong, China and Taiwan.\nWhat's your expected monthly basic salary?\nWhich of the following Relational Database Management Systems (RDBMS) are you experienced with?"}
data/02_processed/74879920.json ADDED
@@ -0,0 +1 @@
 
 
1
+ {"id": 74879920, "location": "sri hartamas", "company": "mont royale college", "job_title": "data analyst (intern)", "salary": "RM\u00a0600 per month", "description": "Collect, organize, and analyze data from multiple sources to identify trends, patterns, and insights.\nDevelop and maintain databases and data systems necessary for data analysis and reporting.\nGenerate regular reports and dashboards to communicate key findings and metrics to stakeholders.\nCollaborate with departments to understand their data needs and provide analytical support to inform strategic initiatives.\nConduct thorough data quality assessments and implement solutions to address any discrepancies or issues.\nStay updated on industry trends and best practices in data analysis and visualization techniques.\nBachelor's degree in Computer Science, Statistics, Mathematics, Economics, or related field. (Master's degree preferred)\nProven experience working as a data analyst or in a similar role.\nProficiency in data analysis tools and software such as Excel, SQL, Python, R, or Tableau.\nStrong analytical and problem-solving skills with the ability to translate complex data into actionable insights.\nExcellent communication and presentation skills, with the ability to effectively convey technical information to non-technical stakeholders.\nDetail-oriented with a commitment to accuracy and data integrity.\nAbility to work independently and collaboratively in a fast-paced environment.\nWhat's your expected monthly basic salary?"}
data/02_processed/74891040.json ADDED
@@ -0,0 +1 @@
 
 
1
+ {"id": 74891040, "location": "petaling", "company": "agensi pekerjaan hays (malaysia) sdn bhd", "job_title": "principal data engineer", "salary": "$10000.0 - $14000.0 p.m.", "description": "Take ownership of the end-to-end data solution development and operation, including modelling, ETL, scripting, visualisation, testing, deployment, etc.\nDrive maturity of data infrastructure and architecture by designing, implementing, and maintaining the ingestion, ETL, storage, consumption, archival and disposal process and structure for product and organisation.\nBuild data products, including APIs, visualisation tools, analytics and machine learning platforms.\nUtilise ETL to provide actionable insights and intelligence, serving both product offerings and internal data consumption for business functions.\nCoach and mentor the junior team members to develop and improve both technical and non-technical capabilities.\nAt least 3 years of experience in a Data Engineering role.\nStrong command of SQL, Python, Data Modelling, Data Warehousing, ETL/ELT.\nStrong foundation in cloud data services such as AWS Redshift, Azure Synapse, BigQuery, Snowflakes, etc.\nA competitive salary based on performance and output.\nExcellent career support and leadership training.\nExposure to the growing fintech industry and room to grow at an elevated rate."}
data/02_processed/74915330.json ADDED
@@ -0,0 +1 @@
 
 
1
+ {"id": 74915330, "location": "kuala lumpur", "company": "inter island manpower pte ltd", "job_title": "data engineer (python)", "salary": "RM\u00a06,000 \u2013 RM\u00a07,000 per month", "description": "Bachelor's or Master's degree in Computer Science, Engineering, or related fields.\n1-3 years as a Data Engineer with expertise in data engineering concepts.\nHands-on experience with Azure or AWS cloud services, Python/Java/Scala, and data processing frameworks.\nStrong knowledge of data modeling, database technologies, and problem-solving skills.\nExcellent communication, collaboration, and adaptability to fast-paced environments.\nDesign, develop, and maintain data pipelines for diverse sources into data lake/warehouse.\nCollaborate with data scientists for analysis and reporting needs.\nResearch and evaluate data sources for property development.\nDevelop web-scraping scripts and automate pipeline deployment.\nUtilize Azure, AWS, and relevant tech for data solutions.\nIntegrate data between systems/platforms, monitor, and optimize pipelines.\nEnsure data security, compliance, and quality while staying updated on data tech trends.\nHow many years' experience do you have as a Data Engineer?\nWhich of the following programming languages are you experienced in?"}
data/02_processed/74923865.json ADDED
@@ -0,0 +1 @@
 
 
1
+ {"id": 74923865, "location": "petaling", "company": "monkeycloud media sdn. bhd.", "job_title": "data analyst", "salary": "RM\u00a03,000 \u2013 RM\u00a04,500 per month", "description": "Collect and compile data from different sources, including databases, and spreadsheets.\nFiltering and cleaning data to ensure accuracy.\nAnalyze data to identify trends, patterns, and anomalies, and provide actionable insights to support in various team decision-making.\nDevelop and maintain dashboards and reports to track key project performance metrics, including progress and target achievement.\nConduct ad-hoc data analysis to address specific project management questions or issues.\nAssist in the preparation of presentations and reports for project status updates and executive-level reporting.\nDiploma or Degree in any related field\nExperience in data analysis, business analysis or similar role, with a strong focus on data manipulation and analysis is preferred.\nFamiliarity with data analysis tools and concepts, with experience in Power BI being a strong advantage.\n2-3 years working experience as a Business Intelligence Analyst/Data Analyst or any position that related to data and business analytics with strong statistical background.\nPreferably with prior experience in the FMCG industry.\nExperience in data analysis, business analysis or similar role, with a strong focus on data manipulation and analysis is preferred.\nProficiency in data analysis tools and software, such as Power BI, Microsoft Excel (Pivot, Macro) or similar tools.\nStrong organizational skills and the ability to manage multiple tasks and priorities simultaneously.\nA collaborative and team-oriented mindset with the ability to work effectively in a cross-functional environment.\nExcellent communication and presentation skills, with the ability to convey complex information clearly and effectively.\nStrong analytical and problem-solving skills, with the ability to translate data into actionable insights.\nMonday to Friday\n : 8.30a.m to 5.30p.m\u00a0\nHow many years' experience do you have as a data analyst?\nHow many years' experience do you have as a Power BI Consultant?\nHow would you rate your Mandarin language skills?\nWhich of the following data analytics tools are you experienced with?\nWhich of the following Relational Database Management Systems (RDBMS) are you experienced with?"}
data/02_processed/74946043.json ADDED
@@ -0,0 +1 @@
 
 
1
+ {"id": 74946043, "location": "bandar tasik selatan", "company": "kk group of companies", "job_title": "data engineer lead", "salary": "RM\u00a07,000 \u2013 RM\u00a08,000 per month", "description": "Making large and/or complex data more accessible, understandable and usable\nTransforming, improving and integrating data, depending on the business requirements\nDelivering the data in a useful and appealing way to users\nWork with noisy, dirty, and unstructured data, data cleansing, scraping unstructured data, and converting into structured data\nBuild and maintain data pipelines, ETL, databases and applications to acquire, store and manage data\nChoose, design, test and implement data software and solutions for projects and enterprise\nSelecting and integrating any Big Data tools and frameworks required to provide requested capabilities\nMonitoring performance and advising necessary infrastructure changes to optimize performance and cost\nArchitect, develop and implement end-to-end automated data flows for data and analytical needs\nPerform data profiling, data quality assessment, data cleansing and data transformation\nRecommend appropriate solutions architecture that is highly scalable based on business requirement\nCreate date tools for data scientists and data visualization team members\nBuilds the infrastructure required for optimal extraction, transformation, and loading for data from a wide variety of data sources using SQL, noSQL and cloud (AWS / Azure / Google / Alibaba / DigitalOcean) technologies\nBenchmark, evaluate and improve the scalability, robustness, efficiency and performance of big data platforms and applications\nEngage in the delivery of analytics assignments for business users involving, amongst others; data preparation, data visualization, creating business insights etc\nContribute and facilitate workshops/interviews bridging the gap between data analytics and business outcomes\nProvide strategic and tactical advice to business units by translating analytical solutions into out-of-the-box execution recommendations addressing business objectives and impacting bottom line results\nCombine your technical expertise with the ability to architect, deploy and oversee integrated analytics/business intelligence/technology solutions\nBuild case for analytics through need analysis and business case (quantify likely benefits)\nConsult with leaders of business teams and manage expectations, meet project and deliverable timelines, ensure accurate and timely completion of deliverables\nCollaborate with several other teams and roles across the organization on requirements gathering and solution deployment\nMake analytical product, service or process decisions likely to impact multiple groups of employees and/or customers (internal or external)\nManage technology solution portfolios, and identifying new technology approaches to solve business problems with a strong focus on leveraging enterprise data\nContribute to the building and management of the analytics infrastructure\nWork with management team on staffing needs, strategic technology trend identification, monitoring project progress and completion, issue resolution and technical solutions related to data, data flow and advising on algorithm deployment\nCollaborate with analytics practitioners, business analysts and methodologists to design and oversee the development of working prototypes of technical solutions\nTechnical support and lead for the Data Engineer team\nWork with project teams to assist with data-related technical issues and support their data infrastructure needs\nWhat's your expected monthly basic salary?\nHow many years' experience do you have as a Data Engineer?\nHow many years' experience do you have as a Tableau Specialist?\nHow many years' experience do you have as a Power BI Developer?\nHave you completed an Amazon Web Services Developer certification?\nHave you completed a Cloudera Certified Professional Data Engineer certification?\nHow many years' experience do you have as an ETL Data Warehouse Programmer?\nHow much notice are you required to give your current employer?"}
data/02_processed/74976890.json ADDED
@@ -0,0 +1 @@
 
 
1
+ {"id": 74976890, "location": "kuala lumpur", "company": "alipay malaysia sdn. bhd.", "job_title": "risk operation data analyst ( online payment)", "salary": "RM\u00a010,000 \u2013 RM\u00a015,000 per month", "description": "What's your expected monthly basic salary?\nWhich of the following languages are you fluent in?\nHow many years' experience do you have as an Operational Risk Analyst?"}
data/02_processed/metadata/job_vacancy_metadata.json CHANGED
@@ -1 +1 @@
1
- {"data engineer": ["data analysts/ data engineers (bangsar south- kerinchi lrt)", "data engineer"], "data scientist": [], "data analyst": ["data analysts/ data engineers (bangsar south- kerinchi lrt)", "junior data analyst", "data analyst executive", "e-commerce data analyst", "data analyst"], "machine learning engineer": ["machine learning engineer (artificial intelligence)", "machine learning engineer (ai)"]}
 
1
+ {"data engineer": ["senior business intelligence developer / data engineer", "data engineer (python)", "big data engineer - cloud", "principal data engineer", "data engineer lead", "data engineer", "data engineer/data cloud engineer/salesforce", "manager - data engineer", "data analysts/ data engineers (bangsar south- kerinchi lrt)"], "data scientist": ["senior data scientist (full stack)", "data scientist", "data scientist assistant manager"], "data analyst": ["e-commerce data analyst", "risk operation data analyst ( online payment)", "data analyst/scientist", "data analyst (e-commerce)", "senior data analyst", "data analyst (intern)", "junior data analyst", "data analyst", "data analyst - internal audit", "data analyst internship", "data analyst executive", "data analysts/ data engineers (bangsar south- kerinchi lrt)"], "machine learning engineer": ["machine learning engineer (artificial intelligence)", "machine learning engineer (ai)"]}
notebooks/ETL_for_Kaggle_dataset.ipynb CHANGED
@@ -1,5 +1,12 @@
1
  {
2
  "cells": [
 
 
 
 
 
 
 
3
  {
4
  "cell_type": "code",
5
  "execution_count": 1,
@@ -46,7 +53,6 @@
46
  "conf_file = \".env\"\n",
47
  "\n",
48
  "data_dir = os.path.join(\"data\", \"01_raw\")\n",
49
- "data_file = \"data_scientist_jobstreet_scraped_v2 2.csv\"\n",
50
  "\n",
51
  "res_dir = os.path.join(\"data\", \"02_processed\")\n",
52
  "\n",
@@ -75,37 +81,70 @@
75
  "metadata": {},
76
  "outputs": [],
77
  "source": [
78
- "data_path = os.path.join(data_dir, data_file)\n",
79
- "\n",
80
- "data_df = pd.read_csv(data_path, sep=\",\")"
81
  ]
82
  },
83
  {
84
  "cell_type": "code",
85
  "execution_count": 6,
86
  "metadata": {},
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
87
  "outputs": [
88
  {
89
  "name": "stdout",
90
  "output_type": "stream",
91
  "text": [
92
  "<class 'pandas.core.frame.DataFrame'>\n",
93
- "RangeIndex: 606 entries, 0 to 605\n",
94
- "Data columns (total 10 columns):\n",
95
- " # Column Non-Null Count Dtype \n",
96
- "--- ------ -------------- ----- \n",
97
- " 0 Unnamed: 0 606 non-null int64 \n",
98
- " 1 job_id 606 non-null int64 \n",
99
- " 2 job_title 606 non-null object\n",
100
- " 3 company 606 non-null object\n",
101
- " 4 descriptions 588 non-null object\n",
102
- " 5 location 606 non-null object\n",
103
- " 6 category 606 non-null object\n",
104
- " 7 subcategory 606 non-null object\n",
105
- " 8 type 606 non-null object\n",
106
- " 9 salary 198 non-null object\n",
107
- "dtypes: int64(2), object(8)\n",
108
- "memory usage: 47.5+ KB\n"
109
  ]
110
  }
111
  ],
@@ -115,7 +154,7 @@
115
  },
116
  {
117
  "cell_type": "code",
118
- "execution_count": 7,
119
  "metadata": {},
120
  "outputs": [
121
  {
@@ -139,130 +178,92 @@
139
  " <thead>\n",
140
  " <tr style=\"text-align: right;\">\n",
141
  " <th></th>\n",
142
- " <th>Unnamed: 0</th>\n",
143
  " <th>job_id</th>\n",
144
  " <th>job_title</th>\n",
145
  " <th>company</th>\n",
146
- " <th>descriptions</th>\n",
147
  " <th>location</th>\n",
148
- " <th>category</th>\n",
149
- " <th>subcategory</th>\n",
150
- " <th>type</th>\n",
151
  " <th>salary</th>\n",
 
152
  " </tr>\n",
153
  " </thead>\n",
154
  " <tbody>\n",
155
  " <tr>\n",
156
- " <th>381</th>\n",
157
- " <td>381</td>\n",
158
- " <td>72235414</td>\n",
159
- " <td>Data Scientist (Credit)</td>\n",
160
- " <td>Michael Page International (Malaysia) Sdn Bhd</td>\n",
161
- " <td>Lead requirement gathering discussions for use...</td>\n",
162
  " <td>Kuala Lumpur</td>\n",
163
- " <td>Banking &amp; Financial Services</td>\n",
164
- " <td>Analysis &amp; Reporting</td>\n",
165
- " <td>Full time</td>\n",
166
- " <td>NaN</td>\n",
167
  " </tr>\n",
168
  " <tr>\n",
169
- " <th>469</th>\n",
170
- " <td>469</td>\n",
171
- " <td>72700478</td>\n",
172
- " <td>RPA Manager (UiPath)</td>\n",
173
- " <td>Ambition Group Malaysia Sdn Bhd</td>\n",
174
- " <td>Develop and execute a comprehensive UiPath aut...</td>\n",
175
  " <td>Kuala Lumpur</td>\n",
176
- " <td>Information &amp; Communication Technology</td>\n",
177
- " <td>Team Leaders</td>\n",
178
- " <td>Full time</td>\n",
179
- " <td>RM 7,000 – RM 10,000 per month</td>\n",
180
  " </tr>\n",
181
  " <tr>\n",
182
- " <th>167</th>\n",
183
- " <td>167</td>\n",
184
- " <td>72376611</td>\n",
185
- " <td>Head of Advanced Data Analytics</td>\n",
186
- " <td>Agensi Pekerjaan Hays (Malaysia) Sdn Bhd</td>\n",
187
- " <td>Being the point-of-contact for advanced analyt...</td>\n",
188
  " <td>Kuala Lumpur</td>\n",
189
- " <td>Science &amp; Technology</td>\n",
190
- " <td>Mathematics, Statistics &amp; Information Sciences</td>\n",
191
- " <td>Full time</td>\n",
192
- " <td>RM 20,000 – RM 30,000 per month</td>\n",
193
  " </tr>\n",
194
  " <tr>\n",
195
- " <th>314</th>\n",
196
- " <td>314</td>\n",
197
- " <td>72750107</td>\n",
198
- " <td>IT Business Analyst</td>\n",
199
- " <td>Mewah Group</td>\n",
200
- " <td>To develop new software applications and custo...</td>\n",
201
- " <td>Klang District</td>\n",
202
- " <td>Information &amp; Communication Technology</td>\n",
203
- " <td>Business/Systems Analysts</td>\n",
204
- " <td>Full time</td>\n",
205
  " <td>NaN</td>\n",
 
206
  " </tr>\n",
207
  " <tr>\n",
208
- " <th>5</th>\n",
209
- " <td>5</td>\n",
210
- " <td>72762762</td>\n",
211
- " <td>Senior Engineer / Engineer - Data Management &amp;...</td>\n",
212
- " <td>B. Braun Medical Industries Sdn. Bhd.</td>\n",
213
- " <td>Build, maintain and utilize a variety of datab...</td>\n",
214
- " <td>Penang</td>\n",
215
- " <td>Engineering</td>\n",
216
- " <td>Other</td>\n",
217
- " <td>Full time</td>\n",
218
  " <td>NaN</td>\n",
 
219
  " </tr>\n",
220
  " </tbody>\n",
221
  "</table>\n",
222
  "</div>"
223
  ],
224
  "text/plain": [
225
- " Unnamed: 0 job_id job_title \\\n",
226
- "381 381 72235414 Data Scientist (Credit) \n",
227
- "469 469 72700478 RPA Manager (UiPath) \n",
228
- "167 167 72376611 Head of Advanced Data Analytics \n",
229
- "314 314 72750107 IT Business Analyst \n",
230
- "5 5 72762762 Senior Engineer / Engineer - Data Management &... \n",
231
- "\n",
232
- " company \\\n",
233
- "381 Michael Page International (Malaysia) Sdn Bhd \n",
234
- "469 Ambition Group Malaysia Sdn Bhd \n",
235
- "167 Agensi Pekerjaan Hays (Malaysia) Sdn Bhd \n",
236
- "314 Mewah Group \n",
237
- "5 B. Braun Medical Industries Sdn. Bhd. \n",
238
  "\n",
239
- " descriptions location \\\n",
240
- "381 Lead requirement gathering discussions for use... Kuala Lumpur \n",
241
- "469 Develop and execute a comprehensive UiPath aut... Kuala Lumpur \n",
242
- "167 Being the point-of-contact for advanced analyt... Kuala Lumpur \n",
243
- "314 To develop new software applications and custo... Klang District \n",
244
- "5 Build, maintain and utilize a variety of datab... Penang \n",
245
  "\n",
246
- " category \\\n",
247
- "381 Banking & Financial Services \n",
248
- "469 Information & Communication Technology \n",
249
- "167 Science & Technology \n",
250
- "314 Information & Communication Technology \n",
251
- "5 Engineering \n",
252
  "\n",
253
- " subcategory type \\\n",
254
- "381 Analysis & Reporting Full time \n",
255
- "469 Team Leaders Full time \n",
256
- "167 Mathematics, Statistics & Information Sciences Full time \n",
257
- "314 Business/Systems Analysts Full time \n",
258
- "5 Other Full time \n",
259
- "\n",
260
- " salary \n",
261
- "381 NaN \n",
262
- "469 RM 7,000 – RM 10,000 per month \n",
263
- "167 RM 20,000 – RM 30,000 per month \n",
264
- "314 NaN \n",
265
- "5 NaN "
266
  ]
267
  },
268
  "metadata": {},
@@ -275,25 +276,25 @@
275
  },
276
  {
277
  "cell_type": "code",
278
- "execution_count": 8,
279
  "metadata": {},
280
  "outputs": [],
281
  "source": [
282
- "data_clean_df = data_df[['job_id', 'location', 'company', 'job_title', 'salary', 'descriptions']].dropna().drop_duplicates().copy()"
283
  ]
284
  },
285
  {
286
  "cell_type": "code",
287
- "execution_count": 9,
288
  "metadata": {},
289
  "outputs": [
290
  {
291
  "data": {
292
  "text/plain": [
293
- "32"
294
  ]
295
  },
296
- "execution_count": 9,
297
  "metadata": {},
298
  "output_type": "execute_result"
299
  }
@@ -304,7 +305,7 @@
304
  },
305
  {
306
  "cell_type": "code",
307
- "execution_count": 10,
308
  "metadata": {},
309
  "outputs": [
310
  {
@@ -334,14 +335,29 @@
334
  " </thead>\n",
335
  " <tbody>\n",
336
  " <tr>\n",
337
- " <th>4</th>\n",
338
  " <td>Data Analyst</td>\n",
339
- " <td>5</td>\n",
340
  " </tr>\n",
341
  " <tr>\n",
342
- " <th>7</th>\n",
343
  " <td>Data Engineer</td>\n",
344
- " <td>4</td>\n",
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
345
  " </tr>\n",
346
  " <tr>\n",
347
  " <th>0</th>\n",
@@ -349,112 +365,182 @@
349
  " <td>1</td>\n",
350
  " </tr>\n",
351
  " <tr>\n",
352
- " <th>14</th>\n",
 
 
 
 
 
353
  " <td>Intern - Data Science</td>\n",
354
  " <td>1</td>\n",
355
  " </tr>\n",
356
  " <tr>\n",
357
- " <th>23</th>\n",
358
- " <td>System Analyst</td>\n",
359
  " <td>1</td>\n",
360
  " </tr>\n",
361
  " <tr>\n",
362
- " <th>22</th>\n",
363
- " <td>Scientist (Bioinformatics)</td>\n",
364
  " <td>1</td>\n",
365
  " </tr>\n",
366
  " <tr>\n",
367
- " <th>21</th>\n",
368
- " <td>RPA Manager (UiPath)</td>\n",
369
  " <td>1</td>\n",
370
  " </tr>\n",
371
  " <tr>\n",
372
- " <th>20</th>\n",
 
 
 
 
 
 
 
 
 
 
373
  " <td>Principal Engineer, Automation Project</td>\n",
374
  " <td>1</td>\n",
375
  " </tr>\n",
376
  " <tr>\n",
377
- " <th>19</th>\n",
378
- " <td>MicroStrategy Developers (MNC)</td>\n",
379
  " <td>1</td>\n",
380
  " </tr>\n",
381
  " <tr>\n",
382
- " <th>18</th>\n",
383
- " <td>Manager, RPA Developer</td>\n",
384
  " <td>1</td>\n",
385
  " </tr>\n",
386
  " <tr>\n",
387
- " <th>17</th>\n",
388
- " <td>Machine Learning Engineer (Artificial Intellig...</td>\n",
389
  " <td>1</td>\n",
390
  " </tr>\n",
391
  " <tr>\n",
392
- " <th>16</th>\n",
393
- " <td>Machine Learning Engineer (AI)</td>\n",
394
  " <td>1</td>\n",
395
  " </tr>\n",
396
  " <tr>\n",
397
- " <th>15</th>\n",
398
- " <td>Junior Data Analyst</td>\n",
399
  " <td>1</td>\n",
400
  " </tr>\n",
401
  " <tr>\n",
402
- " <th>12</th>\n",
403
- " <td>Head of Advanced Data Analytics</td>\n",
404
  " <td>1</td>\n",
405
  " </tr>\n",
406
  " <tr>\n",
407
- " <th>13</th>\n",
 
 
 
 
 
408
  " <td>IT Business Analyst</td>\n",
409
  " <td>1</td>\n",
410
  " </tr>\n",
411
  " <tr>\n",
412
- " <th>1</th>\n",
413
- " <td>BUSINESS SYSTEM ANALYST</td>\n",
414
  " <td>1</td>\n",
415
  " </tr>\n",
416
  " <tr>\n",
417
- " <th>11</th>\n",
418
  " <td>ERP System Analyst</td>\n",
419
  " <td>1</td>\n",
420
  " </tr>\n",
421
  " <tr>\n",
422
  " <th>10</th>\n",
423
- " <td>E-commerce Data Analyst</td>\n",
424
  " <td>1</td>\n",
425
  " </tr>\n",
426
  " <tr>\n",
427
- " <th>9</th>\n",
428
- " <td>Database Administrator (DBA)</td>\n",
429
  " <td>1</td>\n",
430
  " </tr>\n",
431
  " <tr>\n",
432
- " <th>8</th>\n",
433
- " <td>Database Administrator</td>\n",
 
 
 
 
 
434
  " <td>1</td>\n",
435
  " </tr>\n",
436
  " <tr>\n",
437
  " <th>6</th>\n",
438
- " <td>Data Analysts/ Data Engineers (Bangsar South- ...</td>\n",
439
  " <td>1</td>\n",
440
  " </tr>\n",
441
  " <tr>\n",
442
- " <th>5</th>\n",
 
 
 
 
 
 
 
 
 
 
443
  " <td>Data Analyst Executive</td>\n",
444
  " <td>1</td>\n",
445
  " </tr>\n",
446
  " <tr>\n",
447
- " <th>3</th>\n",
448
- " <td>DATA ANALYST</td>\n",
449
  " <td>1</td>\n",
450
  " </tr>\n",
451
  " <tr>\n",
452
- " <th>2</th>\n",
453
- " <td>Contract Business Administration Analyst</td>\n",
454
  " <td>1</td>\n",
455
  " </tr>\n",
456
  " <tr>\n",
457
- " <th>24</th>\n",
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
458
  " <td>System Analyst - IT</td>\n",
459
  " <td>1</td>\n",
460
  " </tr>\n",
@@ -464,34 +550,51 @@
464
  ],
465
  "text/plain": [
466
  " job_title job_id\n",
467
- "4 Data Analyst 5\n",
468
- "7 Data Engineer 4\n",
 
 
 
469
  "0 BI Developer - Immediate Hiring 1\n",
470
- "14 Intern - Data Science 1\n",
471
- "23 System Analyst 1\n",
472
- "22 Scientist (Bioinformatics) 1\n",
473
- "21 RPA Manager (UiPath) 1\n",
474
- "20 Principal Engineer, Automation Project 1\n",
475
- "19 MicroStrategy Developers (MNC) 1\n",
476
- "18 Manager, RPA Developer 1\n",
477
- "17 Machine Learning Engineer (Artificial Intellig... 1\n",
478
- "16 Machine Learning Engineer (AI) 1\n",
479
- "15 Junior Data Analyst 1\n",
480
- "12 Head of Advanced Data Analytics 1\n",
481
- "13 IT Business Analyst 1\n",
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
482
  "1 BUSINESS SYSTEM ANALYST 1\n",
483
- "11 ERP System Analyst 1\n",
484
- "10 E-commerce Data Analyst 1\n",
485
- "9 Database Administrator (DBA) 1\n",
486
- "8 Database Administrator 1\n",
487
- "6 Data Analysts/ Data Engineers (Bangsar South- ... 1\n",
488
- "5 Data Analyst Executive 1\n",
489
- "3 DATA ANALYST 1\n",
490
- "2 Contract Business Administration Analyst 1\n",
491
- "24 System Analyst - IT 1"
492
  ]
493
  },
494
- "execution_count": 10,
495
  "metadata": {},
496
  "output_type": "execute_result"
497
  }
@@ -502,7 +605,7 @@
502
  },
503
  {
504
  "cell_type": "code",
505
- "execution_count": 11,
506
  "metadata": {},
507
  "outputs": [],
508
  "source": [
@@ -514,7 +617,7 @@
514
  " \"company\": row['company'].lower(),\n",
515
  " \"job_title\": row['job_title'].lower(),\n",
516
  " \"salary\": row['salary'],\n",
517
- " \"description\": row['descriptions'],\n",
518
  " }\n",
519
  "\n",
520
  " res_file = \"{job_id}.json\".format(job_id=row['job_id'])\n",
@@ -526,25 +629,41 @@
526
  },
527
  {
528
  "cell_type": "code",
529
- "execution_count": 12,
530
  "metadata": {},
531
  "outputs": [
532
  {
533
  "data": {
534
  "text/plain": [
535
- "{'data engineer': ['data analysts/ data engineers (bangsar south- kerinchi lrt)',\n",
536
- " 'data engineer'],\n",
537
- " 'data scientist': [],\n",
538
- " 'data analyst': ['data analysts/ data engineers (bangsar south- kerinchi lrt)',\n",
 
 
 
 
 
 
 
 
 
 
 
 
 
 
539
  " 'junior data analyst',\n",
 
 
 
540
  " 'data analyst executive',\n",
541
- " 'e-commerce data analyst',\n",
542
- " 'data analyst'],\n",
543
  " 'machine learning engineer': ['machine learning engineer (artificial intelligence)',\n",
544
  " 'machine learning engineer (ai)']}"
545
  ]
546
  },
547
- "execution_count": 12,
548
  "metadata": {},
549
  "output_type": "execute_result"
550
  }
@@ -564,7 +683,7 @@
564
  },
565
  {
566
  "cell_type": "code",
567
- "execution_count": 14,
568
  "metadata": {},
569
  "outputs": [],
570
  "source": [
 
1
  {
2
  "cells": [
3
+ {
4
+ "cell_type": "markdown",
5
+ "metadata": {},
6
+ "source": [
7
+ "The main purpose of this notebook is to parse the file with the scraped jobs into a json file per job."
8
+ ]
9
+ },
10
  {
11
  "cell_type": "code",
12
  "execution_count": 1,
 
53
  "conf_file = \".env\"\n",
54
  "\n",
55
  "data_dir = os.path.join(\"data\", \"01_raw\")\n",
 
56
  "\n",
57
  "res_dir = os.path.join(\"data\", \"02_processed\")\n",
58
  "\n",
 
81
  "metadata": {},
82
  "outputs": [],
83
  "source": [
84
+ "# Get list of file names in ingestion folder\n",
85
+ "file_names = [file for file in os.listdir(data_dir) if \".csv\" in file]"
 
86
  ]
87
  },
88
  {
89
  "cell_type": "code",
90
  "execution_count": 6,
91
  "metadata": {},
92
+ "outputs": [
93
+ {
94
+ "data": {
95
+ "text/plain": [
96
+ "['data_scientist_jobstreet_scraped_v2 2.csv',\n",
97
+ " 'data_jobs_scraped_from_jobstreet_2024-04-07.csv']"
98
+ ]
99
+ },
100
+ "execution_count": 6,
101
+ "metadata": {},
102
+ "output_type": "execute_result"
103
+ }
104
+ ],
105
+ "source": [
106
+ "file_names"
107
+ ]
108
+ },
109
+ {
110
+ "cell_type": "code",
111
+ "execution_count": 16,
112
+ "metadata": {},
113
+ "outputs": [],
114
+ "source": [
115
+ "columns = ['job_id', 'job_title', 'company', 'location', 'salary', 'description']\n",
116
+ "\n",
117
+ "for file_name in file_names:\n",
118
+ " data_path = os.path.join(data_dir, file_name)\n",
119
+ " if 'data_df' not in locals():\n",
120
+ " data_df = pd.read_csv(data_path, sep=\",\")[columns]\n",
121
+ " else:\n",
122
+ " data_file = pd.read_csv(data_path, sep=\",\")[columns]\n",
123
+ " data_df = pd.concat([data_df, data_file])"
124
+ ]
125
+ },
126
+ {
127
+ "cell_type": "code",
128
+ "execution_count": 8,
129
+ "metadata": {},
130
  "outputs": [
131
  {
132
  "name": "stdout",
133
  "output_type": "stream",
134
  "text": [
135
  "<class 'pandas.core.frame.DataFrame'>\n",
136
+ "Index: 635 entries, 0 to 28\n",
137
+ "Data columns (total 6 columns):\n",
138
+ " # Column Non-Null Count Dtype \n",
139
+ "--- ------ -------------- ----- \n",
140
+ " 0 job_id 635 non-null int64 \n",
141
+ " 1 job_title 635 non-null object\n",
142
+ " 2 company 635 non-null object\n",
143
+ " 3 location 635 non-null object\n",
144
+ " 4 salary 227 non-null object\n",
145
+ " 5 description 617 non-null object\n",
146
+ "dtypes: int64(1), object(5)\n",
147
+ "memory usage: 34.7+ KB\n"
 
 
 
 
148
  ]
149
  }
150
  ],
 
154
  },
155
  {
156
  "cell_type": "code",
157
+ "execution_count": 9,
158
  "metadata": {},
159
  "outputs": [
160
  {
 
178
  " <thead>\n",
179
  " <tr style=\"text-align: right;\">\n",
180
  " <th></th>\n",
 
181
  " <th>job_id</th>\n",
182
  " <th>job_title</th>\n",
183
  " <th>company</th>\n",
 
184
  " <th>location</th>\n",
 
 
 
185
  " <th>salary</th>\n",
186
+ " <th>description</th>\n",
187
  " </tr>\n",
188
  " </thead>\n",
189
  " <tbody>\n",
190
  " <tr>\n",
191
+ " <th>167</th>\n",
192
+ " <td>72376611</td>\n",
193
+ " <td>Head of Advanced Data Analytics</td>\n",
194
+ " <td>Agensi Pekerjaan Hays (Malaysia) Sdn Bhd</td>\n",
 
 
195
  " <td>Kuala Lumpur</td>\n",
196
+ " <td>RM 20,000 RM 30,000 per month</td>\n",
197
+ " <td>Being the point-of-contact for advanced analyt...</td>\n",
 
 
198
  " </tr>\n",
199
  " <tr>\n",
200
+ " <th>101</th>\n",
201
+ " <td>72761527</td>\n",
202
+ " <td>Data Engineer</td>\n",
203
+ " <td>ANHSIN TECHNOLOGY SDN BHD</td>\n",
 
 
204
  " <td>Kuala Lumpur</td>\n",
205
+ " <td>NaN</td>\n",
206
+ " <td>Design, develop, and maintain scalable and rob...</td>\n",
 
 
207
  " </tr>\n",
208
  " <tr>\n",
209
+ " <th>376</th>\n",
210
+ " <td>72531370</td>\n",
211
+ " <td>ERP Analyst / Application Analyst</td>\n",
212
+ " <td>Amazon Papyrus (M) Sdn Bhd</td>\n",
 
 
213
  " <td>Kuala Lumpur</td>\n",
214
+ " <td>NaN</td>\n",
215
+ " <td>Based in Malaysia, Work from home arrangement\\...</td>\n",
 
 
216
  " </tr>\n",
217
  " <tr>\n",
218
+ " <th>430</th>\n",
219
+ " <td>72683703</td>\n",
220
+ " <td>Data Engineer</td>\n",
221
+ " <td>DATABRICKS TECH SDN. BHD.</td>\n",
222
+ " <td>Bukit Jalil</td>\n",
 
 
 
 
 
223
  " <td>NaN</td>\n",
224
+ " <td>Maintain, backup, secure and transform data fr...</td>\n",
225
  " </tr>\n",
226
  " <tr>\n",
227
+ " <th>161</th>\n",
228
+ " <td>72403257</td>\n",
229
+ " <td>Data Engineer</td>\n",
230
+ " <td>KINESSO MALAYSIA SDN. BHD. (fka Mediabrands Gl...</td>\n",
231
+ " <td>Petaling</td>\n",
 
 
 
 
 
232
  " <td>NaN</td>\n",
233
+ " <td>Perform daily, weekly and monthly monitoring o...</td>\n",
234
  " </tr>\n",
235
  " </tbody>\n",
236
  "</table>\n",
237
  "</div>"
238
  ],
239
  "text/plain": [
240
+ " job_id job_title \\\n",
241
+ "167 72376611 Head of Advanced Data Analytics \n",
242
+ "101 72761527 Data Engineer \n",
243
+ "376 72531370 ERP Analyst / Application Analyst \n",
244
+ "430 72683703 Data Engineer \n",
245
+ "161 72403257 Data Engineer \n",
 
 
 
 
 
 
 
246
  "\n",
247
+ " company location \\\n",
248
+ "167 Agensi Pekerjaan Hays (Malaysia) Sdn Bhd Kuala Lumpur \n",
249
+ "101 ANHSIN TECHNOLOGY SDN BHD Kuala Lumpur \n",
250
+ "376 Amazon Papyrus (M) Sdn Bhd Kuala Lumpur \n",
251
+ "430 DATABRICKS TECH SDN. BHD. Bukit Jalil \n",
252
+ "161 KINESSO MALAYSIA SDN. BHD. (fka Mediabrands Gl... Petaling \n",
253
  "\n",
254
+ " salary \\\n",
255
+ "167 RM 20,000 RM 30,000 per month \n",
256
+ "101 NaN \n",
257
+ "376 NaN \n",
258
+ "430 NaN \n",
259
+ "161 NaN \n",
260
  "\n",
261
+ " description \n",
262
+ "167 Being the point-of-contact for advanced analyt... \n",
263
+ "101 Design, develop, and maintain scalable and rob... \n",
264
+ "376 Based in Malaysia, Work from home arrangement\\... \n",
265
+ "430 Maintain, backup, secure and transform data fr... \n",
266
+ "161 Perform daily, weekly and monthly monitoring o... "
 
 
 
 
 
 
 
267
  ]
268
  },
269
  "metadata": {},
 
276
  },
277
  {
278
  "cell_type": "code",
279
+ "execution_count": 10,
280
  "metadata": {},
281
  "outputs": [],
282
  "source": [
283
+ "data_clean_df = data_df[['job_id', 'location', 'company', 'job_title', 'salary', 'description']].dropna().drop_duplicates().copy()"
284
  ]
285
  },
286
  {
287
  "cell_type": "code",
288
+ "execution_count": 11,
289
  "metadata": {},
290
  "outputs": [
291
  {
292
  "data": {
293
  "text/plain": [
294
+ "61"
295
  ]
296
  },
297
+ "execution_count": 11,
298
  "metadata": {},
299
  "output_type": "execute_result"
300
  }
 
305
  },
306
  {
307
  "cell_type": "code",
308
+ "execution_count": 12,
309
  "metadata": {},
310
  "outputs": [
311
  {
 
335
  " </thead>\n",
336
  " <tbody>\n",
337
  " <tr>\n",
338
+ " <th>5</th>\n",
339
  " <td>Data Analyst</td>\n",
340
+ " <td>10</td>\n",
341
  " </tr>\n",
342
  " <tr>\n",
343
+ " <th>13</th>\n",
344
  " <td>Data Engineer</td>\n",
345
+ " <td>6</td>\n",
346
+ " </tr>\n",
347
+ " <tr>\n",
348
+ " <th>26</th>\n",
349
+ " <td>Junior Data Analyst</td>\n",
350
+ " <td>3</td>\n",
351
+ " </tr>\n",
352
+ " <tr>\n",
353
+ " <th>17</th>\n",
354
+ " <td>Data Scientist</td>\n",
355
+ " <td>3</td>\n",
356
+ " </tr>\n",
357
+ " <tr>\n",
358
+ " <th>28</th>\n",
359
+ " <td>Machine Learning Engineer (Artificial Intellig...</td>\n",
360
+ " <td>2</td>\n",
361
  " </tr>\n",
362
  " <tr>\n",
363
  " <th>0</th>\n",
 
365
  " <td>1</td>\n",
366
  " </tr>\n",
367
  " <tr>\n",
368
+ " <th>31</th>\n",
369
+ " <td>MicroStrategy Developers (MNC)</td>\n",
370
+ " <td>1</td>\n",
371
+ " </tr>\n",
372
+ " <tr>\n",
373
+ " <th>25</th>\n",
374
  " <td>Intern - Data Science</td>\n",
375
  " <td>1</td>\n",
376
  " </tr>\n",
377
  " <tr>\n",
378
+ " <th>27</th>\n",
379
+ " <td>Machine Learning Engineer (AI)</td>\n",
380
  " <td>1</td>\n",
381
  " </tr>\n",
382
  " <tr>\n",
383
+ " <th>29</th>\n",
384
+ " <td>Manager - Data Engineer</td>\n",
385
  " <td>1</td>\n",
386
  " </tr>\n",
387
  " <tr>\n",
388
+ " <th>30</th>\n",
389
+ " <td>Manager, RPA Developer</td>\n",
390
  " <td>1</td>\n",
391
  " </tr>\n",
392
  " <tr>\n",
393
+ " <th>32</th>\n",
394
+ " <td>Principal Data Engineer</td>\n",
395
+ " <td>1</td>\n",
396
+ " </tr>\n",
397
+ " <tr>\n",
398
+ " <th>23</th>\n",
399
+ " <td>Head of Advanced Data Analytics</td>\n",
400
+ " <td>1</td>\n",
401
+ " </tr>\n",
402
+ " <tr>\n",
403
+ " <th>33</th>\n",
404
  " <td>Principal Engineer, Automation Project</td>\n",
405
  " <td>1</td>\n",
406
  " </tr>\n",
407
  " <tr>\n",
408
+ " <th>34</th>\n",
409
+ " <td>RPA Manager (UiPath)</td>\n",
410
  " <td>1</td>\n",
411
  " </tr>\n",
412
  " <tr>\n",
413
+ " <th>35</th>\n",
414
+ " <td>Risk Operation Data Analyst ( Online Payment)</td>\n",
415
  " <td>1</td>\n",
416
  " </tr>\n",
417
  " <tr>\n",
418
+ " <th>36</th>\n",
419
+ " <td>Scientist (Bioinformatics)</td>\n",
420
  " <td>1</td>\n",
421
  " </tr>\n",
422
  " <tr>\n",
423
+ " <th>37</th>\n",
424
+ " <td>Senior Business Intelligence Developer / Data ...</td>\n",
425
  " <td>1</td>\n",
426
  " </tr>\n",
427
  " <tr>\n",
428
+ " <th>38</th>\n",
429
+ " <td>Senior Data Analyst</td>\n",
430
  " <td>1</td>\n",
431
  " </tr>\n",
432
  " <tr>\n",
433
+ " <th>39</th>\n",
434
+ " <td>Senior Data Scientist (Full Stack)</td>\n",
435
  " <td>1</td>\n",
436
  " </tr>\n",
437
  " <tr>\n",
438
+ " <th>40</th>\n",
439
+ " <td>System Analyst</td>\n",
440
+ " <td>1</td>\n",
441
+ " </tr>\n",
442
+ " <tr>\n",
443
+ " <th>24</th>\n",
444
  " <td>IT Business Analyst</td>\n",
445
  " <td>1</td>\n",
446
  " </tr>\n",
447
  " <tr>\n",
448
+ " <th>21</th>\n",
449
+ " <td>E-commerce Data Analyst</td>\n",
450
  " <td>1</td>\n",
451
  " </tr>\n",
452
  " <tr>\n",
453
+ " <th>22</th>\n",
454
  " <td>ERP System Analyst</td>\n",
455
  " <td>1</td>\n",
456
  " </tr>\n",
457
  " <tr>\n",
458
  " <th>10</th>\n",
459
+ " <td>Data Analyst Internship</td>\n",
460
  " <td>1</td>\n",
461
  " </tr>\n",
462
  " <tr>\n",
463
+ " <th>2</th>\n",
464
+ " <td>Big Data Engineer - Cloud</td>\n",
465
  " <td>1</td>\n",
466
  " </tr>\n",
467
  " <tr>\n",
468
+ " <th>3</th>\n",
469
+ " <td>Contract Business Administration Analyst</td>\n",
470
+ " <td>1</td>\n",
471
+ " </tr>\n",
472
+ " <tr>\n",
473
+ " <th>4</th>\n",
474
+ " <td>DATA ANALYST</td>\n",
475
  " <td>1</td>\n",
476
  " </tr>\n",
477
  " <tr>\n",
478
  " <th>6</th>\n",
479
+ " <td>Data Analyst (E-Commerce)</td>\n",
480
  " <td>1</td>\n",
481
  " </tr>\n",
482
  " <tr>\n",
483
+ " <th>7</th>\n",
484
+ " <td>Data Analyst (Intern)</td>\n",
485
+ " <td>1</td>\n",
486
+ " </tr>\n",
487
+ " <tr>\n",
488
+ " <th>8</th>\n",
489
+ " <td>Data Analyst - Internal Audit</td>\n",
490
+ " <td>1</td>\n",
491
+ " </tr>\n",
492
+ " <tr>\n",
493
+ " <th>9</th>\n",
494
  " <td>Data Analyst Executive</td>\n",
495
  " <td>1</td>\n",
496
  " </tr>\n",
497
  " <tr>\n",
498
+ " <th>11</th>\n",
499
+ " <td>Data Analyst/Scientist</td>\n",
500
  " <td>1</td>\n",
501
  " </tr>\n",
502
  " <tr>\n",
503
+ " <th>1</th>\n",
504
+ " <td>BUSINESS SYSTEM ANALYST</td>\n",
505
  " <td>1</td>\n",
506
  " </tr>\n",
507
  " <tr>\n",
508
+ " <th>12</th>\n",
509
+ " <td>Data Analysts/ Data Engineers (Bangsar South- ...</td>\n",
510
+ " <td>1</td>\n",
511
+ " </tr>\n",
512
+ " <tr>\n",
513
+ " <th>14</th>\n",
514
+ " <td>Data Engineer (Python)</td>\n",
515
+ " <td>1</td>\n",
516
+ " </tr>\n",
517
+ " <tr>\n",
518
+ " <th>15</th>\n",
519
+ " <td>Data Engineer Lead</td>\n",
520
+ " <td>1</td>\n",
521
+ " </tr>\n",
522
+ " <tr>\n",
523
+ " <th>16</th>\n",
524
+ " <td>Data Engineer/Data Cloud Engineer/Salesforce</td>\n",
525
+ " <td>1</td>\n",
526
+ " </tr>\n",
527
+ " <tr>\n",
528
+ " <th>18</th>\n",
529
+ " <td>Data Scientist Assistant Manager</td>\n",
530
+ " <td>1</td>\n",
531
+ " </tr>\n",
532
+ " <tr>\n",
533
+ " <th>19</th>\n",
534
+ " <td>Database Administrator</td>\n",
535
+ " <td>1</td>\n",
536
+ " </tr>\n",
537
+ " <tr>\n",
538
+ " <th>20</th>\n",
539
+ " <td>Database Administrator (DBA)</td>\n",
540
+ " <td>1</td>\n",
541
+ " </tr>\n",
542
+ " <tr>\n",
543
+ " <th>41</th>\n",
544
  " <td>System Analyst - IT</td>\n",
545
  " <td>1</td>\n",
546
  " </tr>\n",
 
550
  ],
551
  "text/plain": [
552
  " job_title job_id\n",
553
+ "5 Data Analyst 10\n",
554
+ "13 Data Engineer 6\n",
555
+ "26 Junior Data Analyst 3\n",
556
+ "17 Data Scientist 3\n",
557
+ "28 Machine Learning Engineer (Artificial Intellig... 2\n",
558
  "0 BI Developer - Immediate Hiring 1\n",
559
+ "31 MicroStrategy Developers (MNC) 1\n",
560
+ "25 Intern - Data Science 1\n",
561
+ "27 Machine Learning Engineer (AI) 1\n",
562
+ "29 Manager - Data Engineer 1\n",
563
+ "30 Manager, RPA Developer 1\n",
564
+ "32 Principal Data Engineer 1\n",
565
+ "23 Head of Advanced Data Analytics 1\n",
566
+ "33 Principal Engineer, Automation Project 1\n",
567
+ "34 RPA Manager (UiPath) 1\n",
568
+ "35 Risk Operation Data Analyst ( Online Payment) 1\n",
569
+ "36 Scientist (Bioinformatics) 1\n",
570
+ "37 Senior Business Intelligence Developer / Data ... 1\n",
571
+ "38 Senior Data Analyst 1\n",
572
+ "39 Senior Data Scientist (Full Stack) 1\n",
573
+ "40 System Analyst 1\n",
574
+ "24 IT Business Analyst 1\n",
575
+ "21 E-commerce Data Analyst 1\n",
576
+ "22 ERP System Analyst 1\n",
577
+ "10 Data Analyst Internship 1\n",
578
+ "2 Big Data Engineer - Cloud 1\n",
579
+ "3 Contract Business Administration Analyst 1\n",
580
+ "4 DATA ANALYST 1\n",
581
+ "6 Data Analyst (E-Commerce) 1\n",
582
+ "7 Data Analyst (Intern) 1\n",
583
+ "8 Data Analyst - Internal Audit 1\n",
584
+ "9 Data Analyst Executive 1\n",
585
+ "11 Data Analyst/Scientist 1\n",
586
  "1 BUSINESS SYSTEM ANALYST 1\n",
587
+ "12 Data Analysts/ Data Engineers (Bangsar South- ... 1\n",
588
+ "14 Data Engineer (Python) 1\n",
589
+ "15 Data Engineer Lead 1\n",
590
+ "16 Data Engineer/Data Cloud Engineer/Salesforce 1\n",
591
+ "18 Data Scientist Assistant Manager 1\n",
592
+ "19 Database Administrator 1\n",
593
+ "20 Database Administrator (DBA) 1\n",
594
+ "41 System Analyst - IT 1"
 
595
  ]
596
  },
597
+ "execution_count": 12,
598
  "metadata": {},
599
  "output_type": "execute_result"
600
  }
 
605
  },
606
  {
607
  "cell_type": "code",
608
+ "execution_count": 13,
609
  "metadata": {},
610
  "outputs": [],
611
  "source": [
 
617
  " \"company\": row['company'].lower(),\n",
618
  " \"job_title\": row['job_title'].lower(),\n",
619
  " \"salary\": row['salary'],\n",
620
+ " \"description\": row['description'],\n",
621
  " }\n",
622
  "\n",
623
  " res_file = \"{job_id}.json\".format(job_id=row['job_id'])\n",
 
629
  },
630
  {
631
  "cell_type": "code",
632
+ "execution_count": 14,
633
  "metadata": {},
634
  "outputs": [
635
  {
636
  "data": {
637
  "text/plain": [
638
+ "{'data engineer': ['senior business intelligence developer / data engineer',\n",
639
+ " 'data engineer (python)',\n",
640
+ " 'big data engineer - cloud',\n",
641
+ " 'principal data engineer',\n",
642
+ " 'data engineer lead',\n",
643
+ " 'data engineer',\n",
644
+ " 'data engineer/data cloud engineer/salesforce',\n",
645
+ " 'manager - data engineer',\n",
646
+ " 'data analysts/ data engineers (bangsar south- kerinchi lrt)'],\n",
647
+ " 'data scientist': ['senior data scientist (full stack)',\n",
648
+ " 'data scientist',\n",
649
+ " 'data scientist assistant manager'],\n",
650
+ " 'data analyst': ['e-commerce data analyst',\n",
651
+ " 'risk operation data analyst ( online payment)',\n",
652
+ " 'data analyst/scientist',\n",
653
+ " 'data analyst (e-commerce)',\n",
654
+ " 'senior data analyst',\n",
655
+ " 'data analyst (intern)',\n",
656
  " 'junior data analyst',\n",
657
+ " 'data analyst',\n",
658
+ " 'data analyst - internal audit',\n",
659
+ " 'data analyst internship',\n",
660
  " 'data analyst executive',\n",
661
+ " 'data analysts/ data engineers (bangsar south- kerinchi lrt)'],\n",
 
662
  " 'machine learning engineer': ['machine learning engineer (artificial intelligence)',\n",
663
  " 'machine learning engineer (ai)']}"
664
  ]
665
  },
666
+ "execution_count": 14,
667
  "metadata": {},
668
  "output_type": "execute_result"
669
  }
 
683
  },
684
  {
685
  "cell_type": "code",
686
+ "execution_count": 15,
687
  "metadata": {},
688
  "outputs": [],
689
  "source": [
notebooks/jobstreet-job-scraping.ipynb ADDED
@@ -0,0 +1 @@
 
 
1
+ {"cells":[{"cell_type":"markdown","metadata":{},"source":["The main goal of this notebook is to scrape Data Scientist job vacancy from [Jobstreet Malaysia](https://www.jobstreet.com.my/)(a job searching website).\n","\n","We'll be using the combination API and manual HTML scraping technique from the Jobstreet for our webscraping."]},{"cell_type":"code","execution_count":1,"metadata":{"execution":{"iopub.execute_input":"2024-04-07T09:21:59.667260Z","iopub.status.busy":"2024-04-07T09:21:59.666761Z","iopub.status.idle":"2024-04-07T09:22:00.422635Z","shell.execute_reply":"2024-04-07T09:22:00.421244Z","shell.execute_reply.started":"2024-04-07T09:21:59.667217Z"},"tags":[],"trusted":true},"outputs":[],"source":["import os\n","import requests\n","from bs4 import BeautifulSoup\n","from resume_worth.utils.utils import get_params\n","import pandas as pd\n","pd.set_option('display.max_colwidth', 200)"]},{"cell_type":"code","execution_count":2,"metadata":{},"outputs":[{"data":{"text/plain":["'/Users/luisrodrigues/Documents/Projects/PERSONAL/resume-worth'"]},"execution_count":2,"metadata":{},"output_type":"execute_result"}],"source":["# Change the current working directory to the pachage root\n","# That's step is due to the way settings.py is defined\n","ROOT_DIR = os.path.join(*os.path.split(os.getcwd())[:-1])\n","os.chdir(ROOT_DIR)\n","os.getcwd()"]},{"cell_type":"code","execution_count":3,"metadata":{},"outputs":[],"source":["max_pages = 15\n","# api url taken from Network -> Header \n","api_url = 'https://www.jobstreet.com.my/api/chalice-search/v4/search?siteKey=MY-Main&sourcesystem=houston&userqueryid=2b00edd417ec163434fca9421e24c97a-7428735&userid=e56c7e89-1d1a-42b9-b7a5-37c12653d6b9&usersessionid=e56c7e89-1d1a-42b9-b7a5-37c12653d6b9&eventCaptureSessionId=e56c7e89-1d1a-42b9-b7a5-37c12653d6b9&seekSelectAllPages=true&keywords=data+scientist&pageSize=99&include=seodata&locale=en-MY&solId=568d2fe8-e8ef-4998-8e24-3e1ccfb1348b'\n","\n","data_dir = os.path.join(\"data\", \"01_raw\")\n","ingestion_date = pd.to_datetime('today').strftime('%Y-%m-%d')\n","data_file = f\"data_jobs_scraped_from_jobstreet_{ingestion_date}.csv\""]},{"cell_type":"markdown","metadata":{},"source":["## Scrape available job articles"]},{"cell_type":"code","execution_count":4,"metadata":{"execution":{"iopub.execute_input":"2024-04-07T09:22:00.425276Z","iopub.status.busy":"2024-04-07T09:22:00.424497Z","iopub.status.idle":"2024-04-07T09:22:01.980509Z","shell.execute_reply":"2024-04-07T09:22:01.979252Z","shell.execute_reply.started":"2024-04-07T09:22:00.425240Z"},"tags":[],"trusted":true},"outputs":[],"source":["def scrape_article_ids(api_url, max_pages):\n"," job_id =[]\n"," titles = []\n"," companies = []\n"," locations = []\n"," categorys= []\n"," subCategorys= []\n"," job_types=[]\n"," salarys=[]\n"," for page_number in range(1, max_pages + 1):\n"," page_url = f'{api_url}&page={page_number}'\n"," \n"," # Send an HTTP request to the API endpoint\n"," response = requests.get(page_url)\n"," if response.status_code == 200:\n"," # Parse the JSON response\n"," data = response.json()\n","\n"," # Extract advertiser IDs from each item in the 'data' list\n"," for item in data['data']:\n"," #print(item)\n"," jid = item['id']\n"," title = item['title']\n"," company = item['advertiser'].get('description', '')\n"," location = item.get('location', '')\n"," category = item['classification'].get('description', '')\n"," subCategory= item['subClassification'].get('description', '')\n"," job_type = item.get('workType', '')\n"," salary = item.get('salary', '')\n","\n"," job_id.append(jid)\n"," titles.append(title)\n"," companies.append(company)\n"," locations.append(location)\n"," categorys.append(category)\n"," subCategorys.append(subCategory)\n"," job_types.append(job_type)\n"," salarys.append(salary)\n"," #print(f\"Job ID: {job_id}\")\n","\n"," else:\n"," print(f\"Failed to retrieve data from the API. Status Code: {response.status_code}\")\n"," break\n","\n"," return job_id, titles, companies, locations, categorys, subCategorys,job_types,salarys"]},{"cell_type":"code","execution_count":5,"metadata":{},"outputs":[],"source":["job_ids, titles, companies, locations, categorys, sub_categorys, job_types, salarys = scrape_article_ids(api_url, max_pages)\n","\n","jobs = []\n","for idx, job_id in enumerate(job_ids):\n"," job_info = {\n"," 'job_id': job_ids[idx], \n"," 'job_title': titles[idx], \n"," 'company': companies[idx], \n"," 'location': locations[idx], \n"," 'category': categorys[idx], \n"," 'sub_category': sub_categorys[idx], \n"," 'job_type': job_types[idx], \n"," 'salary': salarys[idx]\n"," }\n"," jobs.append(job_info)"]},{"cell_type":"code","execution_count":6,"metadata":{"execution":{"iopub.execute_input":"2024-04-07T09:24:27.229473Z","iopub.status.busy":"2024-04-07T09:24:27.228315Z","iopub.status.idle":"2024-04-07T09:24:27.246251Z","shell.execute_reply":"2024-04-07T09:24:27.244966Z","shell.execute_reply.started":"2024-04-07T09:24:27.229409Z"},"trusted":true},"outputs":[{"name":"stdout","output_type":"stream","text":["Number of retrieved jobs: 1515\n"]}],"source":["print(f\"Number of retrieved jobs: {len(jobs)}\")"]},{"cell_type":"markdown","metadata":{},"source":["# Filter retrieved jobs by job title and salary availability"]},{"cell_type":"code","execution_count":7,"metadata":{},"outputs":[],"source":["params = get_params()\n","target_job_titles = params['job_titles']"]},{"cell_type":"code","execution_count":8,"metadata":{},"outputs":[{"data":{"text/plain":["['Data Engineer',\n"," 'Data Scientist',\n"," 'Data Analyst',\n"," 'Machine Learning Engineer']"]},"execution_count":8,"metadata":{},"output_type":"execute_result"}],"source":["target_job_titles"]},{"cell_type":"code","execution_count":9,"metadata":{},"outputs":[],"source":["target_jobs = []\n","for job in jobs:\n"," if len(job['salary']) > 0:\n"," for target_job_title in target_job_titles:\n"," if target_job_title.lower() in job['job_title'].lower():\n"," target_jobs.append(job)\n"," break"]},{"cell_type":"code","execution_count":10,"metadata":{},"outputs":[{"name":"stdout","output_type":"stream","text":["Number of filtered jobs: 31\n"]}],"source":["print(f\"Number of filtered jobs: {len(target_jobs)}\")"]},{"cell_type":"markdown","metadata":{},"source":["## Fetch job descriptions from individual job link"]},{"cell_type":"code","execution_count":11,"metadata":{"execution":{"iopub.execute_input":"2024-04-07T09:22:07.307243Z","iopub.status.busy":"2024-04-07T09:22:07.306836Z","iopub.status.idle":"2024-04-07T09:22:07.319745Z","shell.execute_reply":"2024-04-07T09:22:07.318194Z","shell.execute_reply.started":"2024-04-07T09:22:07.307211Z"},"tags":[],"trusted":true},"outputs":[],"source":["def fetch_job_article(job_id):\n"," article_url = f'https://www.jobstreet.com.my/job/{job_id}'\n"," response = requests.get(article_url)\n"," if response.status_code == 200:\n"," return response.text\n"," else:\n"," print(f\"Failed to retrieve job article. Status Code: {response.status_code}\")\n"," return None\n","\n","def extract_text_from_ul(html_content):\n"," soup = BeautifulSoup(html_content, 'html.parser')\n"," ul_tags = soup.find_all('ul')\n"," text_list = [ul.get_text(separator='\\n') for ul in ul_tags]\n"," return '\\n'.join(text_list)\n","\n","def scrape_and_store_text(job_id):\n"," job_article_content = fetch_job_article(job_id)\n"," \n"," if job_article_content:\n"," text_from_ul = extract_text_from_ul(job_article_content)\n"," return text_from_ul\n"," \n"," return ''"]},{"cell_type":"code","execution_count":12,"metadata":{"execution":{"iopub.execute_input":"2024-04-07T09:22:09.177829Z","iopub.status.busy":"2024-04-07T09:22:09.177433Z","iopub.status.idle":"2024-04-07T09:24:23.150504Z","shell.execute_reply":"2024-04-07T09:24:23.148399Z","shell.execute_reply.started":"2024-04-07T09:22:09.177798Z"},"tags":[],"trusted":true},"outputs":[],"source":["completed_jobs = []\n","for job in target_jobs:\n"," description = scrape_and_store_text(job['job_id'])\n"," if len(description) > 0:\n"," job['description'] = description\n"," completed_jobs.append(job)"]},{"cell_type":"code","execution_count":13,"metadata":{},"outputs":[{"name":"stdout","output_type":"stream","text":["Number of completly retrieved jobs: 29\n"]}],"source":["print(f\"Number of completly retrieved jobs: {len(completed_jobs)}\")"]},{"cell_type":"code","execution_count":14,"metadata":{},"outputs":[],"source":["result_df = pd.DataFrame(completed_jobs)"]},{"cell_type":"code","execution_count":15,"metadata":{},"outputs":[{"data":{"text/html":["<div>\n","<style scoped>\n"," .dataframe tbody tr th:only-of-type {\n"," vertical-align: middle;\n"," }\n","\n"," .dataframe tbody tr th {\n"," vertical-align: top;\n"," }\n","\n"," .dataframe thead th {\n"," text-align: right;\n"," }\n","</style>\n","<table border=\"1\" class=\"dataframe\">\n"," <thead>\n"," <tr style=\"text-align: right;\">\n"," <th></th>\n"," <th>job_id</th>\n"," <th>job_title</th>\n"," <th>company</th>\n"," <th>location</th>\n"," <th>category</th>\n"," <th>sub_category</th>\n"," <th>job_type</th>\n"," <th>salary</th>\n"," <th>description</th>\n"," </tr>\n"," </thead>\n"," <tbody>\n"," <tr>\n"," <th>0</th>\n"," <td>74727704</td>\n"," <td>Machine Learning Engineer (Artificial Intelligence)</td>\n"," <td>Private Advertiser</td>\n"," <td>Kuala Lumpur</td>\n"," <td>Science &amp; Technology</td>\n"," <td>Mathematics, Statistics &amp; Information Sciences</td>\n"," <td>Full time</td>\n"," <td>RM 4,000 – RM 6,000 per month</td>\n"," <td>Design, develop, and deploy machine learning models and algorithms for complex and unique datasets, using various techniques such as mathematical modeling, scikit-learn, NLP, CNN, RNN, DL, RL, Tra...</td>\n"," </tr>\n"," <tr>\n"," <th>1</th>\n"," <td>74850440</td>\n"," <td>Data Scientist</td>\n"," <td>PHENOMENAL UAM Sdn Bhd</td>\n"," <td>Kuala Lumpur</td>\n"," <td>Science &amp; Technology</td>\n"," <td>Mathematics, Statistics &amp; Information Sciences</td>\n"," <td>Full time</td>\n"," <td>RM 5,000 – RM 7,500 per month</td>\n"," <td>Utilizing machine learning techniques to analyze and interpret complex data sets.\\nDeveloping AI models and algorithms to solve specific business problems and improve operational efficiency.\\nColl...</td>\n"," </tr>\n"," <tr>\n"," <th>2</th>\n"," <td>74946043</td>\n"," <td>Data Engineer Lead</td>\n"," <td>KK Group Of Companies</td>\n"," <td>Bandar Tasik Selatan</td>\n"," <td>Information &amp; Communication Technology</td>\n"," <td>Engineering - Software</td>\n"," <td>Full time</td>\n"," <td>RM 7,000 – RM 8,000 per month</td>\n"," <td>Making large and/or complex data more accessible, understandable and usable\\nTransforming, improving and integrating data, depending on the business requirements\\nDelivering the data in a useful a...</td>\n"," </tr>\n"," </tbody>\n","</table>\n","</div>"],"text/plain":[" job_id job_title \\\n","0 74727704 Machine Learning Engineer (Artificial Intelligence) \n","1 74850440 Data Scientist \n","2 74946043 Data Engineer Lead \n","\n"," company location \\\n","0 Private Advertiser Kuala Lumpur \n","1 PHENOMENAL UAM Sdn Bhd Kuala Lumpur \n","2 KK Group Of Companies Bandar Tasik Selatan \n","\n"," category \\\n","0 Science & Technology \n","1 Science & Technology \n","2 Information & Communication Technology \n","\n"," sub_category job_type \\\n","0 Mathematics, Statistics & Information Sciences Full time \n","1 Mathematics, Statistics & Information Sciences Full time \n","2 Engineering - Software Full time \n","\n"," salary \\\n","0 RM 4,000 – RM 6,000 per month \n","1 RM 5,000 – RM 7,500 per month \n","2 RM 7,000 – RM 8,000 per month \n","\n"," description \n","0 Design, develop, and deploy machine learning models and algorithms for complex and unique datasets, using various techniques such as mathematical modeling, scikit-learn, NLP, CNN, RNN, DL, RL, Tra... \n","1 Utilizing machine learning techniques to analyze and interpret complex data sets.\\nDeveloping AI models and algorithms to solve specific business problems and improve operational efficiency.\\nColl... \n","2 Making large and/or complex data more accessible, understandable and usable\\nTransforming, improving and integrating data, depending on the business requirements\\nDelivering the data in a useful a... "]},"execution_count":15,"metadata":{},"output_type":"execute_result"}],"source":["result_df.head(3)"]},{"cell_type":"code","execution_count":16,"metadata":{},"outputs":[{"data":{"text/html":["<div>\n","<style scoped>\n"," .dataframe tbody tr th:only-of-type {\n"," vertical-align: middle;\n"," }\n","\n"," .dataframe tbody tr th {\n"," vertical-align: top;\n"," }\n","\n"," .dataframe thead th {\n"," text-align: right;\n"," }\n","</style>\n","<table border=\"1\" class=\"dataframe\">\n"," <thead>\n"," <tr style=\"text-align: right;\">\n"," <th></th>\n"," <th>job_title</th>\n"," <th>job_id</th>\n"," </tr>\n"," </thead>\n"," <tbody>\n"," <tr>\n"," <th>1</th>\n"," <td>Data Analyst</td>\n"," <td>5</td>\n"," </tr>\n"," <tr>\n"," <th>11</th>\n"," <td>Data Scientist</td>\n"," <td>3</td>\n"," </tr>\n"," <tr>\n"," <th>7</th>\n"," <td>Data Engineer</td>\n"," <td>2</td>\n"," </tr>\n"," <tr>\n"," <th>13</th>\n"," <td>Junior Data Analyst</td>\n"," <td>2</td>\n"," </tr>\n"," <tr>\n"," <th>0</th>\n"," <td>Big Data Engineer - Cloud</td>\n"," <td>1</td>\n"," </tr>\n"," <tr>\n"," <th>12</th>\n"," <td>Data Scientist Assistant Manager</td>\n"," <td>1</td>\n"," </tr>\n"," <tr>\n"," <th>19</th>\n"," <td>Senior Data Analyst</td>\n"," <td>1</td>\n"," </tr>\n"," <tr>\n"," <th>18</th>\n"," <td>Senior Business Intelligence Developer / Data Engineer</td>\n"," <td>1</td>\n"," </tr>\n"," <tr>\n"," <th>17</th>\n"," <td>Risk Operation Data Analyst ( Online Payment)</td>\n"," <td>1</td>\n"," </tr>\n"," <tr>\n"," <th>16</th>\n"," <td>Principal Data Engineer</td>\n"," <td>1</td>\n"," </tr>\n"," <tr>\n"," <th>15</th>\n"," <td>Manager - Data Engineer</td>\n"," <td>1</td>\n"," </tr>\n"," <tr>\n"," <th>14</th>\n"," <td>Machine Learning Engineer (Artificial Intelligence)</td>\n"," <td>1</td>\n"," </tr>\n"," <tr>\n"," <th>10</th>\n"," <td>Data Engineer/Data Cloud Engineer/Salesforce</td>\n"," <td>1</td>\n"," </tr>\n"," <tr>\n"," <th>9</th>\n"," <td>Data Engineer Lead</td>\n"," <td>1</td>\n"," </tr>\n"," <tr>\n"," <th>8</th>\n"," <td>Data Engineer (Python)</td>\n"," <td>1</td>\n"," </tr>\n"," <tr>\n"," <th>6</th>\n"," <td>Data Analyst/Scientist</td>\n"," <td>1</td>\n"," </tr>\n"," <tr>\n"," <th>5</th>\n"," <td>Data Analyst Internship</td>\n"," <td>1</td>\n"," </tr>\n"," <tr>\n"," <th>4</th>\n"," <td>Data Analyst - Internal Audit</td>\n"," <td>1</td>\n"," </tr>\n"," <tr>\n"," <th>3</th>\n"," <td>Data Analyst (Intern)</td>\n"," <td>1</td>\n"," </tr>\n"," <tr>\n"," <th>2</th>\n"," <td>Data Analyst (E-Commerce)</td>\n"," <td>1</td>\n"," </tr>\n"," <tr>\n"," <th>20</th>\n"," <td>Senior Data Scientist (Full Stack)</td>\n"," <td>1</td>\n"," </tr>\n"," </tbody>\n","</table>\n","</div>"],"text/plain":[" job_title job_id\n","1 Data Analyst 5\n","11 Data Scientist 3\n","7 Data Engineer 2\n","13 Junior Data Analyst 2\n","0 Big Data Engineer - Cloud 1\n","12 Data Scientist Assistant Manager 1\n","19 Senior Data Analyst 1\n","18 Senior Business Intelligence Developer / Data Engineer 1\n","17 Risk Operation Data Analyst ( Online Payment) 1\n","16 Principal Data Engineer 1\n","15 Manager - Data Engineer 1\n","14 Machine Learning Engineer (Artificial Intelligence) 1\n","10 Data Engineer/Data Cloud Engineer/Salesforce 1\n","9 Data Engineer Lead 1\n","8 Data Engineer (Python) 1\n","6 Data Analyst/Scientist 1\n","5 Data Analyst Internship 1\n","4 Data Analyst - Internal Audit 1\n","3 Data Analyst (Intern) 1\n","2 Data Analyst (E-Commerce) 1\n","20 Senior Data Scientist (Full Stack) 1"]},"execution_count":16,"metadata":{},"output_type":"execute_result"}],"source":["result_df.groupby('job_title')[['job_id']].count().reset_index().sort_values(by=\"job_id\",ascending=False)"]},{"cell_type":"code","execution_count":17,"metadata":{},"outputs":[],"source":["data_path = os.path.join(data_dir, data_file)\n","\n","data_df = result_df.to_csv(data_path, sep=\",\")"]},{"cell_type":"code","execution_count":null,"metadata":{},"outputs":[],"source":[]}],"metadata":{"kaggle":{"accelerator":"none","dataSources":[{"datasetId":4284628,"sourceId":7654855,"sourceType":"datasetVersion"}],"dockerImageVersionId":30626,"isGpuEnabled":false,"isInternetEnabled":true,"language":"python","sourceType":"notebook"},"kernelspec":{"display_name":"Python 3","language":"python","name":"python3"},"language_info":{"codemirror_mode":{"name":"ipython","version":3},"file_extension":".py","mimetype":"text/x-python","name":"python","nbconvert_exporter":"python","pygments_lexer":"ipython3","version":"3.11.8"}},"nbformat":4,"nbformat_minor":4}
src/app/app.py CHANGED
@@ -4,15 +4,19 @@ from resume_worth.pipelines.information_retrieval.pipeline import retrieve_top_j
4
 
5
 
6
  params = get_params()
7
- APP_TITLE = params['APP_TITLE']
8
- APP_DESCRIPTION = params['APP_DESCRIPTION']
9
- JOB_TITLES = params['JOB_TITLES']
10
- HOST = params['HOST']
11
- PORT = params['PORT']
12
 
13
 
14
- def salary_estimator(job_title, resume):
15
 
 
 
 
 
 
 
16
  salaries = retrieve_top_job_vacancy_info(job_title, resume)
17
 
18
  return salaries[0]
@@ -23,19 +27,19 @@ def run():
23
  app = gr.Interface(
24
  fn=salary_estimator,
25
  inputs=[
26
- gr.Radio(JOB_TITLES, label="Job Title"),
27
  gr.Textbox(label="Resume", lines=10)
28
  ],
29
  outputs=[
30
  gr.Textbox(label="Estimate Salary", lines=1)
31
  ],
32
- title=APP_TITLE,
33
- description=APP_DESCRIPTION,
34
  allow_flagging="never"
35
  )
36
 
37
  # Use share=True to create a public link to share. This share link expires in 72 hours.
38
- app.launch(server_name=HOST, server_port=int(PORT))
39
 
40
 
41
  if __name__ == "__main__":
 
4
 
5
 
6
  params = get_params()
7
+ app_config = params['app_config']
8
+ app_backend = params['app_backend']
9
+ app_frontend = params['app_frontend']
 
 
10
 
11
 
12
+ def salary_estimator(job_title: str, resume: str):
13
 
14
+ if job_title is None:
15
+ return app_frontend['messages']['job_title_not_found']
16
+
17
+ if len(resume) < app_backend['min_resume_size']:
18
+ return app_frontend['messages']['salary_not_found']
19
+
20
  salaries = retrieve_top_job_vacancy_info(job_title, resume)
21
 
22
  return salaries[0]
 
27
  app = gr.Interface(
28
  fn=salary_estimator,
29
  inputs=[
30
+ gr.Radio(app_frontend['jobs'], label="Job Title"),
31
  gr.Textbox(label="Resume", lines=10)
32
  ],
33
  outputs=[
34
  gr.Textbox(label="Estimate Salary", lines=1)
35
  ],
36
+ title=app_frontend['title'],
37
+ description=app_frontend['description'],
38
  allow_flagging="never"
39
  )
40
 
41
  # Use share=True to create a public link to share. This share link expires in 72 hours.
42
+ app.launch(server_name=app_config['host'], server_port=app_config['port'])
43
 
44
 
45
  if __name__ == "__main__":