/Prediction_Job-Vacancies_Machine_Learning

Teamwork. Prediction of future potential job vacancies and understanding underlying patterns in Canada’s job market using MLP Regressor and SARIMAX of supervised ML models.

Primary LanguageJupyter Notebook

Capstone Project

All content on this page were authored by Alec Ngai, Esther Utomakili, Jerry McGranaghan, Wisam Ramadan and Olesya Irkhina.

To view the overview of the project, please refer to the Google Slides documents authored by the team.

Table of Contents

Overview of Project

Project Topic

The chosen topic for this project is "Understanding and Predicting Labour Vacancies in Various Canadian Sectors".

This topic was selected to understand the Canadian labout market and predict how the job market would look like in the near future. It is important to know how the labour market have changed over the past years and how it will look like in the upcoimg years. Understanding the labour market and looking at potential vacancies could go a long way in avoiding situations such as economic collapse and could also potentially assist with education planning.

Project Goals

  • Predict future potential job vacancies in Canada
  • Understand underlying patterns in Canada’s job market

Data Source

All data used for the project was collected from the official Government of Canada website. The starting dataset was the Job vacancies, proportion of job vacancies and average offered hourly wage by selected characteristics, quarterly, unadjusted for seasonality - Dataset which contains number of job vacancies for various industries, categorized by National Occupational Classification (NOC) codes, every month from the first quarter of 2015 to the second quarter of 2021.

The original data set contains standardized column information, however, for the purpose of this project, it was cleaned to contain information needed to achieve the project goals. The original dataset included information such as:

  • Divided into Quarters
  • Includes Canada-wide and Province-wide detail
  • National Occupational Classification (NOC)
  • Vacancy Characteristics
  • Statistics describing job vacancies, proportion of job vacancies and average wage

Communication Protocols

The team made use of a dedicated Slack channel, Zoom and various Google apps to communicate, meeting every Tuesday, Thursday and Saturday. It can be found here Meeting Minutes.

Data Exploration and Cleaning

Please refer to the dedicated ReadME explaining the data exploration process.

Results

Machine Learning

Please refer to the dedicated ReadME explaining the timeSeries machine learning model, and, for Multi-Layer Perceptron regressor used for analyzing and predicting future vacancies.

Database

The team made use of AWS' Relational Database Service to host the database used for the project. Please refer to the dedicated ReadME detailing all processes involving the database

Dashboard

The team decided on making use of Tableau as a visualization tool. Please refer to the dedicated ReadME proving images of the Tableau storyboard