/FIT5147_dataset

FIT5147 assignment dataset

Primary LanguageJupyter Notebook

FIT5147_dataset

FIT5147 assignment dataset for 31029248 DEP project

raw_data folder

in this folder is the raw data download from the site below in csv or xlsx format:

  1. Tabular data: 32 rows * 170 columns Teams expected and actual performance data (https://www.kaggle.com/datasets/swaptr/fifa-world-cup-2022-statistics)
  2. Tabular data: 32 rows * 15 columns Match result and expected performance data (https://www.kaggle.com/datasets/swaptr/fifa-world-cup-2022-statistics)
  3. Tabular data: 61 rows * 176 columns Match other performance data (https://www.kaggle.com/datasets/die9origephit/fifa-world-cup-2022-complete-dataset?resource=download)
  4. Web data: World Cup Final Stage Team Statistics (https://www.whoscored.com/Regions/247/Tournaments/36/Seasons/8213/Stages/18657/TeamStatistics/International-FIFA-World-Cup-2022)

Py Files

Those python files is the script that help me to make Data Wrangling and Data cleaning

data_clean folder

this folder storage the xlsx files after the process from Py files. And those xlsx inside are working for the futher exploration in DEP and DEV assignment.