This project is coded in a Jupyter Notebook in which each kernal can be ran individually after the first 2 kernals are ran in sequential order. The first kernal imports the necessary technologies for the project while the second kernal reads the data used for the project. The photographs below exemplify the results of each kernal.
Run the kernals by hovering over each kernal then clicking the play button in the top left corner. Once a green check displays in the bottom left corner, the kernal has completed running. You can also run all kernals simultaneously in sequential order by clicking the "Run All" play button at the top of the file.
Use this link (https://github.com/blwishom/etav_project.git) to clone the repo in your terminal and begin to explore!
-
These 2 kernals import the necessary technologies for the project and read the data used for the project:
-
This kernal extracts specific data from the database while renaming the columns and also filling in NaN values with logical text and creates a dataframe to be used throughout the rest of the project:
-
This kernal combines 2 columns of data from the dataframe and displays the data in a 2-D visual:
-
This kernal removes rows with no data from the previous kernal in order for the 2-D visual to only display rows that consist of data and also orders the usCasesPerYear column in descending order from highest amount of cases to lowest:
-
This kernal groups the US Cases Per Year by grabbing the highest amount of cases(max), lowest amount of cases (min) and the average between the highest and lowest amounts (avg):
-
This kernal grabs the total amount of each type of cancer and displays the cancer type in order of greatest amount of cases to least:
-
This kernal grabs the top 5 types of cancers and displays them in a bar graph:
-
This kernal grabs the top 5 types of cancers and displays them in a pie chart:
-
This kernal grabs the top 5 types of cancers and displays them in a scatter plot: