This repository contains data analysis and visualization projects using Python, SQL, and Tableau.
Language(s): Python, SQL
Package(s): (Python) Pandas, Seaborn, BeautifulSoup
Software(s): Tableau, Jupyter Notebooks
Featured Project:
- M1 Expert Pie Analysis: Data analysis and visualization of M1 Expert Pies using Python's Pandas and Seaborn modules.
- Newsfeed Analysis in Tableau: Created a Tableau Dashboard that visualizes real user-interaction data relating to 365DataScience's "Newsfeed" service and provides a write-up summarizing the process and further insights.
- Student Onboarding Analysis in Tableau: Created a Tableau Dashboard that visualizes student responses to 365DataScience's onboarding survey and provides a write-up summarizing the process and further insights.
Other Projects:
- Amazon Web Scraper: Uses Python's Beautiful Soup module to collect and store Amazon product data (every 24 hours).
- Covid-19 Data Exploration: Uses SQL to query global Covid-19 data per country and continent (used in Tableau Dashboard here).
- Movie Correlation with Python: Explores and visualizes feature correlation of a Movie Industry dataset (numeric and all feature correlation) using Python.
- Nashville Housing Data Cleaning: SQL project cleaning and transforming sample housing data.
Data Collection
- Python Webscraping to collect and store data over time
- Working with data in CSV and Microsoft Excel formats
Data Exploration
- Using Python Pandas module (sorting, conditional filtering, finding missing data, general and aggregate statistics (describe, dtypes, max, min, mean))
- Using SQL (selecting, grouping and aggregate functions, partitions, joins, CTEs, temp tables, creating views)
Data Cleaning
- Using Python Pandas module (dtype recasting, filling null values, altering columns)
- Using SQL (creating and dropping columns, filling null values, using SPLIT and PARSENAME)
Data Analysis
- Summarizing relevant trends and features of vizualizations
- Constructing necessary queries to answer initial analysis questions
Data Visualization
- Seaborn (lineplots, scatterplots, heatmaps, using subplots to display multiple graphs simultaneously)
- Tableau (using SQL queries to create a dashboard)