Data Analytics Teaching Repository: KickStarter Data Manipulation

Overview

This repository provides hands-on examples for teaching data analytics using the KickStarter dataset. It covers everything from reading in data to telling a story with that data. The repository focuses on manipulating the dataset using pandas, exploring different ways to clean and format column names, and performing basic data analysis tasks.

Through these exercises, students will learn key data manipulation techniques that are foundational to working with real-world datasets.

In this module we will cover:

Week 1:

We focus on:

Reading in data
Performing initial analyses to gain insights
Understanding the data types
Finding missing values
Exploring other key aspects of the dataset
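The Week 1 steps can be sketched roughly as follows. This is a minimal illustration, not the repository's actual notebook: the inline CSV text and its column names (ID, name, main_category, goal, pledged, state) are made up to stand in for the KickStarter file, which would normally be loaded from disk with pd.read_csv and a file path.

```python
import io

import pandas as pd

# Hypothetical sample standing in for the KickStarter CSV.
# Column names and values are invented for illustration only.
csv_text = """ID,name,main_category,goal,pledged,state
1,Project A,Music,1000,1500,successful
2,Project B,Film,5000,,failed
3,,Games,2000,250,failed
"""

# Read the data (with a real file, pass its path instead of StringIO).
df = pd.read_csv(io.StringIO(csv_text))

# First look: the top rows and the (rows, columns) shape.
print(df.head())
print(df.shape)

# Data type pandas inferred for each column.
print(df.dtypes)

# Number of missing values in each column.
missing_per_column = df.isna().sum()
print(missing_per_column)

# Summary statistics for the numeric columns.
print(df.describe())
```

Checking dtypes and missing-value counts early tends to surface problems, such as a numeric column that contains blanks, before any analysis depends on it.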

Week 2:

We learn how to:

Manipulate column names
Remove spaces from our data
Replace values
Drop columns
Handle missing data
Filter data based on conditions
Get summary statistics on a column
Perform groupbys
Sort values
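The Week 2 topics could look something like the sketch below. The sample data, the column names, the replacement value "funded", and the choice to fill missing pledges with 0 are all assumptions made for illustration; the real dataset and the course notebooks may differ.

```python
import io

import pandas as pd

# Hypothetical sample with messy headers and values; invented for illustration.
csv_text = """ID , Main Category,Goal ,Pledged,State,Notes
1, Music ,1000,1500,successful,x
2,Film,5000,,failed,y
3, Games,2000,250,failed,z
4,Music,3000,3200,successful,w
"""

df = pd.read_csv(io.StringIO(csv_text))

# Manipulate column names: trim spaces, lowercase, use underscores.
df.columns = df.columns.str.strip().str.lower().str.replace(" ", "_")

# Remove leading and trailing spaces from a text column.
df["main_category"] = df["main_category"].str.strip()

# Replace values (the new label "funded" is an arbitrary example).
df["state"] = df["state"].replace({"successful": "funded"})

# Drop a column that is not needed.
df = df.drop(columns=["notes"])

# Handle missing data: here missing pledges are assumed to mean 0.
df["pledged"] = df["pledged"].fillna(0)

# Filter rows based on a condition: projects that met their goal.
funded = df[df["pledged"] >= df["goal"]]

# Get a statistic on a single column.
mean_goal = df["goal"].mean()

# Group by category, total the pledges, then sort from largest to smallest.
pledged_by_category = df.groupby("main_category")["pledged"].sum()
ranked = pledged_by_category.sort_values(ascending=False)

print(funded)
print(mean_goal)
print(ranked)
```

Filling missing values with 0 is only one option; dropping the rows with dropna or filling with a column median are common alternatives, and the right choice depends on what a missing value means in the data.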

Virtual Environment Commands

Command	Linux/Mac	Git Bash (Windows)
Create	`python3 -m venv venv`	`python -m venv venv`
Activate	`source venv/bin/activate`	`source venv/Scripts/activate`
Install	`pip install -r requirements.txt`	`pip install -r requirements.txt`
Deactivate	`deactivate`	`deactivate`
