/data_m2

Primary Language: Jupyter Notebook

Data Analytics Teaching Repository: KickStarter Data Manipulation

Overview

This repository provides hands-on examples for teaching data analytics using the KickStarter dataset. It covers everything from reading in data to telling a story with that data. The repository focuses on manipulating the dataset using pandas, exploring different ways to clean and format column names, and performing basic data analysis tasks.

Through these exercises, students will learn key data manipulation techniques that are foundational to working with real-world datasets.

In this module we will cover:

Week 1:

  • Reading in data
  • Performing initial analyses to gain insights
  • Understanding the data types
  • Finding missing values
  • Exploring other key aspects of the dataset
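The Week 1 steps might look roughly like the sketch below. It is only an illustration: the inline CSV text and its column names (`name`, `main_category`, `goal`, `pledged`, `state`) are hypothetical stand-ins, and the real KickStarter file used in class may be named and laid out differently.

```python
import io

import pandas as pd

# Hypothetical sample standing in for the KickStarter CSV; the real file's
# name and columns are assumptions here and may differ.
csv_text = """ID,name,main_category,goal,pledged,state
1,Solar Lamp,Technology,5000,7200,successful
2,Indie Zine,Publishing,1200,,failed
3,Board Game,Games,8000,9100,successful
4,Short Film,,3000,450,failed
"""

# With a file on disk this would be pd.read_csv("<path to the csv>") instead.
df = pd.read_csv(io.StringIO(csv_text))

# First look: a few rows and the overall size (rows, columns).
print(df.head())
print(df.shape)

# Data type of every column.
print(df.dtypes)

# Number of missing values in each column.
missing_per_column = df.isna().sum()
print(missing_per_column)

# Summary statistics for the numeric columns.
print(df.describe())
```

Running these checks before any cleaning gives a quick picture of which columns need attention, for example a numeric column that was read as text or a column with many missing values.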

Week 2:

We learn how to:

  • Manipulate column names
  • Remove spaces in our data
  • Replace values
  • Drop columns
  • Handle missing data
  • Filter data based on conditions
  • Get summary statistics on a column
  • Perform groupbys
  • Sort values
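The Week 2 topics can be sketched end to end on a small made-up table. Everything in the DataFrame below (column names, values, the empty `Notes` column) is hypothetical and only serves the illustration; filling missing pledges with 0 is likewise just one possible choice, not necessarily the one used in the notebooks.

```python
import pandas as pd

# Hypothetical data with messy column names and stray spaces.
df = pd.DataFrame({
    " Project Name ": [" Solar Lamp", "Indie Zine ", "Board Game",
                       "Short Film", "Pocket Synth"],
    "Main Category": ["Technology", "Publishing", "Games",
                      "Film", "Technology"],
    "Goal USD": [5000, 1200, 8000, 3000, 2500],
    "Pledged USD": [7200.0, None, 9100.0, 450.0, 4000.0],
    "State": ["successful", "failed", "successful", "failed", "successful"],
    "Notes": ["", "", "", "", ""],
})

# Column names: trim whitespace, lowercase, and use underscores.
df.columns = df.columns.str.strip().str.lower().str.replace(" ", "_")

# Remove leading/trailing spaces inside a text column.
df["project_name"] = df["project_name"].str.strip()

# Replace values using a mapping of old value -> new value.
df["main_category"] = df["main_category"].replace({"Film": "Film & Video"})

# Drop a column that carries no information.
df = df.drop(columns=["notes"])

# Handle missing data: here missing pledges are treated as 0.
df["pledged_usd"] = df["pledged_usd"].fillna(0)

# Filter rows on a condition: projects that met their goal.
funded = df[df["pledged_usd"] >= df["goal_usd"]]

# Statistics on a single column.
mean_pledged = df["pledged_usd"].mean()

# Group by category, total the pledges, then sort largest first.
by_category = df.groupby("main_category")["pledged_usd"].sum()
ranked = by_category.sort_values(ascending=False)

print(funded)
print(mean_pledged)
print(ranked)
```

Cleaning the column names first keeps every later step readable, since columns can then be referenced without worrying about stray spaces or inconsistent capitalization.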

Virtual Environment Commands

Command      Linux/Mac                          GitBash
Create       python3 -m venv venv               python -m venv venv
Activate     source venv/bin/activate           source venv/Scripts/activate
Install      pip install -r requirements.txt    pip install -r requirements.txt
Deactivate   deactivate                         deactivate
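
After activating, one way to confirm that Python is actually running from the virtual environment is a short check like the one below. This is an optional sketch and not part of the course material; it relies only on the standard library.

```python
import sys

# Inside a venv, sys.prefix points at the environment directory while
# sys.base_prefix still points at the base Python installation.
in_venv = sys.prefix != sys.base_prefix

print("Virtual environment active:", in_venv)
print("Interpreter:", sys.executable)
```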