
Building data pipelines with Pandas DataFrames


Pandas-Data-Pipelines

Introduction

Pandas is a widely used Python library for data analytics and machine learning. This repository shows how to build data pipelines on top of Pandas DataFrames.


Folder Structure

datasource
|-USA_Housing.csv
repository
|-pipeline.ipynb
|-.ipynb_checkpoints
README.md
requirement.txt

Pipelining with Pandas

See the Jupyter notebook, repository/pipeline.ipynb, for the pipeline code.
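As a rough illustration of what pipelining a DataFrame can look like, the sketch below chains small transformation functions with pandas' built-in `DataFrame.pipe`. It is not taken from the notebook and does not use pdpipe. The step functions, the column names (`Price`, `Rooms`, `Address`), and the toy values are all made up for the example.

```python
import pandas as pd


def drop_columns(df, columns):
    """Return a copy of df without the given columns."""
    return df.drop(columns=columns)


def drop_missing(df):
    """Return a copy of df without rows that contain missing values."""
    return df.dropna()


def add_price_per_room(df):
    """Add a derived column; 'Price' and 'Rooms' are hypothetical names."""
    return df.assign(price_per_room=df["Price"] / df["Rooms"])


# Toy data invented for this sketch, not rows from the real dataset.
raw = pd.DataFrame(
    {
        "Price": [300000.0, 450000.0, None],
        "Rooms": [6.0, 9.0, 5.0],
        "Address": ["a", "b", "c"],
    }
)

# Each .pipe call passes the current DataFrame to the next step,
# so the pipeline reads top to bottom and leaves `raw` unchanged.
clean = (
    raw
    .pipe(drop_columns, columns=["Address"])
    .pipe(drop_missing)
    .pipe(add_price_per_room)
)

print(clean)
```

Keeping each step a plain function that takes and returns a DataFrame makes the steps easy to reorder and to test one at a time.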

The dataset

See the datasource folder. The dataset, USA_Housing.csv, is open data on US housing prices downloaded from Kaggle.
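A minimal sketch of loading and inspecting the dataset before it enters a pipeline might look like the following. The default path assumes the code runs from the repository root shown in the folder structure. The helper names `load_housing` and `summarize` are hypothetical, and the inline sample is invented so the example runs without the real file.

```python
import io

import pandas as pd


def load_housing(source="datasource/USA_Housing.csv"):
    """Read the CSV from a path or file-like object and tidy the headers.

    The default path is an assumption based on the folder listing.
    """
    df = pd.read_csv(source)
    df.columns = df.columns.str.strip()  # remove stray spaces in headers
    return df


def summarize(df):
    """Return one row per column with its dtype and missing-value count."""
    return pd.DataFrame(
        {"dtype": df.dtypes.astype(str), "missing": df.isna().sum()}
    )


# Invented stand-in data; the real file's columns may differ.
sample = io.StringIO("Price,Rooms,Address\n300000,6,a\n,5,b\n")
df = load_housing(sample)
summary = summarize(df)
print(summary)
```

With the real file in place, `load_housing()` with no argument would read it from the assumed default path.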

Reference

  1. Build Pipelines with Pandas Using pdpipe