/growth-school

Data Ochestration with Airflow Wokshop for GrowthSchool.io

Primary LanguagePython

GrediStore E-commerce Data Orchestration

Welcome to GrediStore's E-commerce Data Orchestration project! 🛒 (Data Orchestration with Airflow Workshop for GrowthSchool.io)

image

What We Do

GrediStore E-commerce Data Orchestration is a comprehensive project that handles data migration, integrates real-time weather information, and enables powerful visualizations for insightful decision-making.

Key Features:

  1. Database Migration and Transformation:

    • Effortlessly migrate data from CSV files to PostgreSQL.
    • Apply tailored transformations for improved data quality.
  2. Weather Data Integration:

    • Fetch live weather data via a dedicated API | Open Meteo.
    • Store and manage weather information in a dedicated PostgreSQL table.
  3. Visualization with Metabase/Superset/Tableau:

    • Set up and configure your preferred visualization tool locally.
    • Establish seamless connections to the PostgreSQL database for dynamic reporting.
  4. Analysis and Reporting:

    • Conduct in-depth analyses across predefined categories.
    • Generate compelling visualizations and detailed reports for actionable insights.

How to Get Started

  • To begin, follow our guide on Setting Up Airflow Locally with Virtual Environment. This will walk you through the process of installing and configuring Airflow for your local environment.
  • Replace your DAG folder with the one in this repo.
  • Install PostgreSQL and its Admin (Official PostgreSQL RDBMS)
  • Install the following Python library:
    • Pandas
    • pysycog2-binary
    • Numpy
    • openmeteo_requests
    • requests_cache
      And any library you're prompted to install.

Tableau Dashboard

You can access the Tableau dashboard here

License

This project is licensed under the MIT License.

Tutorial Video Link

Check my Youtube page

Follow Me on LinkedIn

For more updates and to connect with me, follow me on LinkedIn.