Pinned Repositories
airflow-pipelines-model-training-precious-metal-prices
A portfolio repository, that showcase using Airflow to orchestrate ETL pipelines that would prepare the precious metal prices data to be used with machine learning model and then train the model.
data-analysis-library-in-python
A portfolio repository, that showcase creating Data Analysis library in Python.
jobs-data-pipelines-with-python-and-airflow
A portfolio repository, that showcase using Airflow to create a data pipeline in Python, that would present job offers from several job boards.
loan-eligibility-with-docker-and-airflow
A portfolio repository, that showcase using Airflow to manage Docker containers to prepare the environment and drive ETL process of loan eligibility data.
smart_meters
A portfolio repository, that showcase using Spark for data transformation and loading in a Data Lake environment, using Airflow to orchestrate PySpark. Machine learning feature store is available via FastAPI webapp managed by Kubernetes and backed up by Redis.
spark-airflow-docker-smart_meters
A portfolio repository, that showcase using Spark for data transformations and loading in a Data Lake environment, using Airflow to orchestrate PySpark jobs that are encapsulated in Docker containers.
superstore-dimensional-modeling-postgresql
A portfolio repository, that showcase using dbt for Kimball-style dimensional modeling on a Superstore Sales dataset.
weather-data-system-with-python-and-sql
A portfolio repository, that showcase creating a data pipeline in Python with data from OpenWeatherAPI.
zakapior
zakapior's Repositories
zakapior/airflow-pipelines-model-training-precious-metal-prices
A portfolio repository, that showcase using Airflow to orchestrate ETL pipelines that would prepare the precious metal prices data to be used with machine learning model and then train the model.
zakapior/data-analysis-library-in-python
A portfolio repository, that showcase creating Data Analysis library in Python.
zakapior/jobs-data-pipelines-with-python-and-airflow
A portfolio repository, that showcase using Airflow to create a data pipeline in Python, that would present job offers from several job boards.
zakapior/loan-eligibility-with-docker-and-airflow
A portfolio repository, that showcase using Airflow to manage Docker containers to prepare the environment and drive ETL process of loan eligibility data.
zakapior/smart_meters
A portfolio repository, that showcase using Spark for data transformation and loading in a Data Lake environment, using Airflow to orchestrate PySpark. Machine learning feature store is available via FastAPI webapp managed by Kubernetes and backed up by Redis.
zakapior/spark-airflow-docker-smart_meters
A portfolio repository, that showcase using Spark for data transformations and loading in a Data Lake environment, using Airflow to orchestrate PySpark jobs that are encapsulated in Docker containers.
zakapior/superstore-dimensional-modeling-postgresql
A portfolio repository, that showcase using dbt for Kimball-style dimensional modeling on a Superstore Sales dataset.
zakapior/weather-data-system-with-python-and-sql
A portfolio repository, that showcase creating a data pipeline in Python with data from OpenWeatherAPI.
zakapior/zakapior