Pinned Repositories
ApacheFlink-SalesAnalytics
This repository contains an end-to-end data engineering project using Apache Flink, focused on performing sales analytics. The project demonstrates how to ingest, process, and analyze sales data, showcasing the capabilities of Apache Flink for big data processing.
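A minimal PyFlink Table API sketch of the kind of streaming aggregation such a sales-analytics job performs; the column names and in-memory source below are illustrative stand-ins, not taken from the repository:

```python
# Minimal PyFlink Table API sketch: aggregate sales per category.
# The columns (category, price) and the in-memory source are placeholders
# for the repository's actual streaming sources.
from pyflink.table import EnvironmentSettings, TableEnvironment

t_env = TableEnvironment.create(EnvironmentSettings.in_streaming_mode())

# In-memory sample data instead of the real sales stream.
sales = t_env.from_elements(
    [("electronics", 299.99), ("groceries", 14.50), ("electronics", 89.00)],
    ["category", "price"],
)
t_env.create_temporary_view("sales", sales)

# Total revenue per category -- the core of a sales-analytics job.
result = t_env.sql_query(
    "SELECT category, SUM(price) AS total_revenue FROM sales GROUP BY category"
)
result.execute().print()
```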
changecapture-e2e
This project shows how to capture changes from a Postgres database and stream them into Kafka.
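A hedged sketch of the consuming side of such a setup: reading Debezium-style change events from Kafka. The topic name and event layout are assumptions, not values from the repository.

```python
# Consume Debezium-style CDC events (e.g. produced by Debezium + Kafka Connect
# watching Postgres) from a Kafka topic. Topic and field names are placeholders.
import json
from kafka import KafkaConsumer  # pip install kafka-python

consumer = KafkaConsumer(
    "pg.public.customers",                      # hypothetical CDC topic
    bootstrap_servers="localhost:9092",
    value_deserializer=lambda v: json.loads(v.decode("utf-8")) if v else None,
    auto_offset_reset="earliest",
)

for message in consumer:
    event = message.value
    if event is None:          # tombstone record that follows a delete
        continue
    payload = event.get("payload", event)
    op = payload.get("op")     # 'c' = create, 'u' = update, 'd' = delete
    print(op, payload.get("before"), "->", payload.get("after"))
```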
e2e-data-engineering
An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Kafka, Apache Zookeeper, Apache Spark, and Cassandra. All components are containerized with Docker for easy deployment and scalability.
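A minimal sketch of the ingestion leg of a pipeline like this: an Airflow DAG with a task that publishes records to Kafka. The topic, schedule, and payload are placeholders rather than the repository's actual values.

```python
# Airflow DAG with a single task that produces JSON records to Kafka.
import json
from datetime import datetime

from airflow import DAG
from airflow.operators.python import PythonOperator
from kafka import KafkaProducer  # pip install kafka-python


def stream_to_kafka():
    producer = KafkaProducer(
        bootstrap_servers="broker:29092",
        value_serializer=lambda v: json.dumps(v).encode("utf-8"),
    )
    # Placeholder payload; a real ingestion task would pull from an external source.
    producer.send("users_created", {"id": 1, "name": "example"})
    producer.flush()


with DAG(
    dag_id="kafka_ingestion_sketch",
    start_date=datetime(2024, 1, 1),
    schedule="@daily",
    catchup=False,
) as dag:
    PythonOperator(task_id="stream_to_kafka", python_callable=stream_to_kafka)
```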
FlinkCommerce
This repository contains an Apache Flink application for real-time sales analytics. Docker Compose is used to orchestrate the necessary infrastructure components, including Apache Flink, Elasticsearch, and Postgres.
FootballDataEngineering
An end-to-end data engineering pipeline that fetches data from Wikipedia, cleans and transforms it with Apache Airflow, and saves it to Azure Data Lake. Further processing takes place in Azure Data Factory, Azure Synapse, and Tableau.
modern-data-eng-dbt-databricks-azure
In this project, we set up an end-to-end data engineering pipeline using Apache Spark, Azure Databricks, and Data Build Tool (dbt), with Azure as the cloud provider.
realtime-voting-data-engineering
This repository contains the code for a real-time election voting system built with Python, Kafka, Spark Streaming, Postgres, and Streamlit. Docker Compose is used to easily spin up the required services in Docker containers.
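A sketch of the Spark Streaming stage of such a system, assuming votes arrive on a Kafka topic as JSON with a "candidate" field; the topic name and schema are placeholders.

```python
# Spark Structured Streaming: count votes per candidate from a Kafka topic.
from pyspark.sql import SparkSession
from pyspark.sql.functions import col, from_json
from pyspark.sql.types import StringType, StructField, StructType

# The Kafka source requires the spark-sql-kafka package on the classpath.
spark = SparkSession.builder.appName("VoteCounter").getOrCreate()

vote_schema = StructType([StructField("candidate", StringType())])

vote_counts = (
    spark.readStream.format("kafka")
    .option("kafka.bootstrap.servers", "localhost:9092")
    .option("subscribe", "votes_topic")
    .load()
    .select(from_json(col("value").cast("string"), vote_schema).alias("v"))
    .groupBy("v.candidate")
    .count()
)

# Console sink for demonstration; a real pipeline would write results onward
# (e.g. to Postgres for a Streamlit dashboard to read).
query = vote_counts.writeStream.outputMode("complete").format("console").start()
query.awaitTermination()
```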
RealtimeStreamingEngineering
This project serves as a comprehensive guide to building an end-to-end data engineering pipeline using TCP/IP sockets, Apache Spark, OpenAI's LLM, Kafka, and Elasticsearch. It covers each stage from data acquisition and processing, through sentiment analysis with ChatGPT, to publishing to a Kafka topic and loading into Elasticsearch.
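A hedged sketch of the sentiment-analysis step using the OpenAI chat completions API; the model name and prompt wording are assumptions, and the repository may structure this differently.

```python
# Classify the sentiment of a piece of text with the OpenAI chat completions API.
from openai import OpenAI  # pip install openai

client = OpenAI()  # reads OPENAI_API_KEY from the environment


def classify_sentiment(text: str) -> str:
    response = client.chat.completions.create(
        model="gpt-4o-mini",  # placeholder model name
        messages=[
            {
                "role": "system",
                "content": "Reply with exactly one word: POSITIVE, NEGATIVE, or NEUTRAL.",
            },
            {"role": "user", "content": text},
        ],
    )
    return response.choices[0].message.content.strip()


print(classify_sentiment("The service was quick and the staff were friendly."))
```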
RedditDataEngineering
This project provides a comprehensive data pipeline solution to extract, transform, and load (ETL) Reddit data into a Redshift data warehouse. The pipeline leverages a combination of tools and services including Apache Airflow, Celery, PostgreSQL, Amazon S3, AWS Glue, Amazon Athena, and Amazon Redshift.
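A sketch of the extract-and-load steps of a Reddit ETL like this one, assuming PRAW for the Reddit API and boto3 for S3; credentials, subreddit, bucket, and file names are placeholders.

```python
# Extract top posts from a subreddit, stage them as CSV, and load to S3,
# where Glue/Athena/Redshift can pick them up downstream.
import csv

import boto3
import praw  # pip install praw

reddit = praw.Reddit(
    client_id="YOUR_CLIENT_ID",
    client_secret="YOUR_CLIENT_SECRET",
    user_agent="reddit-etl-sketch",
)

# Extract: top posts from a subreddit.
rows = [
    (post.id, post.title, post.score, post.num_comments)
    for post in reddit.subreddit("dataengineering").top(limit=100)
]

# Stage: write a flat CSV locally.
with open("reddit_posts.csv", "w", newline="", encoding="utf-8") as f:
    writer = csv.writer(f)
    writer.writerow(["id", "title", "score", "num_comments"])
    writer.writerows(rows)

# Load: push the file to S3.
boto3.client("s3").upload_file("reddit_posts.csv", "my-raw-bucket", "raw/reddit_posts.csv")
```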
SparkingFlow
This project demonstrates how to use Apache Airflow to submit jobs to an Apache Spark cluster in different programming languages, using Python, Scala, and Java as examples.
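A sketch of how such submissions might look with SparkSubmitOperator from the apache-airflow-providers-apache-spark package; the paths, class name, and connection id are placeholders.

```python
# Airflow DAG submitting a Python job and a Scala/Java jar to a Spark cluster.
from datetime import datetime

from airflow import DAG
from airflow.providers.apache.spark.operators.spark_submit import SparkSubmitOperator

with DAG(
    dag_id="sparking_flow_sketch",
    start_date=datetime(2024, 1, 1),
    schedule=None,
    catchup=False,
) as dag:
    python_job = SparkSubmitOperator(
        task_id="python_job",
        conn_id="spark_default",
        application="jobs/python/wordcount.py",
    )

    scala_job = SparkSubmitOperator(
        task_id="scala_job",
        conn_id="spark_default",
        application="jobs/scala/target/wordcount.jar",
        java_class="com.example.WordCount",
    )

    python_job >> scala_job
```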
airscholar's Repositories
airscholar/ANP-Web
Sample static ANP Website template
airscholar/ApelBackend
airscholar/AwesomeLogin
airscholar/calendar.js
Port of the Python calendar.py module to JavaScript
airscholar/CurrencyConverter
airscholar/Developers-connect
A MERN stack app that serves as a social media platform for developers
airscholar/DirectorXY
airscholar/Discuss
airscholar/Docker-api
airscholar/dogehouse
Taking voice conversations to the moon 🚀
airscholar/Elibar
airscholar/ElixCards
airscholar/Elixcon
airscholar/MyApp
airscholar/NestMicroService
airscholar/NestTaskManager
airscholar/node-redis
airscholar/Personify
airscholar/Phishing-Detector-App
airscholar/Phishing-Website-Detector
airscholar/Springboot-h2-starter
airscholar/StoryBooks
airscholar/swagger-file-db
airscholar/TastyRecipes
airscholar/travel-management-system
airscholar/tutorials
DevOps by Example
airscholar/Webscraper-Stackoverflow