skr-learn

I'm Sumit, a Learner, Data Engineer and SQL enthusiast from India!

skr-learn's Stars

adidas/lakehouse-engine
The Lakehouse Engine is a configuration driven Spark framework, written in Python, serving as a scalable and distributed engine for several lakehouse algorithms, data flows and utilities for Data Products.
Language:Python20136
airscholar/modern-data-eng-dbt-databricks-azure
In this project, we setup and end to end data engineering using Apache Spark, Azure Databricks, Data Build Tool (DBT) using Azure as our cloud provider.
1510
nordquant/complete-dbt-bootcamp-zero-to-hero
Supplementary Materials for the The Complete dbt (Data Build Tool) Bootcamp Udemy course
Language:Shell382281
abdkumar/spotify-stream-analytics
Generate synthetic Spotify music stream dataset to create dashboards. Spotify API generates fake event data emitted to Kafka. Spark consumes and processes Kafka data, saving it to the Datalake. Airflow orchestrates the pipeline. dbt moves data to Snowflake, transforms it, and creates dashboards.
Language:Python6612
Pavanpawar2705/Ingesting-Real-time-Logistics-Data-in-MongoDB-with-Kafka-and-Python
In this project, I am crafting an innovative solution that involves a Kafka producer and consumer, employing data serialization/deserialization with Avro, and orchestrating the smooth ingestion of data into MongoDB. 🌐⚙️ The goal? Empowering data-driven decisions and ensuring real-time insights into logistics operations. 🌐📈
Language:Python54
Lal4Tech/Data-Engineering-With-AWS
Resources and projects from Udacity Data Engineering with AWS nano degree programme
Language:Jupyter Notebook2110
gajerabhavik915/DSA
Language:Python3
JagadeeshwaranM/Data_Engineering_Simplified
Language:Python623140
Satvik26/spotahome_ETL
Language:Python4
ABZ-Aaron/Reddit-API-Pipeline
Language:Python29383
DataTalksClub/data-engineering-zoomcamp
Free Data Engineering course!
Language:Jupyter Notebook23.7k5.1k
TheAlgorithms/Python
All Algorithms implemented in Python
Language:Python182k44k
Rishav273/kafkaPysparkAnalytics
Real-time ETL pipeline for financial data (kafka, pyspark) .
Language:Python81
AkashSingh3031/The-Complete-FAANG-Preparation
This repository contains all the DSA (Data-Structures, Algorithms, 450 DSA by Love Babbar Bhaiya, FAANG Questions), Technical Subjects (OS + DBMS + SQL + CN + OOPs) Theory+Questions, FAANG Interview questions, and Miscellaneous Stuff (Programming MCQs, Puzzles, Aptitude, Reasoning). The Programming languages used for demonstration are C++, Python, and Java.
Language:Jupyter Notebook10.3k2.3k
cM2908/leetcode-spark
Contains spark dataframe solutions of leetcode questions
214
siddheshkankal/MySql
13
cM2908/leetcode-sql
Leetcode SQL Solutions
Language:PLpgSQL16350
siddheshkankal/Hive_Challenge
Language:Jupyter Notebook33
siddheshkankal/Hive_Hbase_connection
1
siddheshkankal/Confluent_Kafka
Language:Python11
desaikun1996/New-York-City-Arrests-Data-Modelling-Analysis-and-Visualization
Analysis of New York State Police Department Arrests dataset. Created Dimensional Model for the provided dataset. Using Alteryx and Talend, built ETL pipelines to process, clean the data and create dimensions and facts in the destination database. Further, visualized the necessary details of the database using Tableau and PowerBI.
1812
NagarajuNakka/Hive-Class
1
NagarajuNakka/ineuron_kafka_assignment
Language:Python2
NagarajuNakka/hive_assignment
1
NagarajuNakka/Python-Assignment
Language:Python1
Ajay026/SQL-Project-for-Data-Analysis-part-1-7
Complete SQL Project for data analysis with source code.
331130
bigdatabysumitm/NotesOfYouTubeSQLSeries
289331
martandsingh/ApacheSpark
This repository will help you to learn about databricks concept with the help of examples. It will include all the important topics which we need in our real life experience as a data engineer. We will be using pyspark & sparksql for the development. At the end of the course we also cover few case studies.
Language:Python8859
itversity/data-engineering-spark
Language:Jupyter Notebook82121
commit-live-students/Data_Science_Masters_Program_2021
Language:Jupyter Notebook433149

skr-learn

skr-learn's Stars

adidas/lakehouse-engine

airscholar/modern-data-eng-dbt-databricks-azure

nordquant/complete-dbt-bootcamp-zero-to-hero

abdkumar/spotify-stream-analytics

Pavanpawar2705/Ingesting-Real-time-Logistics-Data-in-MongoDB-with-Kafka-and-Python

Lal4Tech/Data-Engineering-With-AWS

gajerabhavik915/DSA

JagadeeshwaranM/Data_Engineering_Simplified

Satvik26/spotahome_ETL

ABZ-Aaron/Reddit-API-Pipeline

DataTalksClub/data-engineering-zoomcamp

TheAlgorithms/Python

Rishav273/kafkaPysparkAnalytics

AkashSingh3031/The-Complete-FAANG-Preparation

cM2908/leetcode-spark

siddheshkankal/MySql

cM2908/leetcode-sql

siddheshkankal/Hive_Challenge

siddheshkankal/Hive_Hbase_connection

siddheshkankal/Confluent_Kafka

desaikun1996/New-York-City-Arrests-Data-Modelling-Analysis-and-Visualization

NagarajuNakka/Hive-Class

NagarajuNakka/ineuron_kafka_assignment

NagarajuNakka/hive_assignment

NagarajuNakka/Python-Assignment

Ajay026/SQL-Project-for-Data-Analysis-part-1-7

bigdatabysumitm/NotesOfYouTubeSQLSeries

martandsingh/ApacheSpark

itversity/data-engineering-spark

commit-live-students/Data_Science_Masters_Program_2021