shrikantpandey13's Stars
aksashu000/Kafka_Tutorial
This project helps beginners learn Kafka with ease.
subhamkharwal/ease-with-apache-spark
A series on learning Apache Spark (PySpark), with quick tips and workarounds for everyday problems.
raveendratal/ravi_azureadbadf
Ravi Azure ADB ADF Repository
sankamuk/PysparkCheatsheet
PySpark Cheatsheet
martandsingh/ApacheSpark
This repository helps you learn Databricks concepts through examples. It covers the important topics a data engineer needs in real-world work. We use PySpark and Spark SQL for development. At the end of the course, we also cover a few case studies.
OBenner/data-engineering-interview-questions
More than 2000 data engineer interview questions.
CICIFLY/Data_Engineering_Project_Portfolio
Data Engineering, Data Warehouse, Data Mart, Cloud Data, AWS, SAS, Redshift, S3
subhayansg/PySparkInJupyterNotebook
PySpark notes created in Jupyter Notebook in the Itversity labs.
hnawaz007/pythondataanalysis
Python data repo: Jupyter notebooks, Python scripts, and data.
hyunjoonbok/PySpark
PySpark functions and utilities with examples. Assists the ETL process of data modeling.
Soumyadeep-github/Spark-assignment
The aim of this project is to perform analysis on some (car crash) data using PySpark and make the entire process deployable using Docker.
danielbeach/data-engineering-practice
Data Engineering Practice Problems
Soumyadeep-github/Data-Ingestion
The aim of this project is to automate data ingestion from flat files such as CSV and compressed files such as GZIP into a database such as Postgres. The entire setup is automated using Docker and is fast because it uses multiprocessing.
alice203/outlier_detection-treatment
Code for the article series published in Towards Data Science on Medium.
subhayansg/SQL-art
All my SQL learning, uploaded as scripts and notes.
aksashu000/Spark_Tutorial
Contains Spark programs for a complete hands-on tutorial.
ashishpatel26/Kubeflow-installation-on-windows-10
Kubeflow installation on Windows 10/11
KodeWorker/Python-Design-Patterns
Learning design patterns in Python 3
palantir/pyspark-style-guide
This is a guide to PySpark code style presenting common situations and the associated best practices based on the most frequent recurring topics across the PySpark repos we've encountered.
ahammadshawki8/Object-Oriented-Programming-in-Python
❓❓ Does anybody know that Python is an object-oriented programming language? Learn all about OOP in Python with real-world examples. ✔
clovaai/deep-text-recognition-benchmark
Text recognition (optical character recognition) with deep learning methods, ICCV 2019
kumaransg/LLD
A curated collection of low-level design questions and implementations asked at major tech companies. Prepare for the LLD round and ace the interview.
ashishpatel26/Treasure-of-Transformers
💁 Awesome Treasure of Transformers Models for Natural Language Processing: contains papers, videos, blogs, and official repos, along with Colab notebooks. 🛫☑️
VickyAugust10/PysparkStructuredStreaming
First encounter with PySpark Structured Streaming on Databricks.
BasPH/data-pipelines-with-apache-airflow
Code for Data Pipelines with Apache Airflow
osin-vladimir/mlflow_tutorial
Managing machine learning life-cycle with MLflow tutorial
jessevig/bertviz
BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
graviraja/MLOps-Basics
piskvorky/gensim
Topic Modelling for Humans
obss/jury
Comprehensive NLP Evaluation System