pmarques95's Stars
cloudquery/cloudquery
The open source high performance ELT framework powered by Apache Arrow
subhamkharwal/ease-with-apache-spark
A series on learning Apache Spark (PySpark), with quick tips and workarounds for everyday problems
johnny-chivers/glue-full-course
yangshun/tech-interview-handbook
💯 Curated coding interview preparation materials for busy software engineers
rubenwap/coding-challenge-pyspark
Small test to learn how to use PySpark
priyankavergadia/GCPSketchnote
If you are looking to become a Google Cloud Engineer, then you are in the right place. GCPSketchnote is a series where I share Google Cloud concepts in a quick and easy-to-learn format.
rrakesh2690/dataengineering
OBenner/data-engineering-interview-questions
More than 2,000 data engineer interview questions.
apache/airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
ItIsMeCall911/Course-Piracy-Index
Course Piracy Index 🏴‍☠️
Saurav3218/Pyspark_Questions_SKS
This repo was created mostly for PySpark- and Hive-related interview questions.
siddd88/gcp-data-engineering
Batch processing, orchestration using Apache Airflow and Google Workflows, Spark Structured Streaming, and a lot more
qubole/sparklens
Qubole Sparklens tool for performance tuning Apache Spark
san089/goodreads_etl_pipeline
An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform.
san089/Udacity-Data-Engineering-Projects
A few projects related to Data Engineering, including Data Modeling, Infrastructure setup on cloud, Data Warehousing, and Data Lake development.
vigneshSs-07/Cloud-AI-Analytics
This repo contains details related to Data Engineering tech stacks in GCP
LearningJournal/Spark-Programming-In-Python
Apache Spark 3 - Spark Programming in Python for Beginners
apache/spark
Apache Spark - A unified analytics engine for large-scale data processing
vaquarkhan/PySpark-2-Day-Bootcamp-Workshop
vaquarkhan/spark-py-notebooks
Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
vaquarkhan/Spark-with-Python---My-learning-notes-
ETL pipeline using PySpark (Spark - Python)
vaquarkhan/py-spark-example
priyankuhazarika/live-train-status-through-python-using-railway-api
Track PNR status and live train status, and check seat availability, using Python from the command line
mrpowers-io/quinn
PySpark methods to enhance developer productivity 📣 👯 🎉
ahmadichsan/python-task-d11
Data Cleansing and Standardization
ParfaitG/DATA_MIGRATION
Binary, CSV, JSON, SQL, and XML data migration scripts in Java, PHP, Python, R, SAS, and VBA (MS Access and MS Excel).
GoogleCloudDataproc/spark-bigquery-connector
BigQuery data source for Apache Spark: Read data from BigQuery into DataFrames, write DataFrames into BigQuery tables.
cartershanklin/hive-scd-examples
How to manage Slowly Changing Dimensions with Apache Hive
SomanathSankaran/advanced-data-engineering-with-databricks
SomanathSankaran/spark_medium
My Git repo for CSV data