Pinned Repositories
HiveExamples
KafkaTutorial
MavenHadoop
MyWorkSpaceHadoop
MyWorkSpaceHadoop
OozieSamples
Oozie Samples
RemoveHeaderMR
Storm3--Hbase-HDFS-Hive-from-HortonWorks
Storm3- Hbase HDFS Hive from HortonWorks
khajaasmath786's Repositories
khajaasmath786/aws-airflow-demo
Project files for the post: Running PySpark Applications on Amazon EMR using Apache Airflow: Using the new Amazon Managed Workflows for Apache Airflow (MWAA) on AWS.
khajaasmath786/aws-doc-sdk-examples
Welcome to the AWS Code Examples Repository. This repo contains code examples used in the AWS documentation, AWS SDK Developer Guides, and more. For more information, see the Readme.rst file below.
khajaasmath786/aws-solutions-constructs
The AWS Solutions Constructs Library is an open-source extension of the AWS Cloud Development Kit (AWS CDK) that provides multi-service, well-architected patterns for quickly defining solutions
khajaasmath786/aws-terraform
khajaasmath786/aws-tutorial-code
AWS tutorial code.
khajaasmath786/cobrix
A COBOL parser and Mainframe/EBCDIC data source for Apache Spark
khajaasmath786/Complete-Python-3-Bootcamp
Course Files for Complete Python 3 Bootcamp Course on Udemy
khajaasmath786/data-engineer-handbook
This is a repo with links to everything you'd ever want to learn about data engineering
khajaasmath786/DataIngestion
My First Python - PYspark
khajaasmath786/DataPipelineFinalVersion
khajaasmath786/dbt-on-aws
dbt (data build tool) projects targeting AWS analytics services (redshift, glue, emr, athena) and open table formats
khajaasmath786/EMR_Studio_Hudi
Apache Hudi examples designed to be run on AWS Elastic Map Reduce (EMR) via. EMR Studio or EMR Notebooks
khajaasmath786/etl_with_mage_ai
An ETL data pipeline that extracts data from source and loads it to destination, automated using mage.ai
khajaasmath786/kafdrop
Kafka Web UI
khajaasmath786/ksqldb-client-iot-demo
IoT-inspired demo application using the Java client for ksqlDB
khajaasmath786/PySpark-Boilerplate
A boilerplate for writing PySpark Jobs
khajaasmath786/pyspark-example-project
Example project implementing best practices for PySpark ETL jobs and applications.
khajaasmath786/pyspark-examples
Pyspark RDD, DataFrame and Dataset Examples in Python language
khajaasmath786/pyspark-tutorial
A learning journey into the Python API of Apache Spark from an ETL-developer perspective
khajaasmath786/python-pyspark-framework
pyspark framework
khajaasmath786/shc
The Apache Spark - Apache HBase Connector is a library to support Spark accessing HBase table as external data source or sink.
khajaasmath786/spark-data-skew-tutorial
khajaasmath786/spark-schema-merge
Spark app to merge different schemas
khajaasmath786/spotify-etl-pipeline
khajaasmath786/tutorials
Just Announced - "Learn Spring Security OAuth":
khajaasmath786/Twitter-etl-pipeline
khajaasmath786/Udacity-nd027-Data-Lake
khajaasmath786/x12-parser
khajaasmath786/x12-parser-Asmath
khajaasmath786/youtubeetl