Pinned Repositories
airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
awesome-bigdata
A curated list of awesome big data frameworks, ressources and other awesomeness.
awesome-chatgpt-prompts
This repo includes ChatGPT prompt curation to use ChatGPT better.
awesome-public-datasets
A topic-centric list of HQ open datasets.
aws-eks-best-practices
A best practices guide for day 2 operations, including operational excellence, security, reliability, performance efficiency, and cost optimization.
aws-emr-best-practices
A best practices guide for using AWS EMR. The guide will cover best practices on the topics of cost, performance, security, operational excellence, reliability and application specific best practices across Spark, Hive, Hudi, Hbase and more.
aws-glue-libs
AWS Glue Libraries are additions and enhancements to Spark for ETL operations.
Home-depot-Kaggle
Home depot
lighter
REST API for Apache Spark on K8S
livy-for-spark3.1.1
harminder0209's Repositories
harminder0209/airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
harminder0209/awesome-bigdata
A curated list of awesome big data frameworks, ressources and other awesomeness.
harminder0209/awesome-chatgpt-prompts
This repo includes ChatGPT prompt curation to use ChatGPT better.
harminder0209/awesome-public-datasets
A topic-centric list of HQ open datasets.
harminder0209/aws-eks-best-practices
A best practices guide for day 2 operations, including operational excellence, security, reliability, performance efficiency, and cost optimization.
harminder0209/aws-emr-best-practices
A best practices guide for using AWS EMR. The guide will cover best practices on the topics of cost, performance, security, operational excellence, reliability and application specific best practices across Spark, Hive, Hudi, Hbase and more.
harminder0209/aws-glue-libs
AWS Glue Libraries are additions and enhancements to Spark for ETL operations.
harminder0209/lighter
REST API for Apache Spark on K8S
harminder0209/livy-for-spark3.1.1
harminder0209/cheat-sheets
This is my personal knowledge-base. Here you'll find code-snippets, technical documentation, and command reference for various tools, and technologies.
harminder0209/DeepLearningFromScratch
This is repo for implementing deep learning concepts
harminder0209/drawflow-vue3-example
Drawflow vue 3 example
harminder0209/eddy-backend
Administration backend for the Eddy Analytics platform
harminder0209/first-contributions
🚀✨ Help beginners to contribute to open source projects
harminder0209/gpt-engineer
Specify what you want it to build, the AI asks for clarification, and then builds it.
harminder0209/harminder0209
harminder0209/hudi
Upserts, Deletes And Incremental Processing on Big Data.
harminder0209/incubator-hop
Hop Orchestration Platform
harminder0209/metorikku
A simplified, lightweight ETL Framework based on Apache Spark
harminder0209/movie-web
A small web app for watching movies and shows easily
harminder0209/openai-cookbook
Examples and guides for using the OpenAI API
harminder0209/pulsar-perf-test
harminder0209/quinn
pyspark methods to enhance developer productivity 📣 👯 🎉
harminder0209/roop
one-click deepfake (face swap)
harminder0209/spark
Apache Spark - A unified analytics engine for large-scale data processing
harminder0209/spark-livy-on-airflow-workspace
A workspace to experiment with Apache Spark, Livy, and Airflow in a Docker environment.
harminder0209/sparkmagic
Jupyter magics and kernels for working with remote Spark clusters
harminder0209/superset
Apache Superset is a Data Visualization and Data Exploration Platform
harminder0209/system-design-101
Explain complex systems using visuals and simple terms. Help you prepare for system design interviews.
harminder0209/vue-dag
🏗 Data-driven directed acyclic graph (DAG) builder for Vue.js