Pinned Repositories
airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
awesome-bigdata
A curated list of awesome big data frameworks, ressources and other awesomeness.
awesome-chatgpt-prompts
This repo includes ChatGPT prompt curation to use ChatGPT better.
awesome-public-datasets
A topic-centric list of HQ open datasets.
aws-eks-best-practices
A best practices guide for day 2 operations, including operational excellence, security, reliability, performance efficiency, and cost optimization.
aws-emr-best-practices
A best practices guide for using AWS EMR. The guide will cover best practices on the topics of cost, performance, security, operational excellence, reliability and application specific best practices across Spark, Hive, Hudi, Hbase and more.
aws-glue-libs
AWS Glue Libraries are additions and enhancements to Spark for ETL operations.
Home-depot-Kaggle
Home depot
lighter
REST API for Apache Spark on K8S
livy-for-spark3.1.1
harminder0209's Repositories
harminder0209/Home-depot-Kaggle
Home depot
harminder0209/BigDataImplementation
harminder0209/EDA
harminder0209/githubCommands
harminder0209/KafkaImplementation
harminder0209/Machine-Learning-with-R-datasets
Formatted datasets for Machine Learning With R by Brett Lantz
harminder0209/temp
harminder0209/Twitter-Sentiment-Spark
Doing data analysis on tweets
harminder0209/TwitterSpark
Doing data analysis on tweets
harminder0209/workshop-rasax