Pinned Repositories
dr-elephant
Dr. Elephant is a job and flow-level performance monitoring and tuning tool for Apache Hadoop and Apache Spark
jupyterlab-sparkmonitor
JupyterLab extension that enables monitoring launched Apache Spark jobs from within a notebook
kafka-python
Python client for Apache Kafka
spark-redshift
Performant Redshift data source for Apache Spark
kafka-utils
mrjob
Run MapReduce jobs on Hadoop or Amazon Web Services
paasta
An open, distributed platform as a service
service_configuration_lib
Tron
Next generation batch process scheduling and management
yelp_kafka
An extension of the kafka-python package that adds features like multiprocess consumers.
88manpreet's Repositories
88manpreet/spark-redshift
Performant Redshift data source for Apache Spark
88manpreet/dr-elephant
Dr. Elephant is a job and flow-level performance monitoring and tuning tool for Apache Hadoop and Apache Spark
88manpreet/jupyterlab-sparkmonitor
JupyterLab extension that enables monitoring launched Apache Spark jobs from within a notebook
88manpreet/kafka-python
Python client for Apache Kafka