Pinned Repositories
airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
spark-redshift
Redshift data source for Apache Spark
docker
Docker - the open-source application container engine
druid
Column oriented distributed data store ideal for powering interactive applications
go
The Open Source Data Science Masters
incubator-airflow
Apache Airflow (Incubating)
jackar.github.io
luigi
Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.
our-boxen
Copy me for your team.
rabbitmq-management
RabbitMQ Management UI and HTTP API
jackar's Repositories
jackar/docker
Docker - the open-source application container engine
jackar/druid
Column oriented distributed data store ideal for powering interactive applications
jackar/go
The Open Source Data Science Masters
jackar/incubator-airflow
Apache Airflow (Incubating)
jackar/jackar.github.io
jackar/luigi
Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.
jackar/our-boxen
Copy me for your team.
jackar/pipeline
Real-time, End-to-End, Advanced Analytics and Machine Learning Recommendation Pipeline
jackar/rabbitmq-management
RabbitMQ Management UI and HTTP API
jackar/spark
Mirror of Apache Spark
jackar/spark-csv
CSV data source for Spark SQL and DataFrames
jackar/spark-redshift
Spark and Redshift integration