Pinned Repositories
airflow
AirFlow is a system to programmatically author, schedule and monitor data pipelines.
autokeras_tabular
Autokeras Tabular extension
erhv-lovelace
Homeassistant Lovelace Energy Reclaim Home Ventilation card
incubator-superset
Apache Superset (incubating) is a modern, enterprise-ready business intelligence web application
minicluster
Miniclusters wraps some of the hadoop-minicluster functionality in classes that set some defaults so they can be used in unit testing beyond Java, eg. python projects.
pam-jwt
PAM module than can authenticate users with json web token (JWT)
ranger-old
rangers3plugin
rdpgw
Remote Desktop Gateway in Go for deploying on Linux/BSD/Kubernetes
s3gw
S3 proxy that applies Apache Ranger policies and provides bucket notifications
bolkedebruin's Repositories
bolkedebruin/snakebite
A pure python HDFS client
bolkedebruin/X11RDP-RH-Matic
Install helper tool for xrdp/x11rdp
bolkedebruin/ambari
Mirror of Apache Ambari
bolkedebruin/flask-kerberos
Kerberos Authentication for Flask
bolkedebruin/jdk7u-jdk
bolkedebruin/luigi
Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.
bolkedebruin/minikdc
MiniKdc for use in tests, made standalone from Hadoop's implementation
bolkedebruin/rapidmask
Data masking tool that takes data from a JDBC ResultSet or a cdv and transforms this with a hashing mechanism
bolkedebruin/reair
ReAir is a collection of easy-to-use tools for replicating tables and partitions between Hive data warehouses.
bolkedebruin/spark
Mirror of Apache Spark
bolkedebruin/spark-hive014-compat
This replaces some classes in Spark to make it able to connect to secured clusters that use Hive >= 0.14