epilif1017a
Director Data Engineering @ Adidas & Invited Professor @ University of Minho
Adidas | University of MinhoPortugal
Pinned Repositories
m3d-api
Metadata Driven Development (m3d) is a cloud and platform agnostic framework for the automated creation, management and governance of data lakes.
m3d-engine
M3D Engine is a Spark application for the development of scalable data transformations and ingestions in data lakes.
big-data-open-os
The definitive open source big data operating system.
bigdatabenchmarks
Code and Documents related to the SSB+ Benchmark
lakehouse-engine
The Lakehouse Engine is a configuration driven Spark framework, written in Python, serving as a scalable and distributed engine for several lakehouse algorithms, data flows and utilities for Data Products.
m3d-api
m3d-engine
simple_secure_ansible_hdp_hadoop
Ansible Playbook to install a Kerberized Hortonworks Hadoop Cluster with some of the good practices from the documentation (e.g., ambari as non-root, dedicated mysql server, encrypted ambari database)
StarSchemaBenchmark
O'Neil et al.'s Star Schema Benchmark
epilif1017a's Repositories
epilif1017a/big-data-open-os
The definitive open source big data operating system.
epilif1017a/bigdatabenchmarks
Code and Documents related to the SSB+ Benchmark
epilif1017a/simple_secure_ansible_hdp_hadoop
Ansible Playbook to install a Kerberized Hortonworks Hadoop Cluster with some of the good practices from the documentation (e.g., ambari as non-root, dedicated mysql server, encrypted ambari database)
epilif1017a/lakehouse-engine
The Lakehouse Engine is a configuration driven Spark framework, written in Python, serving as a scalable and distributed engine for several lakehouse algorithms, data flows and utilities for Data Products.
epilif1017a/m3d-api
epilif1017a/m3d-engine
epilif1017a/StarSchemaBenchmark
O'Neil et al.'s Star Schema Benchmark