Pinned Repositories
lakehouse
Lakehouse: A comprehensive Lakehouse implementation with Apache Spark, Jupyter Notebook, MLflow and Apache Zeppelin. It aims to simplify the transition to Spark and cloud solutions for data professionals experienced with Python and pandas. It is developed alongside informative Medium articles.
big-mac-data
Data and methodology for the Big Mac index
ISLR-python
An Introduction to Statistical Learning (James, Witten, Hastie, Tibshirani, 2013): Python code
spark-workers
This repository contains the configuration for additional Spark Workers for the Personal Lakehouse. The additional Spark Worker instances have been set up on different machines to enhance the computational power of the Spark cluster.
thorify.github.io
thorify's Repositories
thorify/lakehouse
Lakehouse: A comprehensive Lakehouse implementation with Apache Spark, Jupyter Notebook, MLflow and Apache Zeppelin. It aims to simplify the transition to Spark and cloud solutions for data professionals experienced with Python and pandas. It is developed alongside informative Medium articles.
thorify/spark-workers
This repository contains the configuration for additional Spark Workers for the Personal Lakehouse. The additional Spark Worker instances have been set up on different machines to enhance the computational power of the Spark cluster.
thorify/thorify.github.io
thorify/big-mac-data
Data and methodology for the Big Mac index
thorify/ISLR-python
An Introduction to Statistical Learning (James, Witten, Hastie, Tibshirani, 2013): Python code