This is the implementation of a Minimum Data Lake, inspired by the work published in: https://github.com/kkiaune/emails-classification A complete description in Spanish can be found at: https://abxda.wordpress.com/2020/05/05/analizando-el-big-data-de-las-noticias-con-tu-micro-data-lake-baterias-incluidas/