data-lakes
There are 10 repositories under data-lakes topic.
kroudir/Data-Engineer-Nanodegree-Projects-Udacity
Projects done in the Data Engineer Nanodegree Program by Udacity.com
indexed-xyz/docs
Documentation for Getting Up and Running w/ indexed.xyz Data
BinariesGoalls/Udacity-Data-Engineering-Nanodegree
This is a repository to hold the files and notebooks produced throughout my Udacity's Nanodegree Data Engineering program.
hsnr-data-science/SEDAR
A Semantic Data Reservoir for Heterogeneous Datasets
superctj/pylon
Codebase and data for our paper - Pylon: Semantic Table Union Search in Data Lakes.
Data-Transparency-Task-Force/architecture-planning
Discussion of DTF software architecture Repository
wbsg-uni-mannheim/MannheimSearchJoinsEngine
A Search Join is a join operation which extends a user-provided table with additional attributes based on a large corpus of heterogeneous data originating from the Web or corporate intranets.
qusay-elewy/data-lakes-with-spark
Udacity Data Engineering Nanodegree - Project #4
tara-nguyen/modern-data-architecture
Follow along with materials in the book "Modern Data Architectures with Python: A practical guide to building and deploying data pipelines, data warehouses and data lakes" (Lipp, 2023)
vdmitrii/de_nd
Data Engineering Nanodegree Program