SparkSummit2019

Delta Lake Overview and Benefits

Primary language: Jupyter Notebook. License: Apache License 2.0 (Apache-2.0).

Delta Lake is an open-source storage layer that brings transactions, version control, and indexing to your data lakes. In this session, we look at what Delta Lake offers, why it matters to you, and how it can improve modern data engineering pipelines. Delta Lake provides snapshot isolation, which supports concurrent read and write operations and enables efficient inserts, updates, deletes, and rollbacks. It also allows background file optimization through compaction and Z-ordering, which improves query performance. The presentation covers how Delta Lake solves common data lake challenges and, most importantly, the new Delta Time Travel capability.
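To make the ideas of snapshot isolation, rollback, and time travel more concrete, here is a minimal conceptual sketch in plain Python. It is not Delta Lake's implementation or API: `ToyVersionedTable` and all of its methods are hypothetical names invented for illustration, and the toy keeps a full copy of the table per version, whereas Delta Lake records file-level changes in a transaction log over Parquet files. The only point is to show how an append-only history of committed versions lets a reader keep a stable view while writers continue, and lets you read or restore an earlier version.

```python
from typing import Dict, List, Optional

Row = Dict[str, object]


class ToyVersionedTable:
    """Hypothetical toy model: each commit appends an immutable snapshot."""

    def __init__(self) -> None:
        # Version 0 is the empty table; every commit adds one more entry.
        self._log: List[Dict[int, Row]] = [{}]

    @property
    def latest_version(self) -> int:
        return len(self._log) - 1

    def snapshot(self, version: Optional[int] = None) -> Dict[int, Row]:
        """Return a copy of the table as of `version` (latest if None)."""
        v = self.latest_version if version is None else version
        if not 0 <= v <= self.latest_version:
            raise ValueError(f"unknown version {v}")
        return {key: dict(row) for key, row in self._log[v].items()}

    def _commit(self, new_state: Dict[int, Row]) -> int:
        # Earlier snapshots are never modified, only appended to.
        self._log.append(new_state)
        return self.latest_version

    def upsert(self, key: int, row: Row) -> int:
        """Insert or update one row and return the new version number."""
        state = self.snapshot()
        state[key] = dict(row)
        return self._commit(state)

    def delete(self, key: int) -> int:
        """Delete one row (if present) and return the new version number."""
        state = self.snapshot()
        state.pop(key, None)
        return self._commit(state)

    def rollback(self, version: int) -> int:
        """Restore an old version by committing it as the newest version."""
        return self._commit(self.snapshot(version))


table = ToyVersionedTable()
v1 = table.upsert(1, {"city": "Amsterdam"})  # insert
reader_view = table.snapshot()               # a reader pins this version
v2 = table.upsert(1, {"city": "Berlin"})     # update
v3 = table.delete(1)                         # delete
v4 = table.rollback(v1)                      # roll back to the first insert

print(reader_view)         # the reader's view is unaffected by later writes
print(table.snapshot(v2))  # "time travel" to the version after the update
print(table.snapshot())    # latest version matches v1 after the rollback
```

In Delta Lake itself, reading an older version is exposed through reader options such as `versionAsOf` on a Delta table path; the sketch above only mirrors the idea, not the mechanics.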