/quickstart-datalake-47lining

AWS Quick Start Team

Primary LanguagePythonApache License 2.0Apache-2.0

quickstart-datalake-47lining

Data Lake Foundation on the AWS Cloud

This Quick Start deploys a data lake foundation that integrates Amazon Web Services (AWS) services such as Amazon Simple Storage Service (Amazon S3), Amazon Redshift, Amazon Kinesis, Amazon Athena, Amazon Elasticsearch Service (Amazon ES), and Amazon QuickSight.

The data lake foundation uses these AWS services to provide data submission, ingest processing, dataset management, data transformation, aggregation, and analysis, search, publishing, and visualization capabilities. Once this foundation is in place, you may choose to augment the data lake with ISV and SaaS tools.

The deployment also includes an optional wizard and a sample dataset that is loaded into Amazon Redshift and Kinesis streams to demonstrate data lake capabilities.

The AWS CloudFormation templates included with the Quick Start automate the following:

  • Deploying the data lake foundation into a new virtual private cloud (VPC)
  • Deploying the data lake foundation into an existing VPC in your AWS account

You can also use the AWS CloudFormation templates as a starting point for your own implementation.

Quick Start architecture for data lake foundation on AWS

For architectural details, best practices, step-by-step instructions, and customization options, see the deployment guide.

To post feedback, submit feature ideas, or report bugs, use the Issues section of this GitHub repo. If you'd like to submit code for this Quick Start, please review the AWS Quick Start Contributor's Kit.