This repository provides a simple deployment that sets up an environment for demoing Amazon DataZone.
The instructions below deploy the architecture depicted in this diagram.
You can use this architecture to learn and demonstrate publishing and subscribing workflows, as well as other aspects of DataZone.
The sample data used for this demo is fake data representing a few data sets that a pharmaceutical company might use during drug development research and clinical trials.
- Launch the CloudFormation stack
- Run the following from the terminal of the Cloud9 environment that was deployed by the CloudFormation stack
pip install boto3
python DataZone_Demo/1_lakeformation_s3_configuration.py
- Deploy a DataZone domain from the AWS console
  - Navigate to the DataZone home page and click on Create domain
  - Provide a name for the domain
  - Under the Quick setup section, select the check box next to Set-up this account for data consumption and publishing
  - Click on Create domain
- Update the datazone_domain_id variable in the 2_dataZone_configuration.py and 3_dataZone_configuration.py files
To find the domain id of the DataZone domain you just deployed, look at the URL of the DataZone portal.
For example, if the URL is https://dzd_498d049z6o1gkn.datazone.us-east-1.on.aws, the domain id is dzd_498d049z6o1gkn.
Once you have updated the variable in both files with the domain id, save the files.
- Run the following from the terminal of the Cloud9 environment
python DataZone_Demo/2_dataZone_configuration.py
python DataZone_Demo/3_dataZone_configuration.py
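
If you would rather not copy the domain id out of the portal URL by hand, the rule described above (the domain id is the first label of the portal host name) can be sketched in a few lines of Python. This is only an illustration and is not part of the repository: the function name domain_id_from_portal_url is hypothetical, and the check for a "dzd" prefix is an assumption based on the example URL above.

```python
from urllib.parse import urlparse


def domain_id_from_portal_url(portal_url: str) -> str:
    """Return the first host label of a DataZone portal URL.

    Hypothetical helper for illustration only; it is not part of this repo.
    """
    # netloc is empty when the scheme separator is malformed (e.g. "https//...").
    host = urlparse(portal_url).netloc
    if not host:
        raise ValueError(f"Not a full URL (expected https://...): {portal_url!r}")

    # The domain id is everything before the first dot in the host name.
    domain_id = host.split(".")[0]

    # Assumption: domain ids start with "dzd", as in the example URL above.
    if not domain_id.startswith("dzd"):
        raise ValueError(f"Host does not look like a DataZone portal: {host!r}")
    return domain_id


if __name__ == "__main__":
    url = "https://dzd_498d049z6o1gkn.datazone.us-east-1.on.aws"
    print(domain_id_from_portal_url(url))  # -> dzd_498d049z6o1gkn
```

The printed value is what you would assign to datazone_domain_id in both configuration scripts before running them.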