Ace the DP-203 exam with advanced data engineering skills
This is the code repository for Azure Data Engineer Associate Certification Guide (DP-203), Second ed., published by Packt. It contains all the supporting project files necessary to work through the course from start to finish.
As cloud adoption surges, data engineering expertise is in high demand. Companies increasingly rely on cloud-based data solutions, driving growth in data engineering jobs. This competitive landscape pushes both aspiring and experienced data engineers to showcase their skills. If you are a data engineer, data architect, cloud architect, solution architect, or DataOps professional who is new to Azure or interviewing with companies working on Azure technologies, this book will help you get hands-on experience with Azure data technologies.
In this book, Azure Data Engineer Associate Certification Guide (DP-203), Second ed., you will begin by exploring the basics of Azure, then dive deeper into storage, compute, security, monitoring, high availability, and more, and finally apply your knowledge through practice questions and answers. Altogether, it offers step-by-step explanations of essential concepts, practical examples, and self-assessment questions.
This book covers the following exciting features:
- Gain intermediate-level knowledge of the Azure data infrastructure.
- Design and implement data lake solutions with batch and stream pipelines.
- Identify the partition strategies available in Azure storage technologies.
- Implement different table geometries in Azure Synapse Analytics.
- Use the transformations available in T-SQL, Spark, and Azure Data Factory.
- Use Azure Databricks or Synapse Spark to process data using Notebooks.
- Design security using RBAC, ACL, encryption, data masking, and more.
- Monitor and optimize data pipelines with debugging tips.
If you feel this book is for you, get your copy today!
Giacinto Palmieri has been working in the IT sector for more than 35 years (initially in his native Italy and then in London, where he moved 23 years ago) as a trainer, software developer, data engineer, and consultant. In the past three years, he has been focusing mostly on his activity as a Microsoft Certified Trainer, with a particular focus on Azure data services, Azure development, and the Power Platform. Outside of IT, he holds an MA in Philosophy and a PhD in Translation Studies and sometimes performs as a stand-up comedian, even bringing several shows to the Edinburgh Fringe Festival (a fact he tends to hide from his IT course participants, lest they expect a laugh-per-minute experience).
Surendra Mettapalli is a Principal Data Engineer/Scientist with extensive experience leading Data teams in the UK and India. He specializes in designing and implementing innovative data solutions in large and complex environments. His team's expertise spans Microsoft Fabric, Azure Data Factory, Azure Synapse Analytics, Databricks, and Power BI. With a deep understanding of Data Engineering, Cloud Architecture, and AI-driven applications, he has successfully collaborated with diverse clients across technology, finance, retail, and government sectors. His contributions have been instrumental in delivering some of the largest and most impactful data projects and empowering organizations to leverage Azure's full potential and optimize their business operations. Surendra holds various certifications from Microsoft Azure in data engineering, data science, and AI streams, as well as certifications from Databricks. His notable contribution to "Optimizing COVID-19 Interventions with Evolutionary AI" has gained positive recognition within the industry. Beyond his professional endeavors, he is passionate about sharing his knowledge and experience with the community. He regularly contributes to industry events and forums, guiding others through the complexities of the data landscape.
Newton Alex leads several Azure Data Analytics teams in Microsoft, India. His team contributes to technologies including Azure Synapse, Azure Databricks, Azure HDInsight, and many open source technologies, including Apache YARN, Apache Spark, and Apache Hive. He started using Hadoop while at Yahoo, USA, where he helped build the first batch processing pipelines for Yahoo’s ad serving team. After Yahoo, he became the leader of the big data team at Pivotal Inc., USA, where he was responsible for the entire open source stack of Pivotal Inc. He later moved to Microsoft and started the Azure Data team in India. He has worked with several Fortune 500 companies to help build their data systems on Azure.
If you've found this book useful, you might want to check out some of our other titles: