Introduction

Introduction

This short lesson summarizes the topics we'll be covering in section 01 and why they'll be important to you as a Data Scientist.

Objectives

You will be able to:

  • Understand and explain what is covered in this section
  • Understand and explain why the section will help you to become a data scientist

Setting the Stage

We start this section by looking at the kinds of problems that data science can solve and by giving a sense of the end to end process of "doing data science". We then get you set up with a professional data science environment with Git, GitHub, the command line, and the Anaconda Python distribution so you can start using industry standard tools from day 1.

We also run you through the workflow that we'll be using for the projects so you can get comfortable with how to access and update content and get some practice using the industry standard tools we just set you up with! And then to bring it all together, we ask you to "code along" with us to complete your first (very simple) data science project.

If you're new to Python (or programming), you may be a little overwhelmed at this point. Don't worry. We'll spend all of the rest of the course practicing and deepening your understanding of the materials we introduced in the first part of section 01.

Coding in Python

Next, we introduce some of the most important elements of programming in Python. We start from the basics, but move fairly quickly. We cover variables, common data types, conditionals, lists and dictionaries. This may not all seem immediately relevant to data science, but it's designed to give you the tools that you'll need for all of your future projects in Python.

Visualize that data

Finally, we take a little bit of time to introduce you to Matplotlib - one of the most popular tools in Python for generating visualizations.

You will learn to plot some basic visualizations for simple datasets. These will include plotting bar graphs, histograms and scatter plots and adding axes labeling and title information to the plot.

End of Section Project

We finish the section with a challenging project that brings together all of the skills introduced in section 1, by doing a basic analysis of Shakespeare's classic play: Macbeth.

Summary

Remember, it's OK to feel a little uncomfortable. For some students, section 01 will be the most difficult as it introduces so many new concepts at once, but you'll continue to practice these day after day, until they become second nature. For those of you with a strong background in computing, please bear with us as we help everyone else to catch up. We've tried to introduce additional challenges for students who are finding the labs easy, so look out for the "Level Up (optional)" section in some of the labs to get some deeper practice with Python and visualization.