We have learned the basics of dataframe calculation, aggregation, and summarization in the lesson. In this lab you will practice the functions covered in the lessons and learn more advanced ones by solving a series of challenges.
In this lab we also want you to focus on refining your problem-solving process in addition to completing the challenges. Data analysis is an iterative problem-solving process. You will need to break down a complex problem into a subset of less complex problems, then tackle each sub problems in a progressive order. You may need to further break down the sub problems into sub-sub problems and so on so forth depending on the complexity of those problems. You will keep breaking down the problems until you are able to solve each of them.
During the problem-solving process, you are required to constantly test your solutions and reflect on your goals and problem-solving strategies. You may be required to revise the problem-solving strategies and sometimes redefine the problem subsets based on your latest discoveries.
Keep in mind the general guidelines above when you conduct data analysis in this lab and in the future. You need to learn this scientific research methodology if you want to become a successful data analyst. For a detailed explanation of the iterative data analysis workflow, watch this YouTube video.
You are required to watch the above video before proceeding.
Launch main.ipynb
, challenge-1.ipynb
, challenge-2.ipynb
, and challenge-3.ipynb
in the your-code
directory of this lab. Take the exercises following the step-by-step instructions.
challenge-1.ipynb
,challenge-2.ipynb
andchallenge-3.ipynb
are mandatory challenges that you must submit and will help you acquire the basics of dataframe calculation and transformation.
Upon completion, add your deliverables to git. Then commit git and push your branch to the remote.