/python_data_science_course_san_diago_2019

Free Mooc studying data science from university of san diego, now onto the 2019 version

Primary LanguageJupyter Notebook

Into to python for data science

Python intro to data science course from university of san diago

Course location:- https://courses.edx.org/courses/course-v1:UCSanDiegoX+DSE200x+1T2019a/courseware/fe01d78b51774259ada67d772eee5f1c/415f9db79b3b4564930c300789bfbba9/?child=first

This reposisotry covers my own learning and notes throuhgout the course

Outline of course taken from official site:-

Course Outline

The course is broken into 10 weeks. The beginning of the course is heavily focused on learning the basic tools of data science, but we firmly believe that you learn the most about data science by doing data science. So the latter half of the course is a combination of working on large projects and introductions to advanced data analysis techniques.

Week 1 - Introduction: Welcome and overview of the course. Introduction to the data science process and the value of learning data science. Week 2 - Background: In this optional week, we provide a brief background in python or unix to get you up and running. If you are already familiar with python and/or unix, feel free to skip this content. Week 3 - Jupyter and Numpy: Jupyter notebooks are one of the most commonly used tools in data science as they allow you to combine your research notes with the code for the analysis. After getting started in Jupyter, we'll learn how to use numpy for data analysis. numpy offers many useful functions for processing data as well as data structures which are time and space efficient. Week 4 - Pandas: Pandas, built on top of numpy, adds data frames which offer critical data analysis functionality and features. Week 5 - Visualization: When working with large datasets, you often need to visualize your data to gain a better understanding of it. Also, when you reach conclusions about the data, you'll often wish to use visualizations to present your results. Week 6 - Mini Project: With the tools of Jupyter notebooks, numpy, pandas, and Visualization, you're ready to do sophisticated analysis on your own. You'll pick a dataset we've worked with already and perform an analysis for this first project. Week 7 - Machine Learning: To take your data analysis skills one step further, we'll introduce you to the basics of machine learning and how to use sci-kit learn - a powerful library for machine learning. Week 8 - Working with Text and Databases: You'll find yourself often working with text data or data from databases. This week will give you the skills to access that data. For text data, we'll also give you a preview of how to analyze text data using ideas from the field of Natural Language Processing and how to apply those ideas using the Natural Language Processing Toolkit (NLTK) library. Week 9 and 10 - Final Project: These weeks let you showcase all your new skills in an end-to-end data analysis project. You'll pick the dataset, do the data munging, ask the research questions, visualize the data, draw conclusions, and present your results.