Data Science in Practice is a collection of materials for learning introductory data science.
This repository is the source for building the Data Science in Practice website.
Specifically, the website contains:
- tutorials, which introduce key topics for doing data science
- assignments, which are problem sets that can be worked through
- projects, which describe how to pursue independent analysis projects
These materials serve as a public version of materials from the COGS 108 class.
This repository has the tools for building the website. To do so, it copies materials from the COGS 108 organization, and then updates and organizes them for the public website. See the instructions page for notes on how this works.
The built version of the book is then posted to the website repository for hosting.
This project uses the Python programming language, and requires Python >= 3.6.
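As a quick way to confirm the version requirement above, here is a minimal sketch of a check that can be run before working through the materials. The function name `python_is_supported` and the constant `MIN_PYTHON` are hypothetical names chosen for illustration; they are not part of this repository.

```python
import sys

# Minimum version stated in this README (Python >= 3.6).
MIN_PYTHON = (3, 6)


def python_is_supported(version_info=sys.version_info, minimum=MIN_PYTHON):
    """Return True if the (major, minor) version meets the minimum.

    Hypothetical helper for illustration; not part of the repository.
    """
    return tuple(version_info[:2]) >= tuple(minimum)


# Report whether the running interpreter satisfies the requirement.
current = "{}.{}".format(sys.version_info[0], sys.version_info[1])
if python_is_supported():
    print("Python " + current + " meets the >= 3.6 requirement.")
else:
    print("Python " + current + " is too old; 3.6 or newer is required.")
```

Comparing tuples rather than version strings avoids mistakes such as treating "3.10" as older than "3.6".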
Materials are written and available as Jupyter Notebooks.
Tutorials & assignments require packages from the scientific Python ecosystem. These dependencies can all be installed using the Anaconda distribution. Details and instructions on the dependencies and how to get them are available in the materials.
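To see whether a set of dependencies is already installed, a sketch along the following lines may help. The package names in `ASSUMED_PACKAGES` are an assumption made for illustration (common scientific Python packages); the authoritative list is the one given in the materials themselves. The helper name `missing_packages` is likewise hypothetical.

```python
from importlib.util import find_spec

# ASSUMPTION: an illustrative list of scientific Python packages.
# The actual dependencies are documented in the materials.
ASSUMED_PACKAGES = ["numpy", "pandas", "matplotlib"]


def missing_packages(names):
    """Return the top-level package names that cannot be found.

    Hypothetical helper for illustration. find_spec returns None
    when a top-level module is not installed, without importing it.
    """
    return [name for name in names if find_spec(name) is None]


missing = missing_packages(ASSUMED_PACKAGES)
if missing:
    print("Not installed: " + ", ".join(missing))
else:
    print("All checked packages are installed.")
```

This only checks that each package can be located, not that its version is suitable.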
The website is created using JupyterBook.
This repository contains the following sections:
- dsip/ contains the content of the website, including sub-sections:
  - docs/ contains the source for written sections of the site
  - tutorials/ contains tutorial notebooks which introduce key topics for doing data science
  - assignments/ contains assignment notebooks with problem sets that can be worked through
  - projects/ contains information on how to pursue independent analysis projects
- paper/ contains a copy of the paper that describes these materials
This project is described in the following paper:
Donoghue T, Voytek B, & Ellis S (2022). Course Materials for Data Science in Practice. Journal of Open Source Education, 5(51), 121. DOI: 10.21105/jose.00121
Direct Link: https://doi.org/10.21105/jose.00121
These materials are made freely available, and are licensed under a CC-BY 4.0 license.