/Data

Data for the Foundations of Applied Mathematics lab curriculum.

Primary LanguageHTML

These data files correspond to the Foundations of Applied Mathematics lab curriculum. Instead of downloading or cloning this repository directly, use the download_data.sh script included in the Student-Materials repository.

$ cd /path/to/folder
$ bash download_data.sh

The download requires git, which you can download at https://git-scm.com/downloads.

Below the data files are listed by volume (in the order given by each table of contents), then alphabetically by lab folder.

Labs by Volume

Python Essentials

Lab Title Folder/Data File Source
Introduction to NumPy NumpyIntro/grid.npy https://projecteuler.net/problem=11
Introduction to Matplotlib MatplotlibIntro/FARS.npy Gathered from https://www.nhtsa.gov/FARS
Exceptions and File Input/Output Exceptions_FileIO/hello_world.txt Written by hand
Exceptions and File Input/Output Exceptions_FileIO/cf_example1.txt Written by hand
Exceptions and File Input/Output Exceptions_FileIO/cf_example2.txt Written by hand
Profiling Profiling/names.txt https://projecteuler.net/problem=22
Profiling Profiling/triangle.txt https://projecteuler.net/problem=18
Profiling Profiling/triangle_large.txt https://projecteuler.net/problem=67
Data Visualization DataVisualization/anscombe.npy https://en.wikipedia.org/wiki/Anscombe's_quartet. Original citation: Anscombe, F. J. (1973). "Graphs in Statistical Analysis". American Statistician. 27 (1): 17–21. JSTOR 2682899
Data Visualization DataVisualization/MLB.npy http://wiki.stat.ucla.edu/socr/index.php/SOCR_Data_MLB_HeightsWeights (modified)
Data Visualization DataVisualization/earthquakes.npy Gathered from https://earthquake.usgs.gov/earthquakes/search/
Data Visualization DataVisualization/countries.npy Combined from https://en.wikipedia.org/wiki/List_of_countries_by_GDP_(nominal), http://www.averageheight.co/, and https://en.wikipedia.org/wiki/List_of_countries_and_dependencies_by_population

Data Science Essentials

Lab Title Folder/Data File Source
Regular Expressions RegularExpressions/fake_contacts.txt Generated by http://www.Generatedata.com/
SQL 1: Introduction SQL1/student_info.csv Written by hand
SQL 1: Introduction SQL1/student_grades.csv Written by hand
SQL 1: Introduction SQL1/us_earthquakes.csv
SQL 2 (The Sequel) SQL2/students.db Combined from student_info.csv, student_grades.csv, and the other tables in SQL 1
Web Technologies WebTechnologies/nyc_traffic.json Modified from https://data.cityofnewyork.us/Public-Safety/NYPD-Motor-Vehicle-Collisions/h9gi-nx95, gathered August 2017
Introduction to Beautiful Soup WebScraping1/example.html https://www.example.com
Introduction to Beautiful Soup WebScraping1/san_diego_weather.html
Introduction to Beautiful Soup WebScraping1/large_banks_index.html
Introduction to Beautiful Soup WebScraping1/large_banks_data.html
Pandas 1: Introduction Data Science Essentials Pandas1/crime_data.csv
Pandas 1: Introduction Pandas1/final_accidents2.pickle
Pandas 1: Introduction Pandas1/final_drivers.pickle
Pandas 2: Plotting Pandas2/final_accidents2.pickle
Pandas 2: Plotting Pandas2/final_drivers.pickle
Pandas 2: Plotting Pandas2/new_york_crime_clean.csv
Pandas 3: Grouping Pandas3/Ohio_1999.csv
Pandas 3: Grouping Pandas3/time_usage.txt
Pandas 4: Time Series Pandas4/DJIA.csv
Pandas 4: Time Series Pandas4/finances.csv
Pandas 4: Time Series Pandas4/paychecks.csv
Pandas 4: Time Series Pandas4/website_traffic.csv
Pandas 5: GeoPandas Pandas5/airports.csv
Data Augmentation DataAugmentation/chopin.wav
Data Augmentation DataAugmentation/creditcard.npy https://www.kaggle.com/mlg-ulb/creditcardfraud
Data Augmentation DataAugmentation/lion.png
Data Augmentation DataAugmentation/music.zip U. Marchand, G. Peeters, "The Extended Ballroom Dataset", in ISMIR 2016 Late-Breaking Session, New-York, USA. (http://anasynth.ircam.fr/home/media/ExtendedBallroom)
Data Augmentation DataAugmentation/restaurant-ambience.wav
Data Augmentation DataAugmentation/styles.npy U. Marchand, G. Peeters, "The Extended Ballroom Dataset", in ISMIR 2016 Late-Breaking Session, New-York, USA. (http://anasynth.ircam.fr/home/media/ExtendedBallroom)

Volume 1

Lab Title Folder/Data File Source
Linear Transformations LinearTransformations/horse.npy Generated
Least Squares and Computing Eigenvalues LeastSquares_Eigenvalues/circle.npy Generated
Least Squares and Computing Eigenvalues LeastSquares_Eigenvalues/ellipse.npy Generated
Least Squares and Computing Eigenvalues LeastSquares_Eigenvalues/housing.npy Gathered from https://www.fhfa.gov/DataTools/Downloads/Pages/House-Price-Index.aspx
Image Segmentation ImageSegmentation/dream.png
Image Segmentation ImageSegmentation/dream_gray.png
The SVD and Image Compression SVD_ImageCompression/hubble.jpg https://www.nasa.gov/multimedia/imagegallery/image_feature_2099.html
The SVD and Image Compression SVD_ImageCompression/hubble_gray.jpg Modification of hubble.jpg
Facial Recognition FacialRecognition/faces94.zip http://cswww.essex.ac.uk/mv/allfaces/faces94.html
Differentiation Differentiation/plane.npy Generated
Conditioning and Stability Conditioning_Stability/stability_data.npy Generated
The PageRank Algorithm PageRank/web_stanford.txt Subset of web-Stanford.txt from http://snap.stanford.edu/data/web-Stanford.html
The PageRank Algorithm PageRank/ncaa2010.csv Scraped from https://www.sports-reference.com
The PageRank Algorithm PageRank/ncaa2011.csv Scraped from https://www.sports-reference.com
The PageRank Algorithm PageRank/ncaa2012.csv Scraped from https://www.sports-reference.com
The PageRank Algorithm PageRank/ncaa2013.csv Scraped from https://www.sports-reference.com
The PageRank Algorithm PageRank/ncaa2014.csv Scraped from https://www.sports-reference.com
The PageRank Algorithm PageRank/ncaa2015.csv Scraped from https://www.sports-reference.com
The PageRank Algorithm PageRank/ncaa2016.csv Scraped from https://www.sports-reference.com
The PageRank Algorithm PageRank/ncaa2017.csv Scraped from https://www.sports-reference.com
The PageRank Algorithm PageRank/top250movies.txt Subset of movie_data.txt, scraped with imdbpy (https://imdbpy.sourceforge.io/)
The Drazin Inverse DrazinInverse/social_network.csv Adapted from https://en.wikipedia.org/wiki/Zachary%27s_karate_club

Volume 2

Lab Title Folder/Data File Source
Linked Lists LinkedLists/english.txt Generated
Binary Search Trees BinaryTrees/english.txt Generated
Nearest Neighbor Search NearestNeighbor/mnist_subset.npz Subset of the MNIST database from http://yann.lecun.com/exdb/mnist/
Breadth-first Search BreadthFirstSearch/movie_data.txt Scraped with imdbpy (https://imdbpy.sourceforge.io/)
Breadth-first Search BreadthFirstSearch/movie_data_small.txt Subset of movie_data.txt.
Markov Chains MarkovChains/yoda.txt Gathered from http://www.imsdb.com/scripts/Star-Wars-The-Empire-Strikes-Back.html, http://www.imsdb.com/scripts/Star-Wars-Return-of-the-Jedi.html, http://www.imsdb.com/scripts/Star-Wars-The-Phantom-Menace.html, http://www.imsdb.com/scripts/Star-Wars-Attack-of-the-Clones.html, and http://www.imsdb.com/scripts/Star-Wars-Revenge-of-the-Sith.html
The Discrete Fourier Transform FourierTransform/tada.wav https://www.youtube.com/watch?v=bjxf-eQWKoo
The Discrete Fourier Transform FourierTransform/mystery_chord.wav Generated
The Discrete Fourier Transform FourierTransform/CGC.wav Generated
The Discrete Fourier Transform FourierTransform/GCG.wav Generated
The Discrete Fourier Transform FourierTransform/balloon.wav Recorded at BYU
The Discrete Fourier Transform FourierTransform/chopin.wav
The Discrete Fourier Transform FourierTransform/noisy1.wav
The Discrete Fourier Transform FourierTransform/noisy2.wav
The Discrete Fourier Transform FourierTransform/vuvuzela.wav Part of https://www.youtube.com/watch?v=g_0NoBKWCT8
The Discrete Fourier Transform FourierTransform/noisy_face.png Sample from faces94.zip
The Discrete Fourier Transform FourierTransform/license_plate.png
Introduction to Wavelets Wavelets/mandrill.png http://sipi.usc.edu/database/
Introduction to Wavelets Wavelets/woman_darkhair.png
Introduction to Wavelets Wavelets/noisy_darkhair.png
Introduction to Wavelets Wavelets/uncompressed_finger.png
Polynomial Interpolation PolynomialInterpolation/airdata.npy
Gradient Descent Methods GradientMethods/linregression.txt
Gradient Descent Methods GradientMethods/challenger.npy
Simplex Simplex/productMix.npy
CVXOPT CVXOPT_Intro/ForestData.npy
Interior Point 1: Linear Programs InteriorPoint_Linear/simdata.txt
Interior Point 2: Quadratic Programs InteriorPoint_Quadratic/portfolio.txt

Labs by Folder Name

Folder/Data File Lab Title Volume Source
BinaryTrees/english.txt Binary Search Trees Volume 2 Generated
BreadthFirstSearch/movieData.txt Breadt-first Search Volume 2 Scraped with imdbpy (https://imdbpy.sourceforge.io/)
CVXOPT_Intro/ForestData.npy CVXOPT Volume 2
Conditioning_Stability/stability_data.npy Conditioning and Stability Volume 1 Generated
DataAugmentation/chopin.wav Data Augmentation Data Science Essentials
DataAugmentation/creditcard.npy Data Augmentation Data Science Essentials (https://www.kaggle.com/mlg-ulb/creditcardfraud)
DataAugmentation/lion.png Data Augmentation Data Science Essentials
DataAugmentation/music.zip Data Augmentation Data Science Essentials U. Marchand, G. Peeters, "The Extended Ballroom Dataset", in ISMIR 2016 Late-Breaking Session, New-York, USA. (http://anasynth.ircam.fr/home/media/ExtendedBallroom)
DataAugmentation/restaurant-ambience.wav Data Augmentation Data Science Essentials
DataAugmentation/styles.npy Data Augmentation Data Science Essentials U. Marchand, G. Peeters, "The Extended Ballroom Dataset", in ISMIR 2016 Late-Breaking Session, New-York, USA. (http://anasynth.ircam.fr/home/media/ExtendedBallroom)
DataVisualization/anscombe.npy Data Visualization Python Essentials https://en.wikipedia.org/wiki/Anscombe's_quartet. Original citation: Anscombe, F. J. (1973). "Graphs in Statistical Analysis". American Statistician. 27 (1): 17–21. JSTOR 2682899
DataVisualization/MLB.npy Data Visualization Python Essentials http://wiki.stat.ucla.edu/socr/index.php/SOCR_Data_MLB_HeightsWeights (modified)
DataVisualization/earthquakes.npy Data Visualization Python Essentials Gathered from https://earthquake.usgs.gov/earthquakes/search/
DataVisualization/countries.npy Data Visualization Python Essentials Combined from https://en.wikipedia.org/wiki/List_of_countries_by_GDP_(nominal), http://www.averageheight.co/, and https://en.wikipedia.org/wiki/List_of_countries_and_dependencies_by_population
Differentiation/plane.npy Differentiation Volume 1 Generated
DrazinInverse/social_network.csv The Drazin Inverse Volume 1 Adapted from https://en.wikipedia.org/wiki/Zachary%27s_karate_club
Exceptions_FileIO/hello_world.txt Exceptions and File Input/Output Python Essentials Written by hand
Exceptions_FileIO/cf_example1.txt Exceptions and File Input/Output Python Essentials Written by hand
Exceptions_FileIO/cf_example2.txt Exceptions and File Input/Output Python Essentials Written by hand
FacialRecognition/faces94.zip Facial Recognition Volume 1 http://cswww.essex.ac.uk/mv/allfaces/faces94.html
FourierTransform/tada.wav The Discrete Fourier Transform Volume 2 https://www.youtube.com/watch?v=bjxf-eQWKoo
FourierTransform/mystery_chord.wav The Discrete Fourier Transform Volume 2 Generated
FourierTransform/CGC.wav The Discrete Fourier Transform Volume 2 Generated
FourierTransform/GCG.wav The Discrete Fourier Transform Volume 2 Generated
FourierTransform/balloon.wav The Discrete Fourier Transform Volume 2 Recorded at BYU
FourierTransform/chopin.wav The Discrete Fourier Transform Volume 2
FourierTransform/noisy1.wav The Discrete Fourier Transform Volume 2
FourierTransform/noisy2.wav The Discrete Fourier Transform Volume 2
FourierTransform/vuvuzela.wav The Discrete Fourier Transform Volume 2 Part of https://www.youtube.com/watch?v=g_0NoBKWCT8
FourierTransform/noisy_face.png The Discrete Fourier Transform Volume 2 Sample from faces94.zip
FourierTransform/license_plate.png The Discrete Fourier Transform Volume 2
GradientMethods/linregression.txt Gradient Descent Methods Volume 2
GradientMethods/challenger.npy Gradient Descent Methods Volume 2
ImageSegmentation/dream.png Image Segmentation Volume 1
ImageSegmentation/dream_gray.png Image Segmentation Volume 1
InteriorPoint_Linear/simdata.txt Interior Point 1: Linear Programs Volume 2
InteriorPoint_Quadratic/portfolio.txt Interior Point 2: Quadratic Programs Volume 2
LeastSquares_Eigenvalues/circle.npy Least Squares and Computing Eigenvalues Volume 1 Generated
LeastSquares_Eigenvalues/ellipse.npy Least Squares and Computing Eigenvalues Volume 1 Generated
LeastSquares_Eigenvalues/housing.npy Least Squares and Computing Eigenvalues Volume 1 Gathered from https://www.fhfa.gov/DataTools/Downloads/Pages/House-Price-Index.aspx
LinearTransformations/horse.npy Linear Transformations Volume 1 Generated
LinkedLists/english.txt Linked Lists Volume 2 Generated
MarkovChains/yoda.txt Markov Chains Volume 2 Gathered from http://www.imsdb.com/scripts/Star-Wars-The-Empire-Strikes-Back.html, http://www.imsdb.com/scripts/Star-Wars-Return-of-the-Jedi.html, http://www.imsdb.com/scripts/Star-Wars-The-Phantom-Menace.html, http://www.imsdb.com/scripts/Star-Wars-Attack-of-the-Clones.html, and http://www.imsdb.com/scripts/Star-Wars-Revenge-of-the-Sith.html
MatplotlibIntro/FARS.npy Introduction to Matplotlib Python Essentials Gathered from https://www.nhtsa.gov/FARS
NearestNeighbor/mnist_subset.npz Nearest Neighbor Search Volume 2 Subset of the MNIST database from http://yann.lecun.com/exdb/mnist/
NumpyIntro/grid.npy Introduction to NumPy Python Essentials https://projecteuler.net/problem=11
PageRank/web_stanford.txt The PageRank Algorithm Volume 1 Subset of web-Stanford.txt from http://snap.stanford.edu/data/web-Stanford.html
PageRank/ncaa2010.csv The PageRank Algorithm Volume 1 Scraped from https://www.sports-reference.com
PageRank/ncaa2011.csv The PageRank Algorithm Volume 1 Scraped from https://www.sports-reference.com
PageRank/ncaa2012.csv The PageRank Algorithm Volume 1 Scraped from https://www.sports-reference.com
PageRank/ncaa2013.csv The PageRank Algorithm Volume 1 Scraped from https://www.sports-reference.com
PageRank/ncaa2014.csv The PageRank Algorithm Volume 1 Scraped from https://www.sports-reference.com
PageRank/ncaa2015.csv The PageRank Algorithm Volume 1 Scraped from https://www.sports-reference.com
PageRank/ncaa2016.csv The PageRank Algorithm Volume 1 Scraped from https://www.sports-reference.com
PageRank/ncaa2017.csv The PageRank Algorithm Volume 1 Scraped from https://www.sports-reference.com
PageRank/top250movies.txt The PageRank Algorithm Volume 1 Subset of movie_data.txt, scraped with imdbpy (https://imdbpy.sourceforge.io/)
Pandas1/crime_data.csv Pandas 1: Introduction Data Science Essentials
Pandas1/final_accidents2.pickle Pandas 1: Introduction Data Science Essentials
Pandas1/final_drivers.pickle Pandas 1: Introduction Data Science Essentials
Pandas2/final_accidents2.pickle Pandas 2: Plotting Data Science Essentials
Pandas2/final_drivers.pickle Pandas 2: Plotting Data Science Essentials
Pandas2/new_york_crime_clean.csv Pandas 2: Plotting Data Science Essentials
Pandas3/Ohio_1999.csv Pandas 3: Grouping Data Science Essentials
Pandas3/time_usage.txt Pandas 3: Grouping Data Science Essentials
Pandas4/DJIA.csv Pandas 4: Time Series Data Science Essentials
Pandas4/finances.csv Pandas 4: Time Series Data Science Essentials
Pandas4/paychecks.csv Pandas 4: Time Series Data Science Essentials
Pandas4/website_traffic.csv Pandas 4: Time Series Data Science Essentials
Pandas5/airports.csv Pandas 5: GeoPandas Data Science Essentials
PolynomialInterpolation/airdata.npy Polynomial Interpolation Volume 2
Profiling/names.txt Profiling Python Essentials https://projecteuler.net/problem=22
Profiling/triangle.txt Profiling Python Essentials https://projecteuler.net/problem=18
Profiling/triangle_large.txt Profiling Python Essentials https://projecteuler.net/problem=67
QuasiNewtonMethods/population.npy Newton and Quasi-Newton Methods Volume 2
RegularExpressions/fake_contacts.txt Regular Expressions Data Science Essentials Generated by http://www.Generatedata.com/
Simplex/productMix.npy Simplex Volume 2
SQL1/student_info.csv SQL 1: Introduction Data Science Essentials Written by hand
SQL1/student_grades.csv SQL 1: Introduction Data Science Essentials Written by hand
SQL1/us_earthquakes.csv SQL 1: Introduction Data Science Essentials
SQL2/students.db SQL 2 (The Sequel) Data Science Essentials Combined from student_info.csv, student_grades.csv, and the other tables in SQL 1
SVD_ImageCompression/hubble.jpg The SVD and Image Compression Volume 1 https://www.nasa.gov/multimedia/imagegallery/image_feature_2099.html
SVD_ImageCompression/hubble_gray.jpg The SVD and Image Compression Volume 1 Modification of hubble.jpg
Wavelets/mandrill.png Introduction to Wavelets Volume 2 http://sipi.usc.edu/database/
Wavelets/woman_darkhair.png Introduction to Wavelets Volume 2
Wavelets/noisy_darkhair.png Introduction to Wavelets Volume 2
Wavelets/uncompressed_finger.png Introduction to Wavelets Volume 2
WebScraping1/example.html Introduction to Beautiful Soup Data Science Essentials https://www.example.com
WebScraping1/san_diego_weather.html Introduction to Beautiful Soup Data Science Essentials
WebScraping1/large_banks_index.html Introduction to Beautiful Soup Data Science Essentials
WebScraping1/large_banks_data.html Introduction to Beautiful Soup Data Science Essentials
WebTechnologies/nyc_traffic.json Web Technologies Data Science Essentials Modified from https://data.cityofnewyork.us/Public-Safety/NYPD-Motor-Vehicle-Collisions/h9gi-nx95, gathered August 2017