These data files correspond to the Foundations of Applied Mathematics lab curriculum.
Instead of downloading or cloning this repository directly, use the download_data.sh
script included in the Student-Materials repository.
$ cd /path/to/folder
$ bash download_data.sh
The download requires git, which you can download at https://git-scm.com/downloads.
Below the data files are listed by volume (in the order given by each table of contents), then alphabetically by lab folder.
Labs by Volume
Lab Title | Folder/Data File | Source |
---|---|---|
Regular Expressions | RegularExpressions/fake_contacts.txt | Generated by http://www.Generatedata.com/ |
SQL 1: Introduction | SQL1/student_info.csv | Written by hand |
SQL 1: Introduction | SQL1/student_grades.csv | Written by hand |
SQL 1: Introduction | SQL1/us_earthquakes.csv | |
SQL 2 (The Sequel) | SQL2/students.db | Combined from student_info.csv , student_grades.csv , and the other tables in SQL 1 |
Web Technologies | WebTechnologies/nyc_traffic.json | Modified from https://data.cityofnewyork.us/Public-Safety/NYPD-Motor-Vehicle-Collisions/h9gi-nx95, gathered August 2017 |
Introduction to Beautiful Soup | WebScraping1/example.html | https://www.example.com |
Introduction to Beautiful Soup | WebScraping1/san_diego_weather.html | |
Introduction to Beautiful Soup | WebScraping1/large_banks_index.html | |
Introduction to Beautiful Soup | WebScraping1/large_banks_data.html | |
Pandas 1: Introduction | Data Science Essentials | Pandas1/crime_data.csv |
Pandas 1: Introduction | Pandas1/final_accidents2.pickle | |
Pandas 1: Introduction | Pandas1/final_drivers.pickle | |
Pandas 2: Plotting | Pandas2/final_accidents2.pickle | |
Pandas 2: Plotting | Pandas2/final_drivers.pickle | |
Pandas 2: Plotting | Pandas2/new_york_crime_clean.csv | |
Pandas 3: Grouping | Pandas3/Ohio_1999.csv | |
Pandas 3: Grouping | Pandas3/time_usage.txt | |
Pandas 4: Time Series | Pandas4/DJIA.csv | |
Pandas 4: Time Series | Pandas4/finances.csv | |
Pandas 4: Time Series | Pandas4/paychecks.csv | |
Pandas 4: Time Series | Pandas4/website_traffic.csv | |
Pandas 5: GeoPandas | Pandas5/airports.csv | |
Data Augmentation | DataAugmentation/chopin.wav | |
Data Augmentation | DataAugmentation/creditcard.npy | https://www.kaggle.com/mlg-ulb/creditcardfraud |
Data Augmentation | DataAugmentation/lion.png | |
Data Augmentation | DataAugmentation/music.zip | U. Marchand, G. Peeters, "The Extended Ballroom Dataset", in ISMIR 2016 Late-Breaking Session, New-York, USA. (http://anasynth.ircam.fr/home/media/ExtendedBallroom) |
Data Augmentation | DataAugmentation/restaurant-ambience.wav | |
Data Augmentation | DataAugmentation/styles.npy | U. Marchand, G. Peeters, "The Extended Ballroom Dataset", in ISMIR 2016 Late-Breaking Session, New-York, USA. (http://anasynth.ircam.fr/home/media/ExtendedBallroom) |
Labs by Folder Name
Folder/Data File | Lab Title | Volume | Source |
---|---|---|---|
BinaryTrees/english.txt | Binary Search Trees | Volume 2 | Generated |
BreadthFirstSearch/movieData.txt | Breadt-first Search | Volume 2 | Scraped with imdbpy (https://imdbpy.sourceforge.io/) |
CVXOPT_Intro/ForestData.npy | CVXOPT | Volume 2 | |
Conditioning_Stability/stability_data.npy | Conditioning and Stability | Volume 1 | Generated |
DataAugmentation/chopin.wav | Data Augmentation | Data Science Essentials | |
DataAugmentation/creditcard.npy | Data Augmentation | Data Science Essentials | (https://www.kaggle.com/mlg-ulb/creditcardfraud) |
DataAugmentation/lion.png | Data Augmentation | Data Science Essentials | |
DataAugmentation/music.zip | Data Augmentation | Data Science Essentials | U. Marchand, G. Peeters, "The Extended Ballroom Dataset", in ISMIR 2016 Late-Breaking Session, New-York, USA. (http://anasynth.ircam.fr/home/media/ExtendedBallroom) |
DataAugmentation/restaurant-ambience.wav | Data Augmentation | Data Science Essentials | |
DataAugmentation/styles.npy | Data Augmentation | Data Science Essentials | U. Marchand, G. Peeters, "The Extended Ballroom Dataset", in ISMIR 2016 Late-Breaking Session, New-York, USA. (http://anasynth.ircam.fr/home/media/ExtendedBallroom) |
DataVisualization/anscombe.npy | Data Visualization | Python Essentials | https://en.wikipedia.org/wiki/Anscombe's_quartet. Original citation: Anscombe, F. J. (1973). "Graphs in Statistical Analysis". American Statistician. 27 (1): 17–21. JSTOR 2682899 |
DataVisualization/MLB.npy | Data Visualization | Python Essentials | http://wiki.stat.ucla.edu/socr/index.php/SOCR_Data_MLB_HeightsWeights (modified) |
DataVisualization/earthquakes.npy | Data Visualization | Python Essentials | Gathered from https://earthquake.usgs.gov/earthquakes/search/ |
DataVisualization/countries.npy | Data Visualization | Python Essentials | Combined from https://en.wikipedia.org/wiki/List_of_countries_by_GDP_(nominal), http://www.averageheight.co/, and https://en.wikipedia.org/wiki/List_of_countries_and_dependencies_by_population |
Differentiation/plane.npy | Differentiation | Volume 1 | Generated |
DrazinInverse/social_network.csv | The Drazin Inverse | Volume 1 | Adapted from https://en.wikipedia.org/wiki/Zachary%27s_karate_club |
Exceptions_FileIO/hello_world.txt | Exceptions and File Input/Output | Python Essentials | Written by hand |
Exceptions_FileIO/cf_example1.txt | Exceptions and File Input/Output | Python Essentials | Written by hand |
Exceptions_FileIO/cf_example2.txt | Exceptions and File Input/Output | Python Essentials | Written by hand |
FacialRecognition/faces94.zip | Facial Recognition | Volume 1 | http://cswww.essex.ac.uk/mv/allfaces/faces94.html |
FourierTransform/tada.wav | The Discrete Fourier Transform | Volume 2 | https://www.youtube.com/watch?v=bjxf-eQWKoo |
FourierTransform/mystery_chord.wav | The Discrete Fourier Transform | Volume 2 | Generated |
FourierTransform/CGC.wav | The Discrete Fourier Transform | Volume 2 | Generated |
FourierTransform/GCG.wav | The Discrete Fourier Transform | Volume 2 | Generated |
FourierTransform/balloon.wav | The Discrete Fourier Transform | Volume 2 | Recorded at BYU |
FourierTransform/chopin.wav | The Discrete Fourier Transform | Volume 2 | |
FourierTransform/noisy1.wav | The Discrete Fourier Transform | Volume 2 | |
FourierTransform/noisy2.wav | The Discrete Fourier Transform | Volume 2 | |
FourierTransform/vuvuzela.wav | The Discrete Fourier Transform | Volume 2 | Part of https://www.youtube.com/watch?v=g_0NoBKWCT8 |
FourierTransform/noisy_face.png | The Discrete Fourier Transform | Volume 2 | Sample from faces94.zip |
FourierTransform/license_plate.png | The Discrete Fourier Transform | Volume 2 | |
GradientMethods/linregression.txt | Gradient Descent Methods | Volume 2 | |
GradientMethods/challenger.npy | Gradient Descent Methods | Volume 2 | |
ImageSegmentation/dream.png | Image Segmentation | Volume 1 | |
ImageSegmentation/dream_gray.png | Image Segmentation | Volume 1 | |
InteriorPoint_Linear/simdata.txt | Interior Point 1: Linear Programs | Volume 2 | |
InteriorPoint_Quadratic/portfolio.txt | Interior Point 2: Quadratic Programs | Volume 2 | |
LeastSquares_Eigenvalues/circle.npy | Least Squares and Computing Eigenvalues | Volume 1 | Generated |
LeastSquares_Eigenvalues/ellipse.npy | Least Squares and Computing Eigenvalues | Volume 1 | Generated |
LeastSquares_Eigenvalues/housing.npy | Least Squares and Computing Eigenvalues | Volume 1 | Gathered from https://www.fhfa.gov/DataTools/Downloads/Pages/House-Price-Index.aspx |
LinearTransformations/horse.npy | Linear Transformations | Volume 1 | Generated |
LinkedLists/english.txt | Linked Lists | Volume 2 | Generated |
MarkovChains/yoda.txt | Markov Chains | Volume 2 | Gathered from http://www.imsdb.com/scripts/Star-Wars-The-Empire-Strikes-Back.html, http://www.imsdb.com/scripts/Star-Wars-Return-of-the-Jedi.html, http://www.imsdb.com/scripts/Star-Wars-The-Phantom-Menace.html, http://www.imsdb.com/scripts/Star-Wars-Attack-of-the-Clones.html, and http://www.imsdb.com/scripts/Star-Wars-Revenge-of-the-Sith.html |
MatplotlibIntro/FARS.npy | Introduction to Matplotlib | Python Essentials | Gathered from https://www.nhtsa.gov/FARS |
NearestNeighbor/mnist_subset.npz | Nearest Neighbor Search | Volume 2 | Subset of the MNIST database from http://yann.lecun.com/exdb/mnist/ |
NumpyIntro/grid.npy | Introduction to NumPy | Python Essentials | https://projecteuler.net/problem=11 |
PageRank/web_stanford.txt | The PageRank Algorithm | Volume 1 | Subset of web-Stanford.txt from http://snap.stanford.edu/data/web-Stanford.html |
PageRank/ncaa2010.csv | The PageRank Algorithm | Volume 1 | Scraped from https://www.sports-reference.com |
PageRank/ncaa2011.csv | The PageRank Algorithm | Volume 1 | Scraped from https://www.sports-reference.com |
PageRank/ncaa2012.csv | The PageRank Algorithm | Volume 1 | Scraped from https://www.sports-reference.com |
PageRank/ncaa2013.csv | The PageRank Algorithm | Volume 1 | Scraped from https://www.sports-reference.com |
PageRank/ncaa2014.csv | The PageRank Algorithm | Volume 1 | Scraped from https://www.sports-reference.com |
PageRank/ncaa2015.csv | The PageRank Algorithm | Volume 1 | Scraped from https://www.sports-reference.com |
PageRank/ncaa2016.csv | The PageRank Algorithm | Volume 1 | Scraped from https://www.sports-reference.com |
PageRank/ncaa2017.csv | The PageRank Algorithm | Volume 1 | Scraped from https://www.sports-reference.com |
PageRank/top250movies.txt | The PageRank Algorithm | Volume 1 | Subset of movie_data.txt , scraped with imdbpy (https://imdbpy.sourceforge.io/) |
Pandas1/crime_data.csv | Pandas 1: Introduction | Data Science Essentials | |
Pandas1/final_accidents2.pickle | Pandas 1: Introduction | Data Science Essentials | |
Pandas1/final_drivers.pickle | Pandas 1: Introduction | Data Science Essentials | |
Pandas2/final_accidents2.pickle | Pandas 2: Plotting | Data Science Essentials | |
Pandas2/final_drivers.pickle | Pandas 2: Plotting | Data Science Essentials | |
Pandas2/new_york_crime_clean.csv | Pandas 2: Plotting | Data Science Essentials | |
Pandas3/Ohio_1999.csv | Pandas 3: Grouping | Data Science Essentials | |
Pandas3/time_usage.txt | Pandas 3: Grouping | Data Science Essentials | |
Pandas4/DJIA.csv | Pandas 4: Time Series | Data Science Essentials | |
Pandas4/finances.csv | Pandas 4: Time Series | Data Science Essentials | |
Pandas4/paychecks.csv | Pandas 4: Time Series | Data Science Essentials | |
Pandas4/website_traffic.csv | Pandas 4: Time Series | Data Science Essentials | |
Pandas5/airports.csv | Pandas 5: GeoPandas | Data Science Essentials | |
PolynomialInterpolation/airdata.npy | Polynomial Interpolation | Volume 2 | |
Profiling/names.txt | Profiling | Python Essentials | https://projecteuler.net/problem=22 |
Profiling/triangle.txt | Profiling | Python Essentials | https://projecteuler.net/problem=18 |
Profiling/triangle_large.txt | Profiling | Python Essentials | https://projecteuler.net/problem=67 |
QuasiNewtonMethods/population.npy | Newton and Quasi-Newton Methods | Volume 2 | |
RegularExpressions/fake_contacts.txt | Regular Expressions | Data Science Essentials | Generated by http://www.Generatedata.com/ |
Simplex/productMix.npy | Simplex | Volume 2 | |
SQL1/student_info.csv | SQL 1: Introduction | Data Science Essentials | Written by hand |
SQL1/student_grades.csv | SQL 1: Introduction | Data Science Essentials | Written by hand |
SQL1/us_earthquakes.csv | SQL 1: Introduction | Data Science Essentials | |
SQL2/students.db | SQL 2 (The Sequel) | Data Science Essentials | Combined from student_info.csv , student_grades.csv , and the other tables in SQL 1 |
SVD_ImageCompression/hubble.jpg | The SVD and Image Compression | Volume 1 | https://www.nasa.gov/multimedia/imagegallery/image_feature_2099.html |
SVD_ImageCompression/hubble_gray.jpg | The SVD and Image Compression | Volume 1 | Modification of hubble.jpg |
Wavelets/mandrill.png | Introduction to Wavelets | Volume 2 | http://sipi.usc.edu/database/ |
Wavelets/woman_darkhair.png | Introduction to Wavelets | Volume 2 | |
Wavelets/noisy_darkhair.png | Introduction to Wavelets | Volume 2 | |
Wavelets/uncompressed_finger.png | Introduction to Wavelets | Volume 2 | |
WebScraping1/example.html | Introduction to Beautiful Soup | Data Science Essentials | https://www.example.com |
WebScraping1/san_diego_weather.html | Introduction to Beautiful Soup | Data Science Essentials | |
WebScraping1/large_banks_index.html | Introduction to Beautiful Soup | Data Science Essentials | |
WebScraping1/large_banks_data.html | Introduction to Beautiful Soup | Data Science Essentials | |
WebTechnologies/nyc_traffic.json | Web Technologies | Data Science Essentials | Modified from https://data.cityofnewyork.us/Public-Safety/NYPD-Motor-Vehicle-Collisions/h9gi-nx95, gathered August 2017 |