/dtsa-5504

Programming coursework for DTSA 5504, Data Mining Pipeline

Primary LanguagePython

DTSA-5504, The Data Mining Pipeline

University of Colorado, Boulder Master of Science in Data Science program

Programming coursework for DTSA 5504, Data Mining Pipeline

Week 1, Understanding Data

Problem 1 - summary_stats.py

Week 3, Data Preprocessing

Problem 1A - normalization.py

  • Write a function that takes in three arguments--a file name, attribute name, and normalization method--and returns a dictionary where the key is the original data vale and the value is the normalized data value.
  • The function should perform min-max or z-score normalization