Pinned Repositories
AIT524-DatabaseManagement
Homework Assignments for Database Management Class
association_rule_mining
Simple association rule mining on the MovieLens dataset with Spark and Hadoop
Coursera-ML
Machine Learning course by Stanford University, taught by Andrew Ng. The course covers neural networks, support vector machines, unsupervised learning, anomaly detection, large-scale machine learning, and photo OCR.
cs657_mining_massive_datasets
Homework assignments for CS657 Mining Massive Datasets. Assignments use Spark and Hadoop through the Python API and include word count, association rule mining, linear regression, and recommender systems.
CS688
datasharing
The Leek group guide to data sharing
Drug_Activity_Prediction
NOAA_NDBC_weather_pull
Scripts to pull data from buoys
TimeSeries_Motif_Classification
TimeSeriesClustering_DTW
HW1 for CS 687 - Time series clustering using dynamic time warping (DTW)
sarmstr5's Repositories
sarmstr5/TimeSeriesClustering_DTW
HW1 for CS 687 - Time series clustering using dynamic time warping (DTW)
sarmstr5/Drug_Activity_Prediction
sarmstr5/Coursera-ML
Machine Learning course by Stanford University, taught by Andrew Ng. The course covers neural networks, support vector machines, unsupervised learning, anomaly detection, large-scale machine learning, and photo OCR.
sarmstr5/NOAA_NDBC_weather_pull
Scripts to pull data from buoys
sarmstr5/TimeSeries_Motif_Classification
sarmstr5/AIT524-DatabaseManagement
Homework Assignments for Database Management Class
sarmstr5/association_rule_mining
Simple association rule mining on the MovieLens dataset with Spark and Hadoop
sarmstr5/cs657_mining_massive_datasets
Homework assignments for CS657 Mining Massive Datasets. Assignments use Spark and Hadoop through the Python API and include word count, association rule mining, linear regression, and recommender systems.
sarmstr5/CS688
sarmstr5/datasharing
The Leek group guide to data sharing
sarmstr5/dotfiles
My GNU/Linux Config Files and Bash Scripts
sarmstr5/INFS519-DataStructures
sarmstr5/kaggle_intel_mobleODT_cervix_classification
Code written for the Intel & MobileODT Cervix Classification Competition. Some of the code is specific to the Colfax Cluster.
sarmstr5/KNN_with_amazonreviews
sarmstr5/linear_regression_spark
Linear regression with Spark and Python on the Kaggle New York City Taxi Trip dataset
sarmstr5/misc_scripts
Miscellaneous scripts for installing the software I use on Ubuntu, mainly to keep track of the packages I like.
sarmstr5/Movie_Recommender_System
sarmstr5/ndbc
A Python interface to the National Data Buoy Center
sarmstr5/osint_social
A collection of several hundred online tools for OSINT
sarmstr5/populartimes
sarmstr5/STAT515
sarmstr5/Stock_predictions
sarmstr5/Text_Clustering
sarmstr5/tmux
sarmstr5/topic_modeling
sarmstr5/wordcount
Hadoop wordcount