Pinned Repositories
alexa_skills
BayesianAstronomy
Bayesian Methods in Astronomy workshop, presented at AAS227
BayesMadeSimple
Code for a tutorial on Bayesian Statistics by Allen Downey.
Cleaning-Titanic-Data
One of the most popular starter data sets in data science, the Titanic data set. This is a data set that records various attributes of passengers on the Titanic, including who survived and who didn’t. Here I have detected some missing value, replace the missing values and also create new values added to the dataset. There are two csv files, first one is titanic_original.csv and second one is tatanic_clean.csv. Second csv is generated from the R code, called 'titanic.r' here. Have fun.
MLLytics
A library of tools for easier evaluation of ML models.
SparkLytics
A library of useful functions to help with ML analysis with PySpark and MLLib.
scottclay's Repositories
scottclay/alexa_skills
scottclay/BayesianAstronomy
Bayesian Methods in Astronomy workshop, presented at AAS227
scottclay/BayesMadeSimple
Code for a tutorial on Bayesian Statistics by Allen Downey.
scottclay/Cleaning-Titanic-Data
One of the most popular starter data sets in data science, the Titanic data set. This is a data set that records various attributes of passengers on the Titanic, including who survived and who didn’t. Here I have detected some missing value, replace the missing values and also create new values added to the dataset. There are two csv files, first one is titanic_original.csv and second one is tatanic_clean.csv. Second csv is generated from the R code, called 'titanic.r' here. Have fun.
scottclay/Lgalaxies_Analysis
Data analysis pipeline for running the detailed dust model of the L-Galaxies semi-analytic model
scottclay/Lgalaxies_Dust
Implementation of detailed dust modelling into the Henriques 2015 version of L-Galaxies
scottclay/MLLytics
A library of tools for easier evaluation of ML models.
scottclay/SparkLytics
A library of useful functions to help with ML analysis with PySpark and MLLib.
scottclay/Crash_Detection
scottclay/Credit-Card-Fraud-Detection-using-Autoencoders-in-Keras
iPython notebook and pre-trained model that shows how to build deep Autoencoder in Keras for Anomaly Detection in credit card transactions data
scottclay/dataworks-githooks
Repo for holding DataWorks Git hooks
scottclay/graphistry-notebooks
Custom Jupyter notebooks to integrate different data sources with the Graphistry API
scottclay/Lgalaxies_HDF5
scottclay/Machine-Learning-Malware-Detection
Attempt to use the machine learning workflow to process and transform sampled PE file data to create a prediction model.
scottclay/malware-prediction-rnn
RNN implementation with Keras for machine activity data to predict malware
scottclay/malware_api_class
Malware dataset for security researchers, data scientists. Public malware dataset generated by Cuckoo Sandbox based on Windows OS API calls analysis for cyber security researchers
scottclay/malwaredatascience
scottclay/misc_code
scottclay/python-docs-hello-world
A simple python application for docs
scottclay/retail_sales
scottclay/sans-indexes
Indexes for SANS Courses and GIAC Certifications
scottclay/Statistics
scottclay/terraform-terragrunt-terratest-example
Examples of using Terraform with Terragrunt and Terratest
scottclay/tf_gpu_docker
scottclay/virus-mnist
scottclay/Web_Crawler
scottclay/widgets-tutorial
A tutorial for widgets