Pinned Repositories
pmdarima
A statistical library designed to fill the void in Python's time series analysis capabilities, including the equivalent of R's auto.arima function.
clust4j
A suite of classification and clustering algorithm implementations for Java. It includes a number of partitional, hierarchical and density-based algorithms, among them DBSCAN, k-Means, k-Medoids, MeanShift, Affinity Propagation, HDBSCAN and more.
reclab
A practical library for recommenders that wraps the Implicit package, providing various cross-validation (CV) and model selection tools.
skoot
A package for data science practitioners. This library implements a number of helpful, common data transformations with a scikit-learn friendly interface in an effort to expedite the modeling process.
skutil
NOTE: skutil is now deprecated. See its sister project: https://github.com/tgsmith61591/skoot. Original description: A set of scikit-learn and h2o extension classes (as well as caret classes for Python). See more here: https://tgsmith61591.github.io/skutil
smrt
Handle class imbalance intelligently by using variational auto-encoders to generate synthetic observations of your minority class.
TwitterFeedToJson
A simple Python script that writes a user's Twitter feed to a JSON file (using Tweepy) for analysis.
tgsmith61591's Repositories
tgsmith61591/TwitterFeedToJson
A simple Python script that writes a user's Twitter feed to a JSON file (using Tweepy) for analysis.
tgsmith61591/DataTableUtils
A class, "DataTableWritable", that renders a formatted C# DataTable in the console.
tgsmith61591/Apriori
A Python implementation of the Apriori algorithm for finding frequent itemsets and association rules.
tgsmith61591/DeepQA
My TensorFlow implementation of "A neural conversational model", a deep-learning-based chatbot.
tgsmith61591/h2o-3
Open Source Fast Scalable Machine Learning API For Smarter Applications (Deep Learning, Gradient Boosting, Random Forest, Generalized Linear Modeling (Logistic Regression, Elastic Net), K-Means, PCA...)
tgsmith61591/h2o_demo
tgsmith61591/ICML2016
Notes, demos and lessons from ICML 2016.
tgsmith61591/silhouetteR
A function to compute a Euclidean silhouette score to assess clustering efficacy.
tgsmith61591/statsmodels
Statsmodels: statistical modeling and econometrics in Python
tgsmith61591/zathura
A combinatorial game
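As an illustration of the Euclidean silhouette score named in the silhouetteR description above, here is a minimal Python sketch of the standard definition. It is not taken from the silhouetteR repository, which is written in R; the function names `euclidean` and `silhouette_score` are hypothetical and chosen only for this example. For each point, `a` is the mean distance to the other members of its own cluster and `b` is the lowest mean distance to the members of any other cluster; the point's score is `(b - a) / max(a, b)`, and the overall score is the mean over all points.

```python
import math


def euclidean(p, q):
    """Euclidean distance between two equal-length coordinate sequences."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(p, q)))


def silhouette_score(points, labels):
    """Mean silhouette coefficient over all points (hypothetical helper).

    Assumes `points` and `labels` have the same length and that at least
    two distinct cluster labels are present.
    """
    # Group point indices by cluster label.
    clusters = {}
    for idx, label in enumerate(labels):
        clusters.setdefault(label, []).append(idx)
    if len(clusters) < 2:
        raise ValueError("silhouette needs at least two clusters")

    total = 0.0
    for i, label in enumerate(labels):
        own = clusters[label]
        if len(own) == 1:
            continue  # singleton cluster: score is 0 by convention
        # a: mean distance to the other members of the same cluster.
        a = sum(euclidean(points[i], points[j]) for j in own if j != i)
        a /= len(own) - 1
        # b: smallest mean distance to the members of any other cluster.
        b = min(
            sum(euclidean(points[i], points[j]) for j in members) / len(members)
            for other, members in clusters.items()
            if other != label
        )
        denom = max(a, b)
        if denom > 0:  # identical points everywhere contribute 0
            total += (b - a) / denom
    return total / len(points)


if __name__ == "__main__":
    pts = [(0, 0), (0, 1), (10, 0), (10, 1)]
    print(silhouette_score(pts, [0, 0, 1, 1]))  # well-separated clustering
    print(silhouette_score(pts, [0, 1, 0, 1]))  # poor clustering
```

Scores near 1 suggest compact, well-separated clusters, while negative scores suggest points that sit closer to a neighbouring cluster than to their own.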