Pinned Repositories
cdata
Higher order fluid or coordinatized data transforms in R. Distributed under choice of GPL-2 or GPL-3 license.
data_algebra
Codd method-chained SQL generator and Pandas data processing in Python.
Examples
Various examples for different articles
PDSwR2
Code, Data, and Examples for Practical Data Science with R 2nd edition (Nina Zumel and John Mount) https://github.com/WinVector/PDSwR2
pyvtreat
vtreat is a data frame processor/conditioner that prepares real-world data for predictive modeling in a statistically sound manner. Distributed under a BSD-3-Clause license.
rquery
Data Wrangling and Query Generating Operators for R. Distributed under choice of GPL-2 or GPL-3 license.
vtreat
vtreat is a data frame processor/conditioner that prepares real-world data for predictive modeling in a statistically sound manner. Distributed under choice of GPL-2 or GPL-3 license.
wrapr
Wrap R for Sweet R Code
WVPlots
Pre-packaged plots in R
zmPDSwR
Example R scripts and data for "Practical Data Science with R" 1st edition by Nina Zumel and John Mount (Manning Publications)
Win Vector LLC's Repositories
WinVector/vtreat
vtreat is a data frame processor/conditioner that prepares real-world data for predictive modeling in a statistically sound manner. Distributed under choice of GPL-2 or GPL-3 license.
WinVector/Examples
Various examples for different articles
WinVector/wrapr
Wrap R for Sweet R Code
WinVector/PDSwR2
Code, Data, and Examples for Practical Data Science with R 2nd edition (Nina Zumel and John Mount) https://github.com/WinVector/PDSwR2
WinVector/data_algebra
Codd method-chained SQL generator and Pandas data processing in Python.
WinVector/pyvtreat
vtreat is a data frame processor/conditioner that prepares real-world data for predictive modeling in a statistically sound manner. Distributed under a BSD-3-Clause license.
WinVector/rquery
Data Wrangling and Query Generating Operators for R. Distributed under choice of GPL-2 or GPL-3 license.
WinVector/WVPlots
Pre-packaged plots in R
WinVector/replyr
Patches for using dplyr with Databases and Big Data
WinVector/seplyr
Improved Standard Evaluation Interfaces for Common Data Manipulation Tasks
WinVector/cdata
Higher order fluid or coordinatized data transforms in R. Distributed under choice of GPL-2 or GPL-3 license.
WinVector/rqdatatable
Implement the rquery piped query algebra in R using data.table. Distributed under choice of GPL-2 or GPL-3 license.
WinVector/Logistic
Experimental logistic regression code supporting multiple result categories, many levels of categorical modeling variables, good optimization, L2 regularization and more.
WinVector/addinexamplesWV
Add-ins and keyboard shortcuts for building calculation pipelines in R
WinVector/sigr
Concise formatting of significances in R (GPL3 license).
WinVector/ExploreModels
Code and data for "The Geometry of Classifiers"
WinVector/WinVector.github.io
Viewable pages from Win Vector LLC; view at: http://winvector.github.io
WinVector/WVLPSolver
Experimental pure Java revised simplex linear program solver (Apache 2.0 license)
WinVector/Locality-Sensitive-Hashing-Example
Simple example of Locality Sensitive Hashing
WinVector/RcppDynProg
Dynamic Programming implemented in Rcpp. Includes example partition and out-of-sample fitting applications.
WinVector/wvpy
Tools to convert Jupyter notebooks to and from Python .py files, and to render them.
WinVector/ExampleRPackage
Example of how to build a simple R package
WinVector/Importance-Sampling
Importance Sampling Example
WinVector/LStep
Trivial demonstration of a diverging Newton-Raphson step when solving a logistic regression
WinVector/OutOfCore
Example of out-of-core coding techniques
WinVector/ATasteOfDataScience
Working an example of supervised machine learning in Python
WinVector/ExperimentInspector
Java code to build synthetic data sets that match reported summary totals. Helps explore possible range of variation.
WinVector/SessionExample
Example code for articles on sessionizing data.
WinVector/wvu
Win Vector LLC Python data science teaching tools (graphs and data manipulation)
WinVector/TypicalityCoding
Simple example of how to use an embedding plus sphering/whitening transform to measure difference in distribution.