export GRADLE_HOME=/{path to your gradle}/gradle
export JAVA_HOME=/{path to your java}/java-8-oracle
export PATH=$PATH:$GRADLE_HOME/bin:$JAVA_HOME/bin
Simple code
Basic idea get a bunch of data and label it, ie add a binary classification {0,1}
make some statistical assumptions D(x,y) -> N(mu, theta) -> N(0,1)
create a model (train)
if your data is not numerical convert it to a numerical representation (hash) fit some other data and based on your model, and using some associated statistical test based on your assumptions make a decision about the data point
Spark makes this easy for you with their Data Pipelines see this in com.logistic.BinClassRunner
Next up Clustering