alexland
scala, spark, akka dev designing & building distributed systems--data pipelines, machine learning systems, search
Palo Alto, CA
Pinned Repositories
awesome-machine-learning
A curated list of awesome Machine Learning frameworks, libraries and software.
coroutine-based-etl
a lightweight modular framework for data processing using python coroutines
d3-data-adapter
convert graph data from common formats (e.g., adjacency matrix) to the idiosyncratic JSON format used by d3.js
decisionTree-npgv
recursive descent parser based on C4.5, implemented using python + numpy (computation engine) and graphviz (tree layout + rendering)
foldLeft-in-scala
many examples of the foldLeft method in scala
fp-growth-for-frequent-itemsets
compact encoding & very fast retrieval of frequent item sets with FP-Growth
kPCA
several spectral-based matrix decomposition techniques, coded in Python/NumPy, for segmentation/clustering, dimension reduction
similarity-metrics
functions to determine pair-wise "similarity" of two vectors by calculating distance in various metric spaces
alexland's Repositories
alexland/fp-growth-for-frequent-itemsets
compact encoding & very fast retrieval of frequent item sets with FP-Growth
alexland/linear-discriminant-analysis-in-numpy
LDA in numpy (python) with demo code
alexland/akka-stream-graph
minimal app built using aka-streams 1.0 to demonstrate/learn the essential features of the library
alexland/bayesian-bandit-simulator
the computational machinery plus a visualization layer to configure and run simulations of bayesian multi-armed bandit systems
alexland/scala-spark-pipelines
app for distributed ETL & computation pipelines on the scala spark API
alexland/foldLeft-in-scala
many examples of the foldLeft method in scala
alexland/aas
Code to accompany Advanced Analytics with Spark from O'Reilly Media
alexland/akka-concurrency-test
Study on what Threads, ThreadPools, Executors, ExecutorServices, ExecutionContext, Futures are and how to configure them
alexland/columnar-hdfs
hdfs serialized both columnar and row-wise for efficient access by a spark app
alexland/compiled-c-extensions-in-python-libraries
template python packages including cython code, & c code for building python packages that wrap compiled c code
alexland/etl-spark
extract transform load (ETL) pipeline scala library using apache spark
alexland/Fashion-product-attribute-Identification-using-tensorflow
alexland/formalfridayclub
alexland/geographic-maps-in-js
complete web app for interactive, hi-res geographic maps in the browser
alexland/homebrew-geospatial
formulae related to geospatial for the Homebrew package manager http://brew.sh
alexland/homebrew-osgeo4mac
Mac homebrew tap for maintaining a stable work environment for the OSGeo.org geospatial toolset
alexland/interactive-js-plot-templates
interactive plots in javascript that render SVG using d3.js
alexland/matrix-toolkits-java
:rocket: High Performance Linear Algebra OOP
alexland/matrixLib
scala library for creating multi-threaded n-dimensional arrays and using them in element-wise & linear-algebra computation
alexland/recruiting-flow-simulation-engine
a simulator that models the recruiting process and generates data that records this flow--for test fixtures, etc.
alexland/rethinkdb-scala-driver
scala client driver for the rethinkdb database http://www.rethinkdb.com
alexland/sbt-templates
template for building a multi-module (aka multi-project) scala project as an uber jar using sbt
alexland/scala-app-builder
examples of various ways to build & deploy scala apps--with emphasis on the relationship among the VCS, the build tool, and the configuration manager/containerizer
alexland/scalacaster
Purely Functional Algorithms and Data Structures in Scala
alexland/shapeless-for-mortals
:sparkles: Workshop material for the Scala eXchange 2015 talk "Shapeless for Mortals"
alexland/spark-app-builder
utility library to streamline building apps on the Apache Spark api and to automate submitting them to spark cluster
alexland/stackoverflow-datawarehouse-analytics-engine
for the stackoverflow data dump, extracting, transforming, loading the data into a postgresql db, then writing analytics queries against this db
alexland/TestRepo
for our git workshop I
alexland/tiger-geocoder
alexland/ts-ops-scala
manipulation of arrays indexed by date/datetime type (aka "time series")--create, roll up, extract range, etc--in scala