/clojure-dataframe_comparison

Clojure libraries for data science

A comparison of different libraries for dataframe manipulation and big data analytics in Clojure

List of libraries

Library          Description
tech.ml.dataset  For data processing and machine learning
Geni             Dataframe library that runs on Apache Spark
Onyx             High-performance distributed computation system

Tutorials for each library

tech.ml.dataset

Geni

Onyx

Performance benchmarks

  • Rankings of dataframe libraries across Python, Julia, and Clojure

Useful tech stack

Data format framework

Streaming framework