Not sure if they are awesome, but listing them anyway
Name | Language | Classification | Comment |
---|---|---|---|
Intel® Scalable Dataframe Compiler | Python | Big | claims to "orders of magnitute faster than alternatives like Apache Spark |
Spark | Scala (Main), Python, R, Julia (weak) | Big | De facto industry standard. Basically killed Hadoop |
Dask | Python | Medium/Big | |
disk.frame | R | Medium | soft deprecated |
Husky | C++, Scala (weaker), Python (weaker) | Medium?/Big | |
JuliaDB.jl | Julia | Medium/Big | Can't get it to work for me on the Fannie Mae data |
DataFusion | Rust | Big | Apache Arrow DataFusion and Ballista query engines |
ballista | Rust | Big | Spark but in Rust |
vega | Rust | Big | Another Spark killer in Rust |
vaex | Python | Medium/Big | |
tuplex | Python | Medium/Big | Compiles a subset of Python to machine code if possible. |
nebula | Medium/Big? | seems to be Javascript based | |
arrow | Medium | Has a Dataset API in some implementations e.g. R |
Name | Notes |
---|---|
ROOT |
Quora: What are some credible Apache Spark killers out there? What are their chances of success?