clayms's Stars
apache/spark
Apache Spark - A unified analytics engine for large-scale data processing
logseq/logseq
A privacy-first, open-source platform for knowledge management and collaboration. Download link: http://github.com/logseq/logseq/releases. roadmap: http://trello.com/b/8txSM12G/roadmap
pola-rs/polars
Dataframes powered by a multithreaded, vectorized query engine, written in Rust
delta-io/delta
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
xflr6/graphviz
Simple Python interface for Graphviz
blaze-init/blaze
Blazing-fast query execution engine speaks Apache Spark language and has Arrow-DataFusion at its core.
lrlna/smol-zines
sketching out concepts one 📝 at a time
cdfmlr/pyflowchart
Python codes to Flowcharts
intel-analytics/BigDL-Tutorials
Step-by-step Deep Leaning Tutorials on Apache Spark using BigDL
neo4j-product-examples/graph-machine-learning-examples
Neo4j Graph Data Science with Graph ML & GNNs
xtream1101/s3-concat
Concat multiple files in s3
rchiodo/vscode-python
Python extension for Visual Studio Code