Pinned Repositories
data-readings
Reading List in Data Systems
incubator-gluten
Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.
incubator-iceberg
Apache Iceberg (Incubating)
mura
An in-memory SQL Query Compiler built on Apache Arrow
spark
Fork of Apache Spark
sujithjay.github.io
Personal Blog
sujithjay's Repositories
sujithjay/data-readings
Reading List in Data Systems
sujithjay/mura
An in-memory SQL Query Compiler built on Apache Arrow
sujithjay/sujithjay.github.io
Personal Blog
sujithjay/incubator-gluten
Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.
sujithjay/incubator-iceberg
Apache Iceberg (Incubating)
sujithjay/lettuceleaf
A Distributed Task Queue in Java
sujithjay/rabpubsub-subscriber
Subscriber Module of RabPubSub
sujithjay/spark
Fork of Apache Spark
sujithjay/weldj
Java Bindings for Weld
sujithjay/arrow
Apache Arrow is a cross-language development platform for in-memory data. It specifies a standardized language-independent columnar memory format for flat and hierarchical data, organized for efficient analytic operations on modern hardware. It also provides computational libraries and zero-copy streaming messaging and interprocess communication. Languages currently supported include C, C++, Java, JavaScript, Python, and Ruby.
sujithjay/ballista
Distributed compute platform implemented in Rust, using the Apache Arrow memory model.
sujithjay/benchmarks
Benchmarks on Code Snippets
sujithjay/breeze
Breeze is a numerical processing library for Scala.
sujithjay/colabs
https://colab.research.google.com/
sujithjay/datafusion
Apache DataFusion SQL Query Engine
sujithjay/datafusion-comet
Apache DataFusion Comet Spark Accelerator
sujithjay/llama_index
LlamaIndex (GPT Index) is a project that provides a central interface to connect your LLMs with external data.
sujithjay/logseq
A privacy-first, open-source platform for knowledge management and collaboration. Download link: http://github.com/logseq/logseq/releases. Roadmap: http://trello.com/b/8txSM12G/roadmap
sujithjay/nigant
Thesaurus for the Terminal
sujithjay/rabpubsub-publisher
A simple Publisher wrapper for RabbitMQ
sujithjay/raft-rs
Raft distributed consensus algorithm implemented in Rust.
sujithjay/sujithjay
Me
sujithjay/velox
A C++ vectorized database acceleration library aimed at optimizing query engines and data processing systems.
sujithjay/weld
High-performance runtime for data analytics applications