Pinned Repositories
hudi
Upserts, Deletes And Incremental Processing on Big Data.
hudi
Spark Library for Hadoop Upserts And Incrementals
hudi_shopping_cart_demo
kv-perf-eval-tool
Performance testing framework for key-value datastores
rocksdb-jna
JNA (Not JNI) Wrapper around rocksdb
spark
Mirror of Apache Spark
voldemort
An open source clone of Amazon's Dynamo.
vinothchandar's Repositories
vinothchandar/hudi
Spark Library for Hadoop Upserts And Incrementals
vinothchandar/hudi_shopping_cart_demo
vinothchandar/8l-netlify-site
vinothchandar/aresdb
A GPU-powered real-time analytics storage and query engine.
vinothchandar/arrow
Apache Arrow is a multi-language toolbox for accelerated data interchange and in-memory processing
vinothchandar/async-profiler
Sampling CPU and HEAP profiler for Java featuring AsyncGetCallTrace + perf_events
vinothchandar/awesome-anki-vector
Anki Vector AI++
vinothchandar/delta
An open-source storage layer that brings scalable, ACID transactions to Apache Spark™ and big data workloads.
vinothchandar/FlameGraph
Stack trace visualizer
vinothchandar/gobblin
vinothchandar/hudipoc
vinothchandar/hyperspace
An open source indexing subsystem that brings index-based query acceleration to Apache Spark™ and big data workloads.
vinothchandar/incubator-hudi-site
Apache hudi
vinothchandar/kafka
Mirror of Apache Kafka
vinothchandar/kafka-site
Mirror of Apache Kafka site
vinothchandar/kafka-streams-examples
Demo applications and code examples for Apache Kafka's Streams API.
vinothchandar/ksql
KSQL - the Streaming SQL Engine for Apache Kafka
vinothchandar/marmaray
Marmaray
vinothchandar/og-aws
📙 Amazon Web Services — a practical guide
vinothchandar/openmessaging-benchmark
OpenMessaging Benchmark Framework
vinothchandar/petastorm
Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as Tensorflow, Pytorch, and PySpark and can be used from pure Python code.
vinothchandar/presto
Distributed SQL query engine for running interactive analytic queries against big data sources.
vinothchandar/pulsar-spark
When Apache Pulsar meets Apache Spark
vinothchandar/querybook
Querybook is a Big Data Querying UI, combining collocated table metadata and a simple notebook interface.
vinothchandar/spark-sql-perf
vinothchandar/starrocks
StarRocks, a Linux Foundation project, is a next-generation sub-second MPP OLAP database for full analytics scenarios, including multi-dimensional analytics, real-time analytics, and ad-hoc queries. InfoWorld’s 2023 BOSSIE Award for best open source software.
vinothchandar/trino
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
vinothchandar/vector-python-sdk
Anki Vector Python SDK
vinothchandar/velox
A C++ vectorized database acceleration library aimed to optimizing query engines and data processing systems.
vinothchandar/wrk
Modern HTTP benchmarking tool