imamitjain's Stars
apache/datafusion
Apache DataFusion SQL Query Engine
pingcap/talent-plan
Open source training courses about distributed databases and distributed systems
lancedb/lance
Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, with more integrations coming.
linkedin/kafka-monitor
Xinfra Monitor monitors the availability of Kafka clusters by producing synthetic workloads using end-to-end pipelines to obtain derived vital statistics - E2E latency, service produce/consume availability, offsets commit availability & latency, message loss rate and more.
ByteByteGoHq/system-design-101
Explain complex systems using visuals and simple terms. Help you prepare for system design interviews.
pola-rs/polars
Dataframes powered by a multithreaded, vectorized query engine, written in Rust
facebookincubator/velox
A C++ vectorized database acceleration library aimed at optimizing query engines and data processing systems.
lancedb/lancedb
Developer-friendly, serverless vector database for AI applications. Easily add long-term memory to your LLM apps!
AssemblyAI-Community/ML-Study-Guide
Minimal Machine Learning Study Plan
pct960/Competitive_Programming
A handy collection of implemented data structures and algorithms for competitive coding contests
gkhayes/mlrose
Python package for implementing a number of Machine Learning, Randomized Optimization and Search algorithms.
linkedin/coral
Coral is a translation, analysis, and query rewrite engine for SQL and other relational languages.
apache/doris
Apache Doris is an easy-to-use, high performance and unified analytics database.
yangshun/tech-interview-handbook
💯 Curated coding interview preparation materials for busy software engineers
tigerbeetle/tigerbeetle
The financial transactions database designed for mission critical safety and performance.
openobserve/openobserve
🚀 10x easier, 🚀 140x lower storage cost, 🚀 high performance, 🚀 petabyte scale - Elasticsearch/Splunk/Datadog alternative for 🚀 (logs, metrics, traces, RUM, Error tracking, Session replay).
qdrant/qdrant
Qdrant - High-performance, massive-scale Vector Database for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/
bigcapitalhq/bigcapital
💵 Bigcapital is financial accounting with intelligent reporting for faster decision-making, an open-source alternative to Quickbooks, Xero, etc.
ArroyoSystems/arroyo
Distributed stream processing engine in Rust
madhuakula/kubernetes-goat
Kubernetes Goat is a "Vulnerable by Design" cluster environment to learn and practice Kubernetes security using an interactive hands-on playground 🚀
mbrooker/simulator_example
Small numerical simulator example
iamadamdev/bypass-paywalls-chrome
Bypass Paywalls web browser extension for Chrome and Firefox.
facebookexperimental/reverie
An ergonomic and safe syscall interception framework for Linux.
inkandswitch/peritext
A CRDT for asynchronous rich-text collaboration, where authors can work independently and then merge their changes.
darrenburns/shira
The Python inspector 🔍
ray-project/ray_beam_runner
Ray-based Apache Beam runner
amzn/amazon-ray
Staging area for ongoing enhancements to Ray focused on improving integration with AWS and other Amazon technologies.
fullstorydev/grpcurl
Like cURL, but for gRPC: Command-line tool for interacting with gRPC servers
emichael/dslabs
Distributed Systems Labs and Framework
mitdbg/deneva
Deneva is a distributed in-memory database framework that supports the evaluation of various concurrency control algorithms.