akshayrai's Stars
linkedin/brooklin
An extensible distributed system for reliable nearline data streaming at scale
linkedin/transport
A framework for writing performant user-defined functions (UDFs) that are portable across a variety of engines including Apache Spark, Apache Hive, and Presto.
prakhar1989/awesome-courses
:books: List of awesome university courses for learning Computer Science!
trinodb/trino
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
binhnguyennus/awesome-scalability
The Patterns of Scalable, Reliable, and Performant Large-Scale Systems
donnemartin/system-design-primer
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
eugenp/tutorials
Getting Started with Spring Boot 3:
alibaba-archive/aliyun-oss-hadoop-fs
Hadoop filesystem implementation for Aliyun OSS
apache/gobblin
A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, organization and lifecycle management for both streaming and batch data ecosystems.
etsy/statsd-jvm-profiler
Simple JVM Profiler Using StatsD and Other Metrics Backends
riemann/riemann-jvm-profiler
Sends stacktrace-level performance data from a JVM process to Riemann.
logicalclocks/dr-elephant-chef
Chef cookbook to install Dr Elephant for Hadoop.
codingtony/dr-elephant-docker
Docker files for Linkedin's Dr. Elephant https://github.com/linkedin/dr-elephant
linkedin/dr-elephant
Dr. Elephant is a job and flow-level performance monitoring and tuning tool for Apache Hadoop and Apache Spark
apache/oozie
Mirror of Apache Oozie
linkedin/linkedin-gradle-plugin-for-apache-hadoop