Pinned Repositories
iceberg
Apache Iceberg
algo-class
Exercises and assignments from http://algo-class.org
clj-fu
This repo contains my learnings and experiences with Clojure.
clojure-hadoop
Library to aid writing Hadoop jobs in Clojure.
clojure-orbit
Uncle Bob's Simple Orbital Simulator written in Clojure.
compojure-example
An example Compojure project
cs212
Exercises and assignments from Udacity 212.
every-programmer-should-know
A collection of (mostly) technical things every software developer should know
hdfs-checksum
Utility to calculate the Hadoop checksum of a local file, the plain vanilla checksum of an HDFS file, and block-level checksums of an HDFS file
webhdfs-clj
A lightweight Clojure client for Hadoop WebHDFS Rest API
rdsr's Repositories
rdsr/clj-fu
This repo contains my learnings and experiences with Clojure.
rdsr/airflow
Apache Airflow
rdsr/awesome-data-engineering
A curated list of data engineering tools for software developers
rdsr/Awesome-System-for-Machine-Learning
A curated list of research in machine learning systems (MLSys). Paper notes are also provided.
rdsr/clojure
The Clojure programming language
rdsr/conch
A flexible library for shelling out in Clojure
rdsr/copy-trader
An open-source framework to facilitate copy-trading
rdsr/cs-video-courses
List of Computer Science courses with video lectures.
rdsr/db_tutorial
Writing a sqlite clone from scratch in C
rdsr/delta
An open-source storage layer that brings scalable, ACID transactions to Apache Spark™ and big data workloads.
rdsr/emacs
A dark theme for Emacs
rdsr/flambo
A Clojure DSL for Apache Spark
rdsr/free-programming-books
Freely available programming books
rdsr/fs
File system utilities for Clojure.
rdsr/geni
A Clojure dataframe library that runs on Spark
rdsr/gobblin
A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, organization and lifecycle management for both streaming and batch data ecosystems.
rdsr/gradle-clojure
Clojure and Clojurescript support for Gradle
rdsr/iceberg
Iceberg is a table format for large, slow-moving tabular data
rdsr/incubator-iceberg
Apache Iceberg
rdsr/li-iceberg-rdsr
A temporary home for LinkedIn's changes to Apache Iceberg (incubating)
rdsr/linkedin-orc
LinkedIn's version of Apache ORC
rdsr/peloton
The Self-Driving Database Management System
rdsr/riemann
A network event stream processing system, in Clojure.
rdsr/single-cluster-setup
Automated setup of HDFS, YARN, and Metastore in single-cluster mode with a single command
rdsr/smile
Statistical Machine Intelligence & Learning Engine
rdsr/spark-1
Netflix branches of Apache Spark
rdsr/sparkling
A Clojure library for Apache Spark: fast, fully featured, and developer-friendly
rdsr/t2
A better UDF API for Spark SQL
rdsr/trino
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
rdsr/ud120-projects
Starter project code for students taking Udacity ud120