Pinned Repositories
spark
Apache Spark - A unified analytics engine for large-scale data processing
delta
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
adhocnetlib
CS270 Project
Documents
spark
Mirror of Apache Spark
spark-perf
Performance tests for Spark
spark-streaming-benchmark
spark-streaming-external-projects
unitycatalog
Open, Multi-modal Catalog for Data & AI
tdas's Repositories
tdas/spark-streaming-external-projects
tdas/spark-streaming-benchmark
tdas/spark
Mirror of Apache Spark
tdas/spark-perf
Performance tests for Spark
tdas/connectors
Connectors for Delta Lake
tdas/AsciiDoc
AsciiDoc Package for Sublime Text2
tdas/cannycare
tdas/copybara
Copybara: A tool for transforming and moving code between repositories.
tdas/delta
An open-source storage layer that brings scalable, ACID transactions to Apache Spark™ and big data workloads.
tdas/delta-connectors
tdas/hadoop
Apache Hadoop
tdas/incubator-spark
Mirror of Apache Spark
tdas/kafka
Mirror of Apache Kafka
tdas/lhbench
A benchmark comparison of lakehouse systems
tdas/logcollection
tdas/random-stuff
tdas/rocksdb
A library that provides an embeddable, persistent key-value store for fast storage.
tdas/Settings
Configurations files, dot files
tdas/spark-ec2
Scripts used to setup a Spark cluster on EC2
tdas/spark-github-shim
A nicer UI for browsing Spark pull requests
tdas/spark-perf-old
Performance tests for Spark, Shark, etc.
tdas/spark-test
tdas/spark-test-failures
tdas/spark-tests
tdas/spark-utils
Various development utilities created when hacking on Spark
tdas/spark-website
Mirror of Apache Spark Website
tdas/StreamingTest
tdas/trino
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
tdas/unitycatalog
Open, Multi-modal Catalog for Data & AI
tdas/website
Delta Lake Website