Pinned Repositories
gobblin
A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, organization and lifecycle management for both streaming and batch data ecosystems.
orc
Apache ORC - the smallest, fastest columnar storage for Hadoop workloads
15618-finalProj
azkaban
Azkaban workflow manager.
configuration_comparison
Performance Tuning of SGD-MF for Spark
coral
Coral is a translation, analysis, and query rewrite engine for SQL and other relational languages.
LoggingRepo
Spark experiment Logging repo
spark-1.6.1
Experiment Purpose
coral
Coral is a translation, analysis, and query rewrite engine for SQL and other relational languages.
openhouse
Open Control Plane for Tables in Data Lakehouse
autumnust's Repositories
autumnust/LoggingRepo
Spark experiment Logging repo
autumnust/spark-1.6.1
Experiment Purpose
autumnust/15618-finalProj
autumnust/azkaban
Azkaban workflow manager.
autumnust/configuration_comparison
Performance Tuning of SGD-MF for Spark
autumnust/coral
Coral is a translation, analysis, and query rewrite engine for SQL and other relational languages.
autumnust/course-info
GitHub Repo for http://db.csail.mit.edu/6.830/
autumnust/dagster
A Python library for building data applications: ETL, ML, Data Pipelines, and more.
autumnust/douban-client
Python client library for Douban APIs (OAuth 2.0)
autumnust/gedit-plugins
autumnust/gobblin
Universal data ingestion framework for Hadoop.
autumnust/iceberg
Apache Iceberg
autumnust/iceberg-1
A temporary home for LinkedIn's changes to Apache Iceberg (incubating)
autumnust/iceberg-python
Apache PyIceberg
autumnust/incubator-pinot
Apache Pinot (Incubating) - A realtime distributed OLAP datastore
autumnust/internProjectPoster
Intern project poster and component materials
autumnust/mapreduce-lite
A C++ implementaton of MapReduce without distributed filesystem
autumnust/muduo
A C++ non-blocking network library for multi-threaded server in Linux
autumnust/My-Reading-List
autumnust/openhouse
[Self Working Space] Open Control Plane for Tables in Data Lakehouse
autumnust/orc
Mirror of Apache Orc
autumnust/pinot-bot
Pinot bot
autumnust/playwithmemory
autumnust/scientific-python-lectures
Lectures on scientific computing with python, as IPython notebooks.
autumnust/show-me-the-code
Python 练习册,每天一个小程序
autumnust/spark-core
autumnust/system-design-interview
System design interview for IT company
autumnust/transport
Transportable UDFs
autumnust/trino
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
autumnust/xv6-chinese
中文版的 MIT xv6 文档