ytsaurus: A C++ repository from koct9i

YTsaurus

YTsaurus is a distributed storage and processing platform for big data with support for the MapReduce model, a distributed file system, and a NoSQL key-value database.

You can read the post about YTsaurus or watch the video.

Advantages of the platform

Multitenant ecosystem

A set of interrelated subsystems: MapReduce, an SQL query engine, a job scheduler, and a key-value store for OLTP workloads
Support for large numbers of users that eliminates multiple installations and streamlines hardware usage

Reliability and stability

No single point of failure
Automated replication between servers
Updates with no loss of computing progress

Scalability

Up to 1 million CPU cores and thousands of GPUs
Exabytes of data on different media: HDD, SSD, NVMe, RAM
Tens of thousands of nodes
Automated server up- and down-scaling

Rich functionality

Expansive MapReduce module
Distributed ACID transactions
A variety of SDKs and APIs
Secure isolation for compute resources and storage
User-friendly UI

CHYT powered by ClickHouse®

A well-known SQL dialect and familiar functionality
Fast analytic queries
Integration with popular BI solutions via JDBC and ODBC

SPYT powered by Apache Spark

A set of popular tools for writing ETL processes
Launch and support for multiple mini SPYT clusters
Easy migration for ready-made solutions

Getting Started

Try a YTsaurus cluster using Kubernetes, or try our online demo.

How to Build from Source Code

Build from source code.

How to Contribute

We are glad to welcome new contributors!

Please read the contributor's guide and the style guide.
We can accept your work into YTsaurus after you have signed the contributor's license agreement (CLA).
Please don't forget to add a note to your pull request stating that you agree to the terms of the CLA.
