Pinned Repositories
actordb
ActorDB distributed SQL database
aerospike-server
Aerospike Database Server – flash-optimized, in-memory, nosql database
agatedb
A persistent key-value storage in rust.
agensgraph
AgensGraph, a transactional graph database based on PostgreSQL
airbyte
Airbyte is an open-source EL(T) platform that helps you replicate your data in your warehouses, lakes and databases.
airflow
Apache Airflow
akhq
Kafka GUI for Apache Kafka to manage topics, topics data, consumers group, schema registry, connect and more...
MeiliSearch
Lightning Fast, Ultra Relevant, and Typo-Tolerant Search Engine
spark-doc-zh
Apache Spark 官方文档中文版
sqllineage
SQL Lineage Analysis Tool powered by Python
awesomeDataTool's Repositories
awesomeDataTool/arangodb
🥑 ArangoDB is a native multi-model database with flexible data models for documents, graphs, and key-values. Build high performance applications using a convenient SQL-like query language or JavaScript extensions.
awesomeDataTool/arvados
An open source platform for managing and analyzing biomedical big data
awesomeDataTool/awesome-data-catalogs
📙 Awesome Data Catalogs and Observability Platforms.
awesomeDataTool/bitsail
BitSail is a distributed high-performance data integration engine which supports batch, streaming and incremental scenarios. BitSail is widely used to synchronize hundreds of trillions of data every day.
awesomeDataTool/blazingsql
BlazingSQL is a lightweight, GPU accelerated, SQL engine for Python. Built on RAPIDS cuDF.
awesomeDataTool/dlink
Dinky is an out of the box one-stop real-time computing platform dedicated to the construction and practice of Unified Batch & Streaming and Unified Data Lake & Data Warehouse. Based on Apache Flink, Dinky provides the ability to connect many big data frameworks including OLAP and Data Lake.
awesomeDataTool/dremio-oss
Dremio - the missing link in modern data
awesomeDataTool/duckdb
DuckDB is an in-process SQL OLAP Database Management System
awesomeDataTool/EventHub
An open source event analytics platform
awesomeDataTool/FastCFS
A high performance distributed file system which can be used as the back-end storage of databases, K8s and VM etc.
awesomeDataTool/Firestorm
Firestorm is a Remote Shuffle Service, and provides the capability for Apache Spark and Apache Hadoop MapReduce applications to store shuffle data on remote servers
awesomeDataTool/galaxyengine
GalaxyEngine is a MySQL branch originated from Alibaba Group, especially supports large-scale distributed database system.
awesomeDataTool/galaxysql
PolarDB-X is a cloud native distributed SQL Database designed for high concurrency, massive storage, complex querying scenarios.
awesomeDataTool/hera
hera 分布式任务调度系统 大数据任务调度系统 任务调度 (数据部门专用)
awesomeDataTool/incubator-streampark
StreamPark, Make stream processing easier! easy-to-use streaming application development framework and operation platform
awesomeDataTool/KnowStreaming
一站式云原生流数据管控平台,通过0侵入、插件化构建企业级Kafka服务,极大降低操作、存储和管理实时流数据门槛
awesomeDataTool/manticoresearch
Easy to use open source fast database for search | Good alternative to Elasticsearch now | Drop-in replacement for E in the ELK soon
awesomeDataTool/matrixone
Hyperconverged cloud-edge native database
awesomeDataTool/nitrite-java
Java embedded nosql document store
awesomeDataTool/pebble
RocksDB/LevelDB inspired key-value database in Go
awesomeDataTool/pgmodeler
Open-source data modeling tool designed for PostgreSQL. No more typing DDL commands. Let pgModeler do the work for you!
awesomeDataTool/PolarDB-FileSystem
awesomeDataTool/surrealdb
A scalable, distributed, collaborative, document-graph database, for the realtime web
awesomeDataTool/terarkdb
A RocksDB compatible KV storage engine with better performance
awesomeDataTool/usql
Universal command-line interface for SQL databases
awesomeDataTool/v6d
vineyard (v6d): an in-memory immutable data manager. (Project under CNCF)
awesomeDataTool/vaex
Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second 🚀
awesomeDataTool/vega
A new arguably faster implementation of Apache Spark from scratch in Rust
awesomeDataTool/xenon
The MySQL Cluster Autopilot Management with GTID and Raft
awesomeDataTool/xtdb
General purpose bitemporal database for SQL, Datalog & graph queries. Developed by @juxt