shangeyao's Stars
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
gin-gonic/gin
Gin is an HTTP web framework written in Go (Golang). It features a Martini-like API with much better performance -- up to 40 times faster. If you need smashing performance, get yourself some Gin.
apache/superset
Apache Superset is a Data Visualization and Data Exploration Platform
apache/spark
Apache Spark - A unified analytics engine for large-scale data processing
kubernetes/minikube
Run Kubernetes locally
apache/flink
Apache Flink
baomidou/mybatis-plus
A powerful enhanced toolkit for MyBatis that simplifies development
tusen-ai/naive-ui
A Vue 3 Component Library. Fairly Complete. Theme Customizable. Uses TypeScript. Fast.
prestodb/presto
The official home of the Presto distributed SQL query engine for big data
apache/pulsar
Apache Pulsar - distributed pub-sub messaging system
theonedev/onedev
Git Server with CI/CD, Kanban, and Packages. Seamless integration. Unparalleled experience.
apache/doris
Apache Doris is an easy-to-use, high performance and unified analytics database.
trinodb/trino
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
datahub-project/datahub
The Metadata Platform for your Data Stack
FasterXML/jackson
Main Portal page for the Jackson project
apache/seatunnel
SeaTunnel is a next-generation super high-performance, distributed, massive data integration tool.
antvis/X6
🚀 JavaScript diagramming library that uses SVG and HTML for rendering.
apache/flink-cdc
Flink CDC is a streaming data integration tool
open-metadata/OpenMetadata
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.
ovh/cds
Enterprise-Grade Continuous Delivery & DevOps Automation Open Source Platform
apache/incubator-streampark
Make stream processing easier! Easy-to-use streaming application development framework and operation platform.
dbeaver/cloudbeaver
Cloud Database Manager
apache/linkis
Apache Linkis builds a computation middleware layer to facilitate connection, governance and orchestration between the upper applications and the underlying data engines.
DataLinkDC/dinky
Dinky is a real-time data development platform based on Apache Flink, enabling agile data development, deployment and operation.
WeBankFinTech/DataSphereStudio
DataSphereStudio is a one-stop data application development & management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualization, and task scheduling.
reata/sqllineage
SQL Lineage Analysis Tool powered by Python
mymarilyn/clickhouse-driver
ClickHouse Python Driver with native interface support
linkedin/coral
Coral is a translation, analysis, and query rewrite engine for SQL and other relational languages.
cubefs/compass
Compass is a task diagnosis platform for big data
apache/dolphinscheduler-operator
Apache DolphinScheduler Kubernetes Operator.