Pinned Repositories
hudi
Upserts, Deletes And Incremental Processing on Big Data.
kyuubi
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
tosfs
Pythonic file-system interface for TOS(Tinder Object Storage)https://tosfs.readthedocs.io/en/latest/
lance
Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, and PyTorch with more integrations coming..
awesome-flink
😎 A curated list of amazingly awesome Flink and Flink ecosystem resources
banyan
a message bus implemented with RabbitMQ
dingdang-robot
叮当是一款可以工作在 Raspberry Pi 上的中文语音对话机器人/智能音箱项目。
FixedAssetManagerServer
it's a node server
flink
Mirror of Apache Flink
flume-customized
customized some flume component.
yanghua's Repositories
yanghua/flink
Mirror of Apache Flink
yanghua/SparkInsight
Spark auto performance tuning and failure analysis tool
yanghua/kylin
Apache Kylin
yanghua/datacollector
StreamSets Data Collector - Continuous big data and cloud platform ingest infrastructure
yanghua/davinci
Davinci is a DVsaaS (Data Visualization as a Service) Platform
yanghua/DBus
DBus
yanghua/edgex-go
EdgeX Golang Services Monorepo
yanghua/edgex-ui-go
yanghua/edgex-ui-go-holding
yanghua/flink-jdbc-driver
yanghua/flink-sql-gateway
yanghua/glossary
Open Glossary of Edge Computing
yanghua/griffin
Mirror of Apache griffin
yanghua/incubator-kyuubi-website
Apache Kyuubi Site
yanghua/incubator-yunikorn-core
Apache YuniKorn Core
yanghua/incubator-yunikorn-k8shim
Apache YuniKorn K8shim
yanghua/incubator-yunikorn-site
Apache Yunikorn website - see the master branch for instructions
yanghua/incubator-yunikorn-web
Apache YuniKorn Web UI - Incubating
yanghua/kyuubi
yanghua/Linkis
Linkis helps easily connect to various back-end computation/storage engines(Spark, Python, TiDB...), exposes various interfaces(REST, JDBC, Java ...), with multi-tenancy, high performance, and resource control.
yanghua/moonbox
Moonbox is a DVtaaS (Data Virtualization as a Service) Platform
yanghua/plynx
PLynx is a domain agnostic platform for managing reproducible experiments and data-oriented workflows.
yanghua/Qualitis
Qualitis is a one-stop data quality management platform that supports quality verification, notification, and management for various datasource. It is used to solve various data quality problems caused by data processing. https://github.com/WeBankFinTech/Qualitis
yanghua/Scriptis
Scriptis is for interactive data analysis with script development(SQL, Pyspark, HiveQL), task submission(Spark, Hive), UDF, function, resource management and intelligent diagnosis.
yanghua/spider
A configurable web spider with a easy-to-use web console
yanghua/streamingpro
Build big Data processing and Machine Learning platform with MLSQL
yanghua/webmagic
A scalable web crawler framework for Java.
yanghua/wormhole
Wormhole is a SPaaS (Stream Processing as a Service) Platform
yanghua/xskipper
An Extensible Data Skipping Framework
yanghua/xxl-job
A lightweight distributed task scheduling framework.(分布式任务调度平台XXL-JOB)