Pinned Repositories
AdcCrawlImage
ADC 图片爬取脚本
bitsail
BitSail is a distributed, high-performance data integration engine and provides global data integration solutions in batch, streaming, and incremental scenarios. At present, BitSail has been widely used and synchronizes hundreds of trillions data every day.
delta
An open-source storage layer that brings scalable, ACID transactions to Apache Spark™ and big data workloads.
dlink
Dinky is an out of the box one-stop real-time computing platform dedicated to the construction and practice of Unified Batch & Streaming and Unified Data Lake & Data Warehouse. Based on Apache Flink, Dinky provides the ability to connect many big data frameworks including OLAP and Data Lake.
docker-notes
docker 笔记
docs.zh-cn
doris
Apache Doris is an easy-to-use, high performance and unified analytics database.
doris-manager
Cluster manager for Apache Doris
doris-website
Apache Doris Website
flink
Apache Flink
wanghuan2054's Repositories
wanghuan2054/AdcCrawlImage
ADC 图片爬取脚本
wanghuan2054/bitsail
BitSail is a distributed, high-performance data integration engine and provides global data integration solutions in batch, streaming, and incremental scenarios. At present, BitSail has been widely used and synchronizes hundreds of trillions data every day.
wanghuan2054/delta
An open-source storage layer that brings scalable, ACID transactions to Apache Spark™ and big data workloads.
wanghuan2054/dlink
Dinky is an out of the box one-stop real-time computing platform dedicated to the construction and practice of Unified Batch & Streaming and Unified Data Lake & Data Warehouse. Based on Apache Flink, Dinky provides the ability to connect many big data frameworks including OLAP and Data Lake.
wanghuan2054/docker-notes
docker 笔记
wanghuan2054/docs.zh-cn
wanghuan2054/doris
Apache Doris is an easy-to-use, high performance and unified analytics database.
wanghuan2054/doris-manager
Cluster manager for Apache Doris
wanghuan2054/doris-website
Apache Doris Website
wanghuan2054/flink
Apache Flink
wanghuan2054/flink-cdc-connectors
CDC Connectors for Apache Flink®
wanghuan2054/geektime-downloader
极客时间课程下载器,支持下载极客时间专栏/视频课/每日一课/大厂实践/训练营视频
wanghuan2054/gluten
wanghuan2054/honey
Bee is an AI, easy and high efficiency ORM framework. Honey is the implementation of the Bee.
wanghuan2054/hudi
Upserts, Deletes And Incremental Processing on Big Data.
wanghuan2054/ImageProcess
adc(图片自动缺陷检测)相关处理脚本
wanghuan2054/incubator-seatunnel
SeaTunnel is a distributed, high-performance data integration platform for the synchronization and transformation of massive data (offline & real-time).
wanghuan2054/incubator-seatunnel-website
Apache SeaTunnel documents
wanghuan2054/mysql
wanghuan2054/mysql-server
MySQL Server, the world's most popular open source database, and MySQL Cluster, a real-time, open source transactional database.
wanghuan2054/netty-practice
netty practice
wanghuan2054/oracle
wanghuan2054/PLC-OPC
PLC-OPC
wanghuan2054/spark
Apache Spark - A unified analytics engine for large-scale data processing
wanghuan2054/starrocks
StarRocks is a next-gen sub-second MPP database for full analytics scenarios, including multi-dimensional analytics, real-time analytics and ad-hoc query.
wanghuan2054/streamx
Make stream processing easier! Flink & Spark development scaffold, The original intention of StreamX is to make the development of Flink easier. StreamX focuses on the management of development phases and tasks. Our ultimate goal is to build a one-stop big data solution integrating stream processing, batch processing, data warehouse and data laker.
wanghuan2054/streamx-website
StreamX Official Website
wanghuan2054/trino
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)