Pinned Repositories
chunjun
A data integration framework
Taier
Taier is a big data development platform for submission, scheduling, operation and maintenance, and indicator information display
Auto-GPT
An experimental open-source attempt to make GPT-4 fully autonomous.
canal
阿里巴巴 MySQL binlog 增量订阅&消费组件
chunjun
Based on Apache Flink. Support data synchronization/integration.
flink
Apache Flink
iceberg
Apache Iceberg
spark
Apache Spark - A unified analytics engine for large-scale data processing
Taier
大数据平台-分布式任务调度系统
tools
平时工作中积累的相关样例,希望能够带来启发
FlechazoW's Repositories
FlechazoW/tools
平时工作中积累的相关样例,希望能够带来启发
FlechazoW/flink
Apache Flink
FlechazoW/Taier
大数据平台-分布式任务调度系统
FlechazoW/Auto-GPT
An experimental open-source attempt to make GPT-4 fully autonomous.
FlechazoW/canal
阿里巴巴 MySQL binlog 增量订阅&消费组件
FlechazoW/chunjun
Based on Apache Flink. Support data synchronization/integration.
FlechazoW/datavines
DataVines makes it easier to know your data
FlechazoW/iceberg
Apache Iceberg
FlechazoW/dolphinscheduler
Apache DolphinScheduler is a distributed and extensible workflow scheduler platform with powerful DAG visual interfaces, dedicated to solving complex job dependencies in the data pipeline and providing various types of jobs available out of box.
FlechazoW/FlechazoW
FlechazoW/flechazow.github.io
FlechazoW's blog
FlechazoW/flink-cdc-connectors
Change Data Capture (CDC) Connectors for Apache Flink
FlechazoW/flink-connectors
Apache Flink connector repository
FlechazoW/flinkStreamSQL
基于开源的flink,对其实时sql进行扩展;主要实现了流与维表的join,支持原生flink SQL所有的语法
FlechazoW/guava
Google core libraries for Java
FlechazoW/hadoop
Apache Hadoop
FlechazoW/hudi
Upserts, Deletes And Incremental Processing on Big Data.
FlechazoW/incubator-paimon
Apache Paimon(incubating) is a streaming data lake platform that supports high-speed data ingestion, change data tracking and efficient real-time analytics.
FlechazoW/incubator-seatunnel
SeaTunnel is a distributed, high-performance data integration platform for the synchronization and transformation of massive data (offline & real-time).
FlechazoW/incubator-streampark
StreamPark, Make stream processing easier! easy-to-use streaming application development framework and operation platform
FlechazoW/linux_kernel_wiki
linux内核学习资料:200+经典内核文章,100+内核论文,50+内核项目,500+内核面试题,80+内核视频
FlechazoW/NotionNext
使用 NextJS + Notion API 实现的,支持多种部署方案的静态博客,无需服务器、零门槛搭建网站,为Notion和所有创作者设计。 (A static blog built with NextJS and Notion API, supporting multiple deployment options. No server required, zero threshold to set up a website. Designed for Notion and all creators.)
FlechazoW/patterns-of-distributed-systems
《Patterns of Distributed Systems》中文版
FlechazoW/pulsar
Apache Pulsar - distributed pub-sub messaging system
FlechazoW/seatunnel-shade
Apache seatunnel
FlechazoW/seatunnel-web
SeaTunnel is a distributed, high-performance data integration platform for the synchronization and transformation of massive data (offline & real-time).
FlechazoW/templates
Document templates for open-source projects (README, CONTRIBUTING, GitHub templates)
FlechazoW/tis
Support agile DataOps Based on DataX and Flink-CDC with Web-UI
FlechazoW/trino
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
FlechazoW/zookeeper
Mirror of Apache Hadoop ZooKeeper