Pinned Repositories
flink-cdc
Flink CDC is a streaming data integration tool
iceberg
Apache Iceberg
kyuubi
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
paimon
Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch operations.
spark
Apache Spark - A unified analytics engine for large-scale data processing
airbyte
Airbyte is an open-source EL(T) platform that helps you replicate your data in your warehouses, lakes and databases.
aliyun-odps-java-sdk
ODPS SDK for Java Developers
alluxio
Alluxio, data orchestration for analytics and machine learning in the cloud
flink
Apache Flink
incubator-iceberg
Apache Iceberg (Incubating)
zhaomin1423's Repositories
zhaomin1423/aliyun-odps-java-sdk
ODPS SDK for Java Developers
zhaomin1423/BigData-Notes
大数据入门指南 :star:
zhaomin1423/Cch1996.github.io
Cch1996.github.io
zhaomin1423/CS-Notes
:books: 技术面试必备基础知识、Leetcode、计算机操作系统、计算机网络、系统设计
zhaomin1423/DataSphereStudio
DataSphereStudio is a one stop data application development& management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualization, and task scheduling.
zhaomin1423/DataX
DataX是阿里云DataWorks数据集成的开源版本。
zhaomin1423/druid
Apache Druid: a high performance real-time analytics database.
zhaomin1423/Exchangis
Exchangis is a lightweight,highly extensible data exchange platform that supports data transmission between structured and unstructured heterogeneous data sources
zhaomin1423/flinkStreamSQL
基于开源的flink,对其实时sql进行扩展;主要实现了流与维表的join,支持原生flink SQL所有的语法
zhaomin1423/flinkx
Based on Apache Flink. support data synchronization/integration and streaming SQL computation.
zhaomin1423/God-Of-BigData
专注大数据学习面试,大数据成神之路开启。Flink/Spark/Hadoop/Hbase/Hive...
zhaomin1423/hadoop
Mirror of Apache Hadoop
zhaomin1423/hive
Apache Hive
zhaomin1423/hudi
Upserts, Deletes And Incremental Processing on Big Data.
zhaomin1423/incubator-doris-spark-connector
Flink/Spark Connectors for Apache Doris(Incubating)
zhaomin1423/incubator-linkis
Linkis helps easily connect to various back-end computation/storage engines(Spark, Python, TiDB...), exposes various interfaces(REST, JDBC, Java ...), with multi-tenancy, high performance, and resource control.
zhaomin1423/Java-Interview
「Java面试小抄」一份通向理想互联网公司的面试汇总,包括 Java基础、Java并发、JVM、MySQL、Redis、Spring、MyBatis、Kafka、计算机操作系统、计算机网络、系统设计、分布式、Java 项目实战等
zhaomin1423/JavaGuide
「Java学习+面试指南」一份涵盖大部分 Java 程序员所需要掌握的核心知识。准备 Java 面试,首选 JavaGuide!
zhaomin1423/JavaInterview
zhaomin1423/javapoet
A Java API for generating .java source files.
zhaomin1423/kudu
Mirror of Apache Kudu
zhaomin1423/presto
The official home of the Presto distributed SQL query engine for big data
zhaomin1423/spark-connector
This component acts as a bridge between Spark and Vertica, allowing the user to either retrieve data from Vertica for processing in Spark, or store processed data from Spark into Vertica.
zhaomin1423/spark-jobserver
REST job server for Apache Spark
zhaomin1423/spark-redis
A connector for Spark that allows reading and writing to/from Redis cluster
zhaomin1423/SparkDataLineageCapture
Capture the logical plan from Spark (SQL)
zhaomin1423/trino
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
zhaomin1423/universalJavaApplicationStub
An alternative Application Launcher Script for Java Apps on Mac OS X that works with both Apple's and Oracle's PList format and Java 6, 7, 8, 9 and 10. Plus it supports drag&drop to the Dock icon.
zhaomin1423/zeppelin
Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.
zhaomin1423/zhaomin1423.github.io
利用 GitHub Pages + jekyll + namecheap 搭建个人博客