Pinned Repositories
beam
Apache Beam is a unified programming model for Batch and Streaming data processing.
clash-rule
A repository to store clash rule/config
compute-platform
wapper compute-platform by spark/flink
delta
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
Designers-Learn-Git
为设计师而作的GitHub的快速学习教程
dlink
Dinky is an out of the box one-stop real-time computing platform dedicated to the construction and practice of Unified Batch & Streaming and Unified Data Lake & Data Warehouse. Based on Apache Flink, Dinky provides the ability to connect many big data frameworks including OLAP and Data Lake.
doris-spark-connector
Spark Connector for Apache Doris
elasticsearch-hadoop
:elephant: Elasticsearch real-time search and analytics natively integrated with Hadoop
fancyss_history_package
科学上网插件的离线安装包储存在这里
flink-java-demo
Flink Demo with Java
smokeriu's Repositories
smokeriu/beam
Apache Beam is a unified programming model for Batch and Streaming data processing.
smokeriu/clash-rule
A repository to store clash rule/config
smokeriu/compute-platform
wapper compute-platform by spark/flink
smokeriu/delta
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
smokeriu/dlink
Dinky is an out of the box one-stop real-time computing platform dedicated to the construction and practice of Unified Batch & Streaming and Unified Data Lake & Data Warehouse. Based on Apache Flink, Dinky provides the ability to connect many big data frameworks including OLAP and Data Lake.
smokeriu/doris-spark-connector
Spark Connector for Apache Doris
smokeriu/elasticsearch-hadoop
:elephant: Elasticsearch real-time search and analytics natively integrated with Hadoop
smokeriu/flink-java-demo
Flink Demo with Java
smokeriu/flink-stu-java
smokeriu/graphframes
smokeriu/hbase-connectors
Apache HBase Connectors
smokeriu/hudi
Upserts, Deletes And Incremental Processing on Big Data.
smokeriu/iceberg
Apache Iceberg
smokeriu/incubator-amoro
Apache Amoro (incubating) is a Lakehouse management system built on open data lake formats.
smokeriu/incubator-livy
Apache Livy is an open source REST interface for interacting with Apache Spark from anywhere.
smokeriu/incubator-paimon
Apache Paimon(incubating) is a streaming data lake platform that supports high-speed data ingestion, change data tracking and efficient real-time analytics.
smokeriu/incubator-seatunnel
SeaTunnel is a distributed, high-performance data integration platform for the synchronization and transformation of massive data (offline & real-time).
smokeriu/inlong
Apache InLong - a one-stop integration framework for massive data
smokeriu/kyuubi
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
smokeriu/leetcode
leetcode做题记录。之前的记录再OneNote上,不过多年下来发现OneNote并不适合记录leetcode这类问题
smokeriu/linkis
Apache Linkis builds a computation middleware layer to facilitate connection, governance and orchestration between the upper applications and the underlying data engines.
smokeriu/metabase
The simplest, fastest way to get business intelligence and analytics to everyone in your company :yum:
smokeriu/nebula-algorithm
Nebula-Algorithm is a Spark Application based on GraphX, which enables state of art Graph Algorithms to run on top of NebulaGraph and write back results to NebulaGraph.
smokeriu/nebula-spark-connector
smokeriu/Obsidian-notes
Used for synchronizing Obsidian notes
smokeriu/ripgrep
ripgrep recursively searches directories for a regex pattern while respecting your gitignore
smokeriu/spark
Apache Spark - A unified analytics engine for large-scale data processing
smokeriu/spark-clickhouse-connector
Spark ClickHouse Connector build on DataSourceV2 API
smokeriu/spark-jobserver
REST job server for Apache Spark
smokeriu/spark-sftp
Spark connector for SFTP