Pinned Repositories
apache-shiro-tutorial-webapp
A step-by-step tutorial showing how to secure a web app with Apache Shiro
arctic-fork
Arctic is a streaming lake warehouse service open sourced by NetEase
beam
Apache Beam is a unified programming model for Batch and Streaming data processing.
bigdata-Dockerfiles
50+ DockerHub public images for Docker & Kubernetes - DevOps, CI/CD, GitHub Actions, CircleCI, Jenkins, TeamCity, Alpine, CentOS, Debian, Fedora, Ubuntu, Hadoop, Kafka, ZooKeeper, HBase, Cassandra, Solr, SolrCloud, Presto, Apache Drill, Nifi, Spark, Consul, Riak
calcite
Apache Calcite
data-algorithms-book-stu
MapReduce, Spark, Java, and Scala for Data Algorithms Book
data-datart
Datart is a next generation Data Visualization Open Platform
flink-cdc-connectors
CDC Connectors for Apache Flink®
flink-kubernetes-operator
Apache Flink Kubernetes Operator
flink-recommandSystem-demo
:helicopter::rocket:基于Flink实现的商品实时推荐系统。flink统计商品热度,放入redis缓存,分析日志信息,将画像标签和实时记录放入Hbase。在用户发起推荐请求后,根据用户画像重排序热度榜,并结合协同过滤和标签两个推荐模块为新生成的榜单的每一个产品添加关联产品,最后返回新的用户列表。
ai-smalleryu's Repositories
ai-smalleryu/apache-shiro-tutorial-webapp
A step-by-step tutorial showing how to secure a web app with Apache Shiro
ai-smalleryu/arctic-fork
Arctic is a streaming lake warehouse service open sourced by NetEase
ai-smalleryu/beam
Apache Beam is a unified programming model for Batch and Streaming data processing.
ai-smalleryu/bigdata-Dockerfiles
50+ DockerHub public images for Docker & Kubernetes - DevOps, CI/CD, GitHub Actions, CircleCI, Jenkins, TeamCity, Alpine, CentOS, Debian, Fedora, Ubuntu, Hadoop, Kafka, ZooKeeper, HBase, Cassandra, Solr, SolrCloud, Presto, Apache Drill, Nifi, Spark, Consul, Riak
ai-smalleryu/calcite
Apache Calcite
ai-smalleryu/data-algorithms-book-stu
MapReduce, Spark, Java, and Scala for Data Algorithms Book
ai-smalleryu/data-datart
Datart is a next generation Data Visualization Open Platform
ai-smalleryu/flink-cdc-connectors
CDC Connectors for Apache Flink®
ai-smalleryu/flink-kubernetes-operator
Apache Flink Kubernetes Operator
ai-smalleryu/flink-recommandSystem-demo
:helicopter::rocket:基于Flink实现的商品实时推荐系统。flink统计商品热度,放入redis缓存,分析日志信息,将画像标签和实时记录放入Hbase。在用户发起推荐请求后,根据用户画像重排序热度榜,并结合协同过滤和标签两个推荐模块为新生成的榜单的每一个产品添加关联产品,最后返回新的用户列表。
ai-smalleryu/flink-sql-lineage
FlinkSQL字段血缘解决方案及源码。FlinkSQL field lineage solution and source code, The core idea is to parse SQL through Calcite to generate a RelNode tree of relational expressions. Then get the optimized logical paln through optimization stage, and finally call Calcite RelMetadataQuery to get the lineage relationship at the field level.
ai-smalleryu/flink-sql-security-fork
FlinkSQL数据脱敏和行级权限解决方案及源码,支持面向用户级别的数据脱敏和行级数据访问控制,即特定用户只能访问到脱敏后的数据或授权过的行。此方案是实时领域Flink的解决方案,类似于离线数仓Hive Ranger中的Row-level Filter和Column Masking方案。
ai-smalleryu/open-source-manual
A Ebook of Open Source Manual
ai-smalleryu/studyandtest
学习使用
ai-smalleryu/flink-training-fork
Apache Flink Training Excercises
ai-smalleryu/flink2024-realtime
尚硅谷实时数仓 4.0 正式版本 - 1.0
ai-smalleryu/gpt4free
decentralising the Ai Industry, just some language model api's...
ai-smalleryu/hudi
Upserts, Deletes And Incremental Processing on Big Data.
ai-smalleryu/incubator-livy-spark-rest
Apache Livy is an open source REST interface for interacting with Apache Spark from anywhere.
ai-smalleryu/incubator-paimon
Apache Paimon(incubating) is a streaming data lake platform that supports high-speed data ingestion, change data tracking and efficient real-time analytics.
ai-smalleryu/incubator-seatunnel
SeaTunnel is a distributed, high-performance data integration platform for the synchronization and transformation of massive data (offline & real-time).
ai-smalleryu/incubator-streampark-quickstar-forkt
Apache StreamPark quickstart
ai-smalleryu/incubator-streampark-sql-gateway
StreamPark, Make stream processing easier! easy-to-use streaming application development framework and operation platform
ai-smalleryu/incubator-wayang
Apache Wayang(incubating) is the first cross-platform data processing system.
ai-smalleryu/linkis-fork
Apache Linkis builds a computation middleware layer to facilitate connection, governance and orchestration between the upper applications and the underlying data engines.
ai-smalleryu/spark-extension-fork
A library that provides useful extensions to Apache Spark and PySpark.
ai-smalleryu/spark-notebook
Interactive and Reactive Data Science using Scala and Spark.
ai-smalleryu/trino
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)