gavin0815's Stars
pierre94/flink-notes
flink学习笔记
dwestheide/kontextfrei
Writing application logic for Spark jobs that can be unit-tested without a SparkContext
holdenk/spark-testing-base
Base classes to use when writing tests with Spark
hustguobing/oracle_hbase2kudu
spark读取oracle以及hbase写入kudu
alisheykhi/Spark-Impala-Example
This project is an example for reading data from Impala (using impala for transformation) as a Spark DataFrame and writing objects from Spark into oracle database using JDBC.
oracle/spark-oracle
On the fly, translation of Spark programs to run natively on your Oracle DB. Your Spark programs require no changes.
igniterealtime/Spark
Cross-platform real-time collaboration client optimized for business and organizations.
leesf/hudi-demos
汇总Apache Hudi中的一些Demo,便于快速上手Apache Hudi(Apache Hudi Demos to help beginners know about Hudi)
leesf/hudi-resources
汇总Apache Hudi相关资料
realguoshuai/hadoop_study
定期更新Hadoop生态圈中常用大数据组件文档 重心依次为: Flink Solr Sparksql ES Scala Kafka Hbase/phoenix Redis Kerberos (项目包含hadoop思维导图 印象笔记 Scala版本简单demo 常用工具类 去敏后的train code 持续更新!!!)
WeBankFinTech/DataSphereStudio
DataSphereStudio is a one stop data application development& management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualization, and task scheduling.
yangyichao-mango/flink-study
gingerredjade/flink-userportrait-main
基于Flink流处理的动态实时亿级全端用户画像系统
xuwei517/FlinkExample
Flink代码实例
godaai/flink-book-zh
Flink Tutorial Project
todd5167/flink-spark-submiter
从本地IDEA提交Flink/Spark任务到Yarn/k8s集群
danny0405/flink-source-code-analysis
Apache Flink 源码分析系列,基于 git tag 1.1.2
perkinls/flink-local-train
flink入门到项目实践
ivi-ru/flink-clickhouse-sink
Flink sink for Clickhouse
threeknowbigdata/flink_second_understand
该仓库专注于让读者秒懂Flink组件,包含Flink实战代码和文档、200个Flink教程知识点,Flink Datastream、Flink Table、Flink Window、Flink State、Flink Checkpoint、Flink Metrics、Flink Memory、Flink on standalone /yarn/k8s、Flink SQL、Flink CEP、Flink CDC、Flink UDF、PyFlink、Flink新特性、Flink Partition、Flink Memory等知识点。详细链接请看:https ://mp.weixin.qq.com/mp /appmsgalbum?__biz=Mzg5NDY3NzIwMA==&action=getalbum&album_id=2038088622687469575#wechat_redirect
zhp8341/flink-streaming-platform-web
基于flink的实时流计算web平台
apache/flink-cdc
Flink CDC is a streaming data integration tool
BestJex/flink-boot
懒松鼠Flink-Boot 脚手架由《深入理解Flink核心设计与实践原理》作者开发,让Flink全面拥抱Spring生态体系,使得开发者可以以Java WEB开发模式开发出分布式运行的流处理程序,懒松鼠让跨界变得更加简单。懒松鼠旨在让开发者以更底上手成本(不需要理解分布式计算的理论知识和Flink框架的细节)便可以快速编写业务代码实现。为了进一步提升开发者使用懒松鼠脚手架开发大型项目的敏捷的度,该脚手架默认集成Spring框架进行Bean管理,同时将微服务以及WEB开发领域中经常用到的框架集成进来,进一步提升开发速度。比如集成Mybatis ORM框架,Hibernate Validator校验框架,Spring Retry重试框架等,具体见下面的脚手架特性。
YotpoLtd/metorikku
A simplified, lightweight ETL Framework based on Apache Spark
apache/bookkeeper
Apache BookKeeper - a scalable, fault tolerant and low latency storage service optimized for append-only workloads
Intel-bigdata/HiBench
HiBench is a big data benchmark suite.
zhaoyachao/zdh_web
大数据采集,抽取平台,zdh_web是zdh系列服务的可视化管理平台,包含数据采集,调度,权限,审批流,私域营销等模块
alldatacenter/alldata
🔥🔥 AllData可定义数据中台,以数据平台为底座,以数据中台为桥梁,以机器学习平台为工厂,以大模型应用为上游产品,提供全链路数字化解决方案。采购商业版、加入技术社区:https://docs.qq.com/doc/DVHlkSEtvVXVCdEFo
aliyun/aliyun-emapreduce-datasources
Extended datasource support for Spark/Hadoop on Aliyun E-MapReduce.
Y1ran/Spark-The-Definitive-Guide-Chinese-Traslation-2019
Spark权威指南( Spark The Definitive Guide) -中文版翻译项目