spancer
Big data practitioner and smart-factory data architect. Expert in big data architecture, search engines, big data analysis, and agile development.
Changsha
Pinned Repositories
bigdata-docker-builds
Docker images for building Hadoop 3.2, Hive 3.1, HBase 2.3, Presto 0.247, Flink 1.11.3 on YARN, etc.
bigdata-docker-compose
Deploy a big data platform using Docker Compose. Components include Hadoop, Hive, HBase, Presto, Flink, Elasticsearch, Kafka, etc.
CS-Notes
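:books: Essential fundamentals for technical interviews: LeetCode solutions, Java, C++, Python, backend interviews, operating systems, computer networks, system design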
elasticlake
Open-source data lake built on top of Apache Iceberg
elasticsearch-ansj-analysis-plugin
Ansj analysis plugin for Elasticsearch
FiboRulex
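FiboRulex - a real-time AI decision engine, rule engine, risk-control engine, and data-flow engine. Rules are configured through a visual interface with no tedious development, which saves labor, improves efficiency, enables real-time monitoring, reduces error rates, and allows adjustment at any time. Supports rule sets, scorecards, decision trees, list library management, machine learning models, third-party data integration, custom development, and more.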
flink-es-demo
Quickly implement vehicle collision analysis, cloned-license-plate analysis, and tailing analysis based on Elasticsearch.
flink-iceberg-demo
Flink and Iceberg integration tests, with jobs running on YARN.
prestodb-hbase-connector
PrestoDB HBase connector, using ZooKeeper to hold the metadata.
zeus
Zeus is an open-source analytical engine for big data held in a data lake; it is designed to provide OLAP (Online Analytical Processing) capability in the big data era. You can use Zeus to store, query, analyze, and manage data.
spancer's Repositories
spancer/flink-iceberg-demo
Flink and Iceberg integration tests, with jobs running on YARN.
spancer/zeus
Zeus is an open-source analytical engine for big data held in a data lake; it is designed to provide OLAP (Online Analytical Processing) capability in the big data era. You can use Zeus to store, query, analyze, and manage data.
spancer/prestodb-hbase-connector
PrestoDB HBase connector, using ZooKeeper to hold the metadata.
spancer/flink-es-demo
Quickly implement vehicle collision analysis, cloned-license-plate analysis, and tailing analysis based on Elasticsearch.
spancer/bigdata-libs
Hosting for open big data libraries, intended only to speed up downloads during Docker builds.
spancer/cassandra-cdc-example
Example project using the Commit Log API to read the Apache Cassandra Change Data Capture log
spancer/debezium
Change data capture for a variety of databases. Please log issues at https://issues.redhat.com/browse/DBZ.
spancer/debezium-server-iceberg
Replicates database CDC events to Iceberg Tables
spancer/dev-rules
Collaborative development guidelines
spancer/eagle
Real-time data processing system based on Flink and CEP
spancer/elasticsearch-custom-query-demo
elasticsearch-custom-query-demo
spancer/elasticsearch-learning-to-rank
Plugin to integrate Learning to Rank (aka machine learning for better relevance) with Elasticsearch
spancer/elasticsearch-maxspeed-aggregation-plugin
elasticsearch-maxspeed-aggregation-plugin
spancer/elasticsearchbitmapplugin
Exact deduplication in Elasticsearch using RoaringBitmap, returning results within seconds
spancer/elasticsearchdistanceplugin
spancer/flink
Apache Flink
spancer/flink-cdc-connectors
Change Data Capture (CDC) Connectors for Apache Flink
spancer/flink-spark-submiter
Submit Flink/Spark jobs from a local IDEA to a YARN/k8s cluster
spancer/fraud-detection-demo
Repository for Advanced Flink Application Patterns series
spancer/hello-algorithm
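🌍 "Algorithm interviews + algorithm knowledge": algorithm training for beginners | Also includes: 1. a collection of a hundred interview write-ups from major companies such as Alibaba, ByteDance, and DiDi 2. a thousand open-source e-books 3. a hundred mind maps (please leave a star on the right 🌹, English version supported)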
spancer/HIS
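HIS stands for hospital information system (a medical information and consultation system). Based on data volume, data flow, and processing, its main functions are divided into clinical diagnosis and treatment, drug management, financial management, and patient management. Diagnosis and treatment activities are completed jointly by the workstations, and clinical information is organized, processed, aggregated, tabulated, and analyzed. The system includes the following workstations: outpatient doctor workstation, pharmacy doctor workstation, medical technology doctor workstation, cashier workstation, reconciliation clerk workstation, and administrator workstation. The requirements are for the cloud hospital provided by Neusoft.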
spancer/kafka-connect-http
Kafka Connect connector that enables Change Data Capture from JSON/HTTP APIs into Kafka.
spancer/learning
Becoming better at data science every day
spancer/leveldb
Java version of LevelDB, leveldb-iq80
spancer/markdown-here
Google Chrome, Firefox, and Thunderbird extension that lets you write email in Markdown and render it before sending.
spancer/msl
Message Security Layer
spancer/netty
Netty project - an event-driven asynchronous network application framework
spancer/presto
Home of the community managed version of Presto, the distributed SQL query engine for big data, under the auspices of the Presto Software Foundation.
spancer/presto-1
The official home of the Presto distributed SQL query engine for big data
spancer/styleguide
Style guides for Google-originated open-source projects