nimuyuhan's Stars
apache/spark
Apache Spark - A unified analytics engine for large-scale data processing
kubeflow/kubeflow
Machine Learning Toolkit for Kubernetes
shengqiangzhang/examples-of-web-crawlers
一些非常有趣的python爬虫例子,对新手比较友好,主要爬取淘宝、天猫、微信、微信读书、豆瓣、QQ等网站。(Some interesting examples of python crawlers that are friendly to beginners. )
StarRocks/starrocks
The world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly any scenario, StarRocks provides best-in-class performance for multi-dimensional analytics, real-time analytics, and ad-hoc queries. A Linux Foundation project.
catboost/catboost
A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports computation on CPU and GPU.
WeiYe-Jing/datax-web
DataX集成可视化页面,选择数据源即可一键生成数据同步任务,支持RDBMS、Hive、HBase、ClickHouse、MongoDB等数据源,批量创建RDBMS数据同步任务,集成开源调度系统,支持分布式、增量同步数据、实时查看运行日志、监控执行器资源、KILL运行进程、数据源信息加密等。
alibaba/Alink
Alink is the Machine Learning algorithm platform based on Flink, developed by the PAI team of Alibaba computing platform.
AdoptOpenJDK/jitwatch
Log analyser / visualiser for Java HotSpot JIT compiler. Inspect inlining decisions, hot methods, bytecode, and assembly. View results in the JavaFX user interface.
rajasekarv/vega
A new arguably faster implementation of Apache Spark from scratch in Rust
apache/griffin
Mirror of Apache griffin
mvel/mvel
MVEL (MVFLEX Expression Language)
fayson/cdhproject
hadoop各组件使用,持续更新
apache/yunikorn-core
Apache YuniKorn Core
WeBankFinTech/Scriptis
Scriptis is for interactive data analysis with script development(SQL, Pyspark, HiveQL), task submission(Spark, Hive), UDF, function, resource management and intelligent diagnosis.
WeBankFinTech/Qualitis
Qualitis is a one-stop data quality management platform that supports quality verification, notification, and management for various datasource. It is used to solve various data quality problems caused by data processing. https://github.com/WeBankFinTech/Qualitis
WeBankFinTech/WeDataSphere
WeDataSphere is a financial grade, one-stop big data platform suite.
baidu/Jprotobuf-rpc-socket
Protobuf RPC是一种基于TCP协议的二进制RPC通信协议的Java实现
JoeCao/qbike
A demo of share bike using DDD, MicroService and Spring Cloud
WeBankFinTech/Prophecis
Prophecis is a one-stop cloud native machine learning platform.
gingerredjade/flink-userportrait-main
基于Flink流处理的动态实时亿级全端用户画像系统
zhangjun0x01/bigdata-examples
分享一些在工作中的大数据实战案例,包括flink、kafka、hadoop、presto等等。欢迎大家关注我的公众号【大数据技术与应用实战】,一起成长。
HongZhaoHua/jstarcraft-ai
目标是提供一个完整的Java机器学习(Machine Learning/ML)框架,作为人工智能在学术界与工业界的桥梁. 让相关领域的研发人员能够在各种软硬件环境/数据结构/算法/模型之间无缝切换. 涵盖了从数据处理到模型的训练与评估各个环节,支持硬件加速和并行计算,是最快最全的Java机器学习库.
dantezhao/data-warehouse
The book of data warehouse
buglas/webgl-lesson
todd5167/flink-spark-submiter
从本地IDEA提交Flink/Spark任务到Yarn/k8s集群
kekingcn/kkbinlog
支持MySQL、MongoDB数据变更订阅分发
aistack/sql-booster
This is a library for SQL optimizing/rewriting including Materialized View rewrite
renrenche/kafka-connectors
kafka connector 插件,支持输入 mysql binlog 和 json 格式写入ClickHouse。持续更新
ambition119/QueryParse
sql解析和执行,能够执行hive, spark, flink, 以及对应对TensorFlow, Deeplearning4j的算法SQL执行
jgrier/flink-stuff
Various things in support of Apache Flink