Pinned Repositories
aisql-bigdata-base
Bigdata_Components_Guide
cdh-deploy-robot
CoolplaySpark
Coolplay Spark: Spark source code analysis, Spark libraries, and more
EasyScheduler
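Easy Scheduler is a distributed workflow task scheduling system. It mainly addresses the tangled dependencies of ETL jobs in data development and the difficulty of monitoring task health at a glance. Easy Scheduler assembles Tasks into a DAG, monitors their running status in real time, and supports operations such as retry, recovery from a specified failed node, pause, and kill. It was built by several engineers with years of experience in workflow scheduling and aims to become a mainstay of big data platforms by making scheduling easier, as its Chinese name "易调度" ("easy scheduling") suggests. If you are not satisfied with the schedulers currently available, you are welcome to use Easy Scheduler, join the project, submit requirements, and contribute code.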
interview_python
Python interview questions
MapReduce
MapReduce Demo
pdf
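Programming e-books and books, covering C, C#, Docker, Elasticsearch, Git, Hadoop, HeadFirst, Java, Javascript, jvm, Kafka, Linux, Maven, MongoDB, MyBatis, MySQL, Netty, Nginx, Python, RabbitMQ, Redis, Scala, Solr, Spark, Spring, SpringBoot, SpringCloud, TCPIP, Tomcat, Zookeeper, artificial intelligence, big data, concurrent programming, databases, data mining, new interview questions, architecture design, algorithms, computer science, design patterns, software testing, refactoring and optimization, and more categories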
spark-demo
Advanced data analysis with Spark
Spark_DB_Connector
Use the Scala API to read and write data across different databases (HBase, MySQL, etc.).
xiaohei-info's Repositories
xiaohei-info/cdh-deploy-robot
xiaohei-info/aisql-bigdata-base
xiaohei-info/pdf
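Programming e-books and books, covering C, C#, Docker, Elasticsearch, Git, Hadoop, HeadFirst, Java, Javascript, jvm, Kafka, Linux, Maven, MongoDB, MyBatis, MySQL, Netty, Nginx, Python, RabbitMQ, Redis, Scala, Solr, Spark, Spring, SpringBoot, SpringCloud, TCPIP, Tomcat, Zookeeper, artificial intelligence, big data, concurrent programming, databases, data mining, new interview questions, architecture design, algorithms, computer science, design patterns, software testing, refactoring and optimization, and more categories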
xiaohei-info/algorithm-visualizer
:fireworks: Interactive Online Platform that Visualizes Algorithms from Code
xiaohei-info/analytics-zoo
Distributed TensorFlow, Keras and BigDL on Apache Spark
xiaohei-info/awesome-python
A curated list of awesome Python frameworks, libraries, software and resources
xiaohei-info/cdhproject
Usage of the various Hadoop components, continuously updated
xiaohei-info/chia-blockchain
Chia blockchain Python implementation (full node, farmer, harvester, timelord, and wallet)
xiaohei-info/d2-crud-plus-with-d2admin-starter
Starter template integrating d2-crud-plus with d2-admin
xiaohei-info/DataQuality
DataQuality for BigData
xiaohei-info/docker-elk
The Elastic stack (ELK) powered by Docker and Compose.
xiaohei-info/FATE
An Industrial Grade Federated Learning Framework
xiaohei-info/flinkx
A distributed data synchronization tool based on Flink
xiaohei-info/hadoop-ozone
Apache Hadoop Ozone
xiaohei-info/hbase-doc-zh
:book: HBase reference guide in Chinese
xiaohei-info/huobi-chain
The next generation high performance public chain for financial infrastructure.
xiaohei-info/infini-gateway
INFINI-GATEWAY (极限网关), a high-performance, lightweight gateway written in Go, for Elasticsearch and its friends.
xiaohei-info/InterOp
Repository for Interoperability of FATE
xiaohei-info/Interview
Interview = resume guide + LeetCode + Kaggle
xiaohei-info/juicefs
JuiceFS is a distributed POSIX file system built on top of Redis and S3.
xiaohei-info/leeml-notes
Notes on Hung-yi Lee's "Machine Learning" course; read online at https://datawhalechina.github.io/leeml-notes
xiaohei-info/Linkis
Linkis helps you easily connect to various back-end computation and storage engines (Spark, Python, TiDB...) and exposes various interfaces (REST, JDBC, Java ...), with multi-tenancy, high performance, and resource control.
xiaohei-info/Logi-KafkaManager
A one-stop platform for Apache Kafka cluster metrics monitoring and operations management
xiaohei-info/pulsar
Turn large websites into tables and charts using simple SQL.
xiaohei-info/Python-100-Days
Python - From Novice to Master in 100 Days
xiaohei-info/Scriptis
Scriptis is for interactive data analysis, with script development (SQL, Pyspark, HiveQL), task submission (Spark, Hive), UDF, function, and resource management, and intelligent diagnosis.
xiaohei-info/skywalking
APM, Application Performance Monitoring System
xiaohei-info/spu
SPU (Secure Processing Unit) aims to be a provable, measurable secure computation device, which provides computation ability while keeping your private data protected.
xiaohei-info/WhereHows
Data Discovery and Lineage for Big Data Ecosystem
xiaohei-info/zio-quill
Compile-time Language Integrated Queries for Scala