Pinned Repositories
aisql-bigdata-base
Bigdata_Components_Guide
cdh-deploy-robot
CoolplaySpark
CoolplaySpark: Spark source code analysis, Spark libraries, and more
EasyScheduler
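Easy Scheduler is a distributed workflow task scheduling system. It mainly addresses the problem that the intricate dependencies in data ETL development make it hard to monitor task health intuitively. Easy Scheduler assembles Tasks as a DAG in a streaming fashion, so the running state of each task can be monitored in real time, and it supports operations such as retry, recovery from failure at a specified node, pause, and Kill. EasyScheduler was developed by several engineers with years of experience in workflow scheduling and aims to become a mainstay of big data platforms by making scheduling easier; its Chinese name, "易调度" ("easy scheduling"), reflects that goal. If you are not satisfied with the schedulers currently available, you are welcome to use Easy Scheduler, join the project, raise requirements, and contribute code.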
interview_python
Interview questions about Python
MapReduce
MapReduce Demo
pdf
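Programming e-books and programming books, covering C, C#, Docker, Elasticsearch, Git, Hadoop, HeadFirst, Java, Javascript, jvm, Kafka, Linux, Maven, MongoDB, MyBatis, MySQL, Netty, Nginx, Python, RabbitMQ, Redis, Scala, Solr, Spark, Spring, SpringBoot, SpringCloud, TCPIP, Tomcat, Zookeeper, artificial intelligence, big data, concurrent programming, databases, data mining, new interview questions, architecture design, algorithms, computer science, design patterns, software testing, refactoring and optimization, and more categories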
spark-demo
Advanced data analytics with Spark
Spark_DB_Connector
Use the Scala API to read/write data from different databases: HBase, MySQL, etc.
xiaohei-info's Repositories
xiaohei-info/Spark_DB_Connector
Use the Scala API to read/write data from different databases: HBase, MySQL, etc.
xiaohei-info/EasyScheduler
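Easy Scheduler is a distributed workflow task scheduling system. It mainly addresses the problem that the intricate dependencies in data ETL development make it hard to monitor task health intuitively. Easy Scheduler assembles Tasks as a DAG in a streaming fashion, so the running state of each task can be monitored in real time, and it supports operations such as retry, recovery from failure at a specified node, pause, and Kill. EasyScheduler was developed by several engineers with years of experience in workflow scheduling and aims to become a mainstay of big data platforms by making scheduling easier; its Chinese name, "易调度" ("easy scheduling"), reflects that goal. If you are not satisfied with the schedulers currently available, you are welcome to use Easy Scheduler, join the project, raise requirements, and contribute code.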
xiaohei-info/aisql-service-hbase
xiaohei-info/beam-site-zh
Chinese version of the official Apache Beam website
xiaohei-info/bigflow
Baidu Bigflow is an interface for writing distributed computing programs that provides many simple, flexible, powerful APIs. Using Bigflow, you can easily handle data of any scale.
xiaohei-info/blockchain
Blockchain - Chinese-language resources
xiaohei-info/elasticsearch-doc-zh
:book: [Translation] Elasticsearch documentation in Chinese
xiaohei-info/flink-doc-zh
Apache Flink documentation in Chinese
xiaohei-info/flink-training-course
xiaohei-info/go-ethereum
Official Go implementation of the Ethereum protocol
xiaohei-info/hbase-indexer
Lily HBase Indexer - indexing HBase, one row at a time
xiaohei-info/hive-solr
Read and write Solr using Hive
xiaohei-info/kafka-doc-zh
Kafka documentation in Chinese
xiaohei-info/kudu-doc-zh
:book: [Translation] Kudu documentation in Chinese
xiaohei-info/mlsql-api-console
xiaohei-info/nebula
A high performance distributed Graph Database
xiaohei-info/nsfw_data_scraper
Collection of scripts to aggregate image data for the purposes of training an NSFW Image Classifier
xiaohei-info/presto-hbase-connector
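The presto hbase connector component is implemented against the Presto Connector interface specification and adds HBase query support to Presto. Compared with other open-source HBase connectors, ours is 10 to 100 times faster or more.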
xiaohei-info/REKCARC-TSC-UHT
Guidance for courses in the Department of Computer Science and Technology, Tsinghua University
xiaohei-info/RustPrimer
The Rust primer for beginners. We need native English speakers to help us revise the translation.
xiaohei-info/scala-style-guide
Databricks Scala Coding Style Guide
xiaohei-info/ServiceFramework
An agile, fast Java MVC framework with a rich domain model, made especially for the server side of mobile applications
xiaohei-info/solidity-doc-zh
Solidity documentation in Chinese
xiaohei-info/spark-doc-zh
Chinese version of the official Apache Spark documentation
xiaohei-info/spark-solr
Tools for reading data from Solr as a Spark RDD and indexing objects from Spark into Solr using SolrJ.
xiaohei-info/spring-boot-doc-zh
:book: [Translation] Spring Boot documentation in Chinese
xiaohei-info/streamingpro
Unify Big Data and Machine Learning.
xiaohei-info/USTC-Course
:heart: University of Science and Technology of China (USTC) course resources
xiaohei-info/zeppelin-doc-zh
:book: [Translation] Zeppelin documentation in Chinese
xiaohei-info/zju-icicles
Zhejiang University course guide sharing project