Pinned Repositories
ambari
Fork of Apache Ambari maintained by Clemlab Company
BigData_AutomaticDeploy
大数据自动化部署,包括自动化部署hadoop、hive、hbase、spark、storm等等一系列组件
canal
阿里巴巴 MySQL binlog 增量订阅&消费组件
datasqueeze
Hadoop utility to compact small files
DataX
DataX是阿里云DataWorks数据集成的开源版本。
dolphinscheduler
Apache DolphinScheduler is a distributed and extensible workflow scheduler platform with powerful DAG visual interfaces, dedicated to solving complex job dependencies in the data pipeline and providing various types of jobs available out of box.
filecrush
Remedy small files by combining them into larger ones.
guhaitao
Config files for my GitHub profile.
hadoop
Apache Hadoop
hadoop-lzo
Refactored version of code.google.com/hadoop-gpl-compression for hadoop 0.20
guhaitao's Repositories
guhaitao/ambari
Fork of Apache Ambari maintained by Clemlab Company
guhaitao/BigData_AutomaticDeploy
大数据自动化部署,包括自动化部署hadoop、hive、hbase、spark、storm等等一系列组件
guhaitao/canal
阿里巴巴 MySQL binlog 增量订阅&消费组件
guhaitao/datasqueeze
Hadoop utility to compact small files
guhaitao/DataX
DataX是阿里云DataWorks数据集成的开源版本。
guhaitao/dolphinscheduler
Apache DolphinScheduler is a distributed and extensible workflow scheduler platform with powerful DAG visual interfaces, dedicated to solving complex job dependencies in the data pipeline and providing various types of jobs available out of box.
guhaitao/filecrush
Remedy small files by combining them into larger ones.
guhaitao/guhaitao
Config files for my GitHub profile.
guhaitao/hadoop
Apache Hadoop
guhaitao/hadoop-lzo
Refactored version of code.google.com/hadoop-gpl-compression for hadoop 0.20
guhaitao/hadoop_exporter
A hadoop exporter for prometheus, scrape hadoop metrics (including HDFS, YARN, MAPREDUCE, HBASE. etc.) from hadoop components jmx url.
guhaitao/hadoop_jmx_exporter
HDFS & YARN jmx metrics prometheus exporter
guhaitao/hdfsutils
hdfs文件治理工具,文件批量解压、压缩、小文件合并
guhaitao/hive-phoenix-handler
hive-phoenix-handler is a hive plug-in that can access Apache Phoenix table on HBase using HiveQL.
guhaitao/hive-third-functions
Some useful custom hive udf functions, especial array, json, math, string functions.
guhaitao/horizon
Horizon is a Django-based project aimed at providing a complete OpenStack Dashboard along with an extensible framework for building new dashboards from reusable components.
guhaitao/ipl2sql
iptables log to SQL converter
guhaitao/java-grok
Simple API that allows you to easily parse logs and other files
guhaitao/learning-spark
Example code from Learning Spark book
guhaitao/nagios-plugins
Collection of some handy Nagios plugins
guhaitao/pentaho-kettle
Pentaho Data Integration ( ETL ) a.k.a Kettle
guhaitao/Shell_Script
Linux系统的安全,通过脚本对Linux系统进行一键检测和一键加固
guhaitao/Streamis
Streaming application development and management system, based on Linkis and DSS, planning to provide the workflow-like graphical drag-and-drop development capability.
guhaitao/Synonyms
中文近义词工具包
guhaitao/YanX
研招网硕士专业目录下载;考研专业目录下载,招生人数,考试科目,考研专业,考研院校,A、B类地区,211、985、双一流;