jhonyazhuxq's Stars
frekele/oracle-java
Oracle Java Binaries
cwida/tpcds-result-reproduction
Reproducing TPC-DS qualification/reference results
apache/kyuubi
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
oshi/oshi
Native Operating System and Hardware Information
JanusGraph/janusgraph
JanusGraph: an open-source, distributed graph database
Qihoo360/Quicksql
A Flexible, Fast, Federated(3F) SQL Analysis Middleware for Multiple Data Sources
OpenTSDB/opentsdb
A scalable, distributed Time Series Database.
ansible/ansible-examples
A few starter examples of ansible playbooks, to show features and how they work together. See http://galaxy.ansible.com for example roles from the Ansible community for deploying many popular applications.
BZCareer/hadoop-ansible
This big data distro contains ansible provisioning for: Apache Hadoop, Apache Spark, Apache Hive, Apache Pig, Apache Storm, Apache Zookeeper, Apache Kafka, Apache Cassandra, ElasticSearch, Kibana, Logstash, Apache Hbase, Apache Zeppelin, Apache Flink
analytically/hadoop-ansible
Ansible playbook that installs a Hadoop cluster, with HBase, Hive, Presto for analytics, and Ganglia, Smokeping, Fluentd, Elasticsearch and Kibana for monitoring and centralized log indexing.
gregrahn/tpcds-kit
TPC-DS benchmark kit with some modifications/fixes
fayson/cdhproject
hadoop各组件使用,持续更新
JThink/spring-boot-starter-hbase
自定义的spring-boot的hbase starter,为hbase的query和更新等操作提供简易的api并集成spring-boot的auto configuration
prontera/docker-mysql-mha
基于Docker的MySQL MHA集群
ekoontz/zookeeper
Mirror of Apache Hadoop ZooKeeper
holdenk/spark-structured-streaming-ml
Structured Streaming Machine Learning example with Spark 2.0
holdenk/spark-validator
A library you can include in your Spark job to validate the counters and perform operations on success. Goal is scala/java/python support.
holdenk/spark-testing-base
Base classes to use when writing tests with Spark
high-performance-spark/high-performance-spark-examples
Examples for High Performance Spark
databricks/Spark-The-Definitive-Guide
Spark: The Definitive Guide's Code Repository
donnemartin/system-design-primer
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
apache/ranger
Apache Ranger - To enable, monitor and manage comprehensive data security across the Hadoop platform and beyond