Pinned Repositories
AI-System
System for AI Education Resource.
azure-quickstart-templates
Azure Quickstart Templates
benchmark-kit
Testbench for experimenting with SparkSQL and FluoDB
free-programming-books-zh_CN
免费的计算机编程类中文书籍,欢迎投稿
sigar-system_runtime
基于java通过第三方jar包sigar的支持,完成对服务器系统的参数监控,包括CPU、内存、硬盘以及网络流量的实时监控
spark-application-template
spark-ml-source-analysis
spark ml 算法原理剖析以及具体的源码实现分析
SparkLearning
Learning Apache spark,including code and data .Most part can run local.
SparkML
spark 机器学习:利用jupyter工作来讲解算法原理并运行相关例子
YanjieGao's Repositories
YanjieGao/checkpoint_paper
checkpoint_paper
YanjieGao/coding-style
Python代码编写规范 - 臧致远
YanjieGao/HiTune
HiTune is a Hadoop performance analyzer. See trouble shooting and known issues here
YanjieGao/linkedin-utils
Base utilities shared by all linkedin open source projects
YanjieGao/moviedemo
mllib demo with movielens dataset
YanjieGao/osteach.github.com
开源文化
YanjieGao/spark-parquet-example
Example project to show how to use Spark to read and write Avro/Parquet files
YanjieGao/sparrow
Sparrow scheduling platform (U.C. Berkeley).
YanjieGao/Starfish
Starfish is a self-tuning system for big data analytics. Starfish builds on Hadoop while adapting to user needs and system workloads to provide good performance automatically, without any need for users to understand and manipulate the many tuning knobs in Hadoop.