StanleyLu's Stars
endymecy/spark-programming-guide-zh-cn
Spark 编程指南简体中文版
endymecy/spark-ml-source-analysis
spark ml 算法原理剖析以及具体的源码实现分析
yahoo/lopq
Training of Locally Optimized Product Quantization (LOPQ) models for approximate nearest neighbor search of high dimensional data in Python and Spark.
yahoo/TensorFlowOnSpark
TensorFlowOnSpark brings TensorFlow programs to Apache Spark clusters.
XD-DENG/Spark-practice
Apache Spark (PySpark) Practice on Real Data
winghc/hadoop2x-eclipse-plugin
eclipse plugin for hadoop 2.2.0 , 2.4.1
deeplearning4j/deeplearning4j
Suite of tools for deploying and training deep learning models using the JVM. Highlights include model import for keras, tensorflow, and onnx/pytorch, a modular and tiny c++ library for running math code and a java based math library on top of the core c++ library. Also includes samediff: a pytorch/tensorflow like library for running deep learn...
Yelp/mrjob
Run MapReduce jobs on Hadoop or Amazon Web Services
donnemartin/data-science-ipython-notebooks
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
gterzian/Python-Parallel-Collections
Python collections supporting parallel map/reduce style methods
jimcen33/Intro-to-Hadoop-MapReduce