mahmoudparsian
Mahmoud Parsian, Ph.D. in computer science, is a software architect and author. He leads Illumina's Big Data team focused on large-scale genome analytics.
mahmoudparsian's Stars
apache/spark
Apache Spark - A unified analytics engine for large-scale data processing
winterbe/java8-tutorial
Modern Java - A Guide to Java 8
jankotek/mapdb
MapDB provides concurrent Maps, Sets and Queues backed by disk storage or off-heap-memory. It is a fast and easy to use embedded Java database engine.
mahmoudparsian/pyspark-tutorial
PySpark-Tutorial provides basic algorithms using PySpark
asciidoctor/asciidoctor-pdf
Asciidoctor PDF: A native PDF converter for AsciiDoc based on Asciidoctor and Prawn, written entirely in Ruby.
mahmoudparsian/data-algorithms-book
MapReduce, Spark, Java, and Scala for Data Algorithms Book
lintool/MapReduceAlgorithms
Data-Intensive Text Processing with MapReduce
wix-incubator/wix-embedded-mysql
Embedded MySQL based on https://github.com/flapdoodle-oss/de.flapdoodle.embed.process
mahmoudparsian/data-algorithms-with-spark
O'Reilly Book: [Data Algorithms with Spark] by Mahmoud Parsian
josephguan/scala-design-patterns
Design patterns implemented in Scala.
mahmoudparsian/big-data-mapreduce-course
Big Data Modeling, MapReduce, Spark, PySpark @ Santa Clara University
apache/tajo
Mirror of Apache Tajo
ywilkof/spark-jobs-rest-client
Fluent client for interacting with Spark Standalone Mode's REST API for submitting, killing, and monitoring the state of jobs.
ceteri/spark-exercises
Coding exercises for Apache Spark
mahmoudparsian/pyspark-algorithms
PySpark Algorithms Book: https://www.amazon.com/dp/B07X4B2218/ref=sr_1_2
lintool/bigdata-2018w
CS 451/651 431/631 Data-Intensive Distributed Computing (Winter 2018) at the University of Waterloo
bluebreezecf/SparkJobServerClient
Java client for the Spark Job Server, implementing its REST APIs
lintool/bigdata-2016w
CS 489/698 Big Data Infrastructure (Winter 2016) at the University of Waterloo
mahmoudparsian/machine-learning-course
Machine Learning Course @ Santa Clara University
mahmoudparsian/learning-spark-examples
Examples for learning Spark
ismailhammounou/db2ixf
db2ixf is a Python package with a CLI that simplifies the parsing and processing of IBM Integration eXchange Format (IXF) files.
sameeraxiomine/sparkusingjava8
Spark Using Java 8
mahmoudparsian/data-warehousing
This repository is a place for the Data Warehousing course at the Information Systems & Analytics department, Santa Clara University.
scchy/My_Learn
Synced study documents
anagarajan/Spark-on-Ubuntu
Install Spark on Ubuntu running in Oracle VirtualBox.
slangeberg/groovy-gradle-jersey
Groovy / Gradle sample app for Jersey REST API
deepakmca05/aAaEe.com
KoolCards/StudentAlcohol
Logistic regression algorithm that predicts elevated alcohol levels in students