myloginid's Repositories
myloginid/Zootopia
Hadoop deployment automation
myloginid/earth-june-2017
the center of the universe
myloginid/mercury-june-2017
myloginid/kudu_pyspark_example
Demo showcasing Spark Streaming, Kafka, Kudu - all in Python
myloginid/convnet-benchmarks
Easy benchmarking of all publicly accessible implementations of convnets
myloginid/prereq-checks
Prerequisites checker for Cloudera Hadoop (CDH) installation
myloginid/PST-to-Parquet
Unpack PST files into JSON then convert them to Parquet Format
myloginid/ansible-role-elastic-stack-5.x
myloginid/spark-stream-kudu
Spark Streaming / Kudu integration examples
myloginid/got-your-back
Got Your Back (GYB) is a command line tool for backing up your Gmail messages to your computer using Gmail's API over HTTPS.
myloginid/spark-hive-udf-scryptowrapper
Scrypt as Hive User Defined Functions (UDFs), for use in Apache Spark. Spark BLAKE2 Hive UDF. BLAKE2 is a cryptographic hash function faster than MD5, SHA-1, SHA-2, and SHA-3, yet at least as secure as the latest standard, SHA-3. BLAKE2 has been adopted by many projects due to its high speed, security, and simplicity.
myloginid/spark-playground
myloginid/awesome-security
A collection of awesome software, libraries, documents, books, resources and cool stuff about security.
myloginid/fastText
Library for fast text representation and classification.
myloginid/SparkInternals
Notes on the design and implementation of Apache Spark
myloginid/CCA175-preparation
My work to prepare for the CCA175 certification
myloginid/hadoop-on-gce
Repository for deploying a 10-node CDH 5.1 cluster on GCE
myloginid/sentimentr
myloginid/xgboost
https://github.com/dmlc/xgboost
myloginid/CommNet
Neural network model, suitable for multi-agent learning. https://arxiv.org/abs/1605.07736
myloginid/distributedstorage
myloginid/kafka-spark-streaming
Project for reading data from Kafka and writing to Kafka and HBase with Kerberos
myloginid/HadoopInternals
Diagrams describing Apache Hadoop internals (2.3.0 or later).
myloginid/gitignore
A collection of useful .gitignore templates
myloginid/ansible
myloginid/yarn-logs-helpers
Scripts for parsing / making sense of yarn logs
myloginid/spark
Mirror of Apache Spark
myloginid/sparklingwater-examples
Examples of H2O Sparkling Water.
myloginid/hive-udf-getAddressFromLatLong
myloginid/Kaggles