Pinned Repositories
data-scientist-roadmap
Toturial coming with "data science roadmap" graphe.
DataGenerator
DataGenerator is a Java library for systematically producing large volumes of data. DataGenerator frames data production as a modeling problem, with a user providing a model of dependencies among variables and the library traversing the model to produce relevant data sets.
datasciencecoursera
datasharing
The Leek group guide to data sharing
docker-hadoop-ubuntu
A Hadoop image on Ubuntu
docker-spark
docker-spark-1
Apache Spark docker image
docker-ubuntu16-kafka
example-voting-app
Example Docker Compose app
first-contributions
🚀✨ Help beginners to contribute to open source projects
goutham470's Repositories
goutham470/data-scientist-roadmap
Toturial coming with "data science roadmap" graphe.
goutham470/DataGenerator
DataGenerator is a Java library for systematically producing large volumes of data. DataGenerator frames data production as a modeling problem, with a user providing a model of dependencies among variables and the library traversing the model to produce relevant data sets.
goutham470/datasciencecoursera
goutham470/datasharing
The Leek group guide to data sharing
goutham470/docker-hadoop-ubuntu
A Hadoop image on Ubuntu
goutham470/docker-spark
goutham470/docker-spark-1
Apache Spark docker image
goutham470/docker-ubuntu16-kafka
goutham470/example-voting-app
Example Docker Compose app
goutham470/first-contributions
🚀✨ Help beginners to contribute to open source projects
goutham470/hadoop-docker
Hadoop docker image
goutham470/hbase-book
Contains the code used in the HBase: The Definitive Guide book.
goutham470/hellogit
Learning Git
goutham470/mapr-docker-multi
goutham470/spark
Mirror of Apache Spark
goutham470/spark-dashboard
Tooling to deploy an Apache Spark performance dashboard. Run this as a standalone Docker container or install the helm chart on Kubernetes.
goutham470/spark-gotchas
Spark Gotchas. A subjective compilation of the Apache Spark tips and tricks
goutham470/spark-kubernetes
spark on kubernetes
goutham470/SparkInternals
Notes talking about the design and implementation of Apache Spark
goutham470/sparkMeasure
This is the development repository of SparkMeasure, a tool for performance troubleshooting of Apache Spark workloads. It simplifies the collection and analysis of Spark task and stage metrics data.
goutham470/sql-metadata
Uses tokenized query returned by python-sqlparse and generates query metadata
goutham470/support
goutham470/training
goutham470/UVa-Online-Judge
Solutions to UVa Programming Challenges
goutham470/vk-wiki-notes