Pinned Repositories
Apache-Spark-Scala-Project-Template
A simple Scala Based Project Template for Apache Spark
awesome-public-datasets
An awesome list of high-quality open datasets in public domains (on-going). By everyone, for everyone!
bigdata-babysteps
This project is aimed towards introduction of big data. It includes all the files and scrips that are required to get a good understanding of the Hadoop Eco System.
bigdata-mini-project1-stack_exchange
This project is aimed at performing data analysis of large volume of Stack Exchange data. It is required to interpret the raw data and make some logical conclusions using the data.
bigdata-mini-project2-imdb_data
This project is aimed at performing data analysis of large volume of IMDB movie reviews data. It is required to interpret the raw data and make some logical conclusions using the data.
bigdata-mini-project3-market_analysis
This project is aimed at performing data analysis of large volume of market analysis data. It is required to interpret the raw data and make some logical conclusions using the data.
Bigdata_preparation
BIgdata prepration and scenarios
cca175
scala-spark-tutorial
Project for James' Apache Spark with Scala course
satyamnijam's Repositories
satyamnijam/scala-spark-tutorial
Project for James' Apache Spark with Scala course
satyamnijam/Apache-Spark-Scala-Project-Template
A simple Scala Based Project Template for Apache Spark
satyamnijam/awesome-public-datasets
An awesome list of high-quality open datasets in public domains (on-going). By everyone, for everyone!
satyamnijam/bigdata-babysteps
This project is aimed towards introduction of big data. It includes all the files and scrips that are required to get a good understanding of the Hadoop Eco System.
satyamnijam/bigdata-mini-project1-stack_exchange
This project is aimed at performing data analysis of large volume of Stack Exchange data. It is required to interpret the raw data and make some logical conclusions using the data.
satyamnijam/bigdata-mini-project2-imdb_data
This project is aimed at performing data analysis of large volume of IMDB movie reviews data. It is required to interpret the raw data and make some logical conclusions using the data.
satyamnijam/bigdata-mini-project3-market_analysis
This project is aimed at performing data analysis of large volume of market analysis data. It is required to interpret the raw data and make some logical conclusions using the data.
satyamnijam/Bigdata_preparation
BIgdata prepration and scenarios
satyamnijam/CCA175PracticeCode
Repository to capture all Practice code and examples for "CCA Spark and Hadoop Developer Certification - Cloudera" (CCA175) Exam.
satyamnijam/code
satyamnijam/config-server
satyamnijam/data
satyamnijam/data_analysis_using_apache_hive_and_apache_pig
Apache Hive, an open-source data warehouse system, is used with Apache Pig for loading and transforming unstructured, structured, or semi-structured data for data analysis and getting better business insights. Pig, a standard ETL scripting language, is used to export and import data into Apache Hive and to process large number of datasets.
satyamnijam/hadoop-book
Example source code accompanying O'Reilly's "Hadoop: The Definitive Guide" by Tom White
satyamnijam/hadoop_lab
satyamnijam/ipl-data-analysis
IPL is one of the leading cricket tournaments held as T20 format. It is played in India from April-May every year. There is lot of analysis data available that can be used for data analysis.
satyamnijam/kubernetes
Production-Grade Container Scheduling and Management
satyamnijam/LearnWebhookTest
satyamnijam/oozie-examples
Sample oozie jobs
satyamnijam/oozie-tutorials
Oozie tutorials for beginners
satyamnijam/Python-Data-Visualization
satyamnijam/python-spark-tutorial
satyamnijam/satyamnijam.github.io
satyamnijam/single-cell-spark-demo
Experiments on Single Cell data from 10x Genomics using Apache Spark.
satyamnijam/spark
Apache Spark
satyamnijam/sparkTutorial
Source code for James Lee's Aparch Spark with Java course
satyamnijam/springboot_lab-service-registry
satyamnijam/springboot_lab-user-service
satyamnijam/testing
satyamnijam/usql
U-SQL Examples and Issue Tracking