Pinned Repositories
analytics-engineer-bootcamp
Apache_Spark_Basics
This series explores the basics of Apache Spark with the application of some practical elements of Spark, PySpark & SparkSQL
Apache_Spark_ETL_AND_SparkML
Final project for the IBM Data Engineer course on Apache Spark and Machine Learning covering ETL processes using Elyra and Apache Spark and running SparkML jobs
claimed-component-library
The goal of CLAIMED is to enable low-code/no-code rapid prototyping style programming to seamlessly CI/CD into production.
ETL-using-Linux-Shell
An ETL process using Linux shell command and Postgres SQL
hadoop_mapreduce_setup
Launch a single node Hadoop cluster using Docker and run MapReduce jobs
ibm_data-engineering_capstone_project
The final project for the IBM Data Engineering Professional Certificate
WAZIMAP-CSV-ETL-SCRIPT
A python ETL script to retrieve CSVs from wazimap. The final CSVs were loaded to PowerBI for visualisation
youtube-pipeline-DE-project
WazirRohiman's Repositories
WazirRohiman/hadoop_mapreduce_setup
Launch a single node Hadoop cluster using Docker and run MapReduce jobs
WazirRohiman/analytics-engineer-bootcamp
WazirRohiman/Apache_Spark_Basics
This series explores the basics of Apache Spark with the application of some practical elements of Spark, PySpark & SparkSQL
WazirRohiman/Apache_Spark_ETL_AND_SparkML
Final project for the IBM Data Engineer course on Apache Spark and Machine Learning covering ETL processes using Elyra and Apache Spark and running SparkML jobs
WazirRohiman/claimed-component-library
The goal of CLAIMED is to enable low-code/no-code rapid prototyping style programming to seamlessly CI/CD into production.
WazirRohiman/ETL-using-Linux-Shell
An ETL process using Linux shell command and Postgres SQL
WazirRohiman/etl_pipeline_using_airflow
Building an ETL Pipeline using Airflow
WazirRohiman/getting-started-with-mongodb
A guide to getting started with MongoDB. Connecting to the MongoDB server from the CLI and using python to access and run operations
WazirRohiman/highway_streaming_data_pipeline_using_kafka
Project that aims to de-congest the national highways by analyzing the road traffic data from different toll plazas. As a vehicle passes a toll plaza, the vehicle’s data like vehicle_id,vehicle_type,toll_plaza_id and timestamp are streamed to Kafka.
WazirRohiman/ibm_data-engineering_capstone_project
The final project for the IBM Data Engineering Professional Certificate
WazirRohiman/kafka_workflow
A workflow to stream data with kafka
WazirRohiman/MySQL_backup_restore_command_codes
Steps to follow for creating, backing up and restoring a logical backup of a MySQL database
WazirRohiman/populating_data_warehouse
Example of how to populate a data warehouse
WazirRohiman/Python-Project-DE---IBM_SN_Final_Assignment
The final assignment for the IBM Python Project for Data Engineering Course.
WazirRohiman/WAZIMAP-CSV-ETL-SCRIPT
A python ETL script to retrieve CSVs from wazimap. The final CSVs were loaded to PowerBI for visualisation
WazirRohiman/youtube-pipeline-DE-project
WazirRohiman/IBM_Cloudant_Lab
The repo describes the steps and the command used to create a document database on IBM Cloundant and Parsing JSON queries on the documents stored in the DB
WazirRohiman/importing_exporting_from_nosql_databases
WazirRohiman/IU_WebApplicationDev
WazirRohiman/saas-boilerplate_1_F
SaaS Boilerplate - Open Source and free SaaS stack that lets you build SaaS products faster in React, Django and AWS. Focus on essential business logic instead of coding repeatable features!
WazirRohiman/using_cassandra_query_language_shell
A guide to connecting to the Cassandra client and running Keyspace, Table and CRUD operations using CQL Shell