Sabareh
Data Geek passionate about Data Science, Data Engineering and Artificial Intelligence. Keen on using data to drive business insights and improve efficiency.
HFC Bank KenyaNairobi,Kenya
Pinned Repositories
Azure-for-Data-Engineering
dag-pipeline-with-dbt
The project focuses on the development and deployment of an ELT (Extract, Load, Transform) pipeline utilizing industry-standard tools such as dbt (data build tool), Snowflake, and Airflow. The pipeline is designed to handle the transformation and loading of data from source tables to final data marts, ensuring efficient data processing.
data-engineering-project-using-sales-data
Data Engineering Project using Sales Data in Hadoop using Cloudera
Fraud-Detection-Using-Kafka-Streams
This project demonstrates how to use Apache Kafka Streams to detect fraudulent activities by analyzing IP logs in real-time. By processing the streaming data, the system flags potential fraud by identifying suspicious patterns, such as repeated login attempts or access from unusual IP addresses.
maalik
Feature-rich Post Exploitation Framework with Network Pivoting capabilities.
Product-Network-Analysis-Using-R
This Shiny web application analyzes product transactions to discover frequently purchased product pairs and visualize the relationships between them. The app uses association rule mining (Apriori algorithm) to identify frequent itemsets, and it applies community detection to find clusters of related products.
Retail-Recommender-System
The Retail Recommender System is a Shiny-based web application that provides recommendations for cross-sell opportunities using association rule mining. Built with R, it analyzes customer transaction data, extracts purchasing patterns, and generates rules for cross-sell recommendations.
stock-price-prediction-spark-cassandra
This is a data pipeline for predicting stock prices using Apache Spark, Apache Cassandra, and machine learning techniques. It collects and preprocesses stock data from Alpha Vantage API, engineers features, trains models, and performs data analysis and predictions.
Stock_Price_Data_Analysis
This repository contains the code and analysis for my data analysis project on stock price analysis and forecasting for my Internal attachment at Jomo Kenyatta University of Agriculture and Technology. The project analyzes historical stock price data, visualizes trends, and develops a forecasting model using Python and data science techniques.
tailwind-nextjs-starter-blog
This is a Next.js, Tailwind CSS blogging starter template. Comes out of the box configured with the latest technologies to make technical writing a breeze. Easily configurable and customizable. Perfect as a replacement to existing Jekyll and Hugo individual blogs.
Sabareh's Repositories
Sabareh/stock-price-prediction-spark-cassandra
This is a data pipeline for predicting stock prices using Apache Spark, Apache Cassandra, and machine learning techniques. It collects and preprocesses stock data from Alpha Vantage API, engineers features, trains models, and performs data analysis and predictions.
Sabareh/Fraud-Detection-Using-Kafka-Streams
This project demonstrates how to use Apache Kafka Streams to detect fraudulent activities by analyzing IP logs in real-time. By processing the streaming data, the system flags potential fraud by identifying suspicious patterns, such as repeated login attempts or access from unusual IP addresses.
Sabareh/Product-Network-Analysis-Using-R
This Shiny web application analyzes product transactions to discover frequently purchased product pairs and visualize the relationships between them. The app uses association rule mining (Apriori algorithm) to identify frequent itemsets, and it applies community detection to find clusters of related products.
Sabareh/dag-pipeline-with-dbt
The project focuses on the development and deployment of an ELT (Extract, Load, Transform) pipeline utilizing industry-standard tools such as dbt (data build tool), Snowflake, and Airflow. The pipeline is designed to handle the transformation and loading of data from source tables to final data marts, ensuring efficient data processing.
Sabareh/Retail-Recommender-System
The Retail Recommender System is a Shiny-based web application that provides recommendations for cross-sell opportunities using association rule mining. Built with R, it analyzes customer transaction data, extracts purchasing patterns, and generates rules for cross-sell recommendations.
Sabareh/ds-comm-ke
Data Science Communities in Kenya
Sabareh/Forecasting-ML-App
R machine learning application that performs forecasting on pharmaceutical medicine sales data using information obtained form NHS (UK) General Practitioner (GP) datasets.
Sabareh/sabareh
Config files for my GitHub profile.
Sabareh/Technical_Writing_Training
This is a repository containing the course work that I have done in the course "Technical Writing: How to Write Software Documentation" on Udemy offered by Jordan Stanchev
Sabareh/A-Visual-History-of-Nobel-Prize-Winners
Sabareh/The-Forex-Data-Pipeline
The Forex Data Pipeline is a comprehensive solution designed to collect, process, and prepare currency exchange rate data for downstream machine-learning pipelines. This repository showcases the creation of a data pipeline that fetches currency rates from an external API and performs data transformation using PySpark.
Sabareh/Airports-Average-Distances
In this project I import an open dataset from socrata that contains airport codes, latitude coordinates and longitude coordinates for 13,429 US airports.
Sabareh/Analyzing-Students-Mental-Health
In this project I explore and analyze the students data to see how the study reached its conclusions and gain a better understanding of it. Specifically, I explore and analyze how the length of stay (stay) impacts the mental health of the international students present in the study.
Sabareh/Apache_Kafka_Project
In this project, I explored the Apache Kafka concepts such us asynchronous messaging, real-time stream processing, logging and monitoring, event sourcing, and real-time analytics.
Sabareh/data-project
Sabareh/Databriks-Golang-SDK
This repository holds code for installing and configuring the databriks SDK with Go on Visual Studio Code for ELT tasks with Spark SQL and Python
Sabareh/dp-203-azure-data-engineer
Exercise files for Microsoft Data Engineer curriculum
Sabareh/Dr.-Semmelweis-and-the-Discovery-of-Handwashing
Reanalyzed the data behind one of the most important discoveries of modern medicine: handwashing
Sabareh/Drive-Safe
🚗 DriveSafe: Your Guardian Angel on the Road 🛡️ Stay alert and safe behind the wheel with DriveSafe! Powered by facial recognition and machine learning, it detects signs of fatigue in real-time, ensuring every journey is a safe one. Let DriveSafe be your co-pilot on the road to safer driving! 🌟
Sabareh/elective-abroad
Sabareh/ETL-DAG-with-Airflow
Welcome to my repository for building a Directed Acyclic Graph (DAG) using Apache Airflow for analyzing top-level domains (TLDs). This project aims to provide a robust framework for systematically collecting data on TLD usage and performing insightful analyses using Airflow's powerful workflow automation capabilities.s.
Sabareh/Getting-Started-as-User-Assistance-Developer
A repository to share content and helpful resources about user assistance, information architecture and technical writing.
Sabareh/hands-on-introduction-to-data-engineering-
Sabareh/Hugging-Face
Sabareh/Job-descriptions
Loking for a job in data science? In this project, I have "scraped"(taken from the web) 1000 job descriptions for companies
Sabareh/Movie-Data-ETL-Project
Sabareh/supervisedml_lineReg_boston_dataset
Sabareh/textbook
The textbook Computational and Inferential Thinking: The Foundations of Data Science
Sabareh/workshop-library
A library of workshops written by and for Microsoft Learn Student Ambassadors and Cloud Advocates and their local communities
Sabareh/zenorocha.com
My personal website ❤️