priyam-choksi
Data Science & Engineering | Python, ML, AWS, Apache Kafka | Passionate About Big Data, Real-Time Analytics, and Building Scalable Solutions
Northeastern UniversityBoston
Pinned Repositories
aws-glue-cheat-sheet
cheat-sheets
This is my personal knowledge-base. Here you'll find code-snippets, technical documentation, and command reference for various tools, and technologies.
City-Real-Time-Streaming-Data-Pipeline
This project covers each phase from data ingestion to processing and finally storage. We'll utilize tools like IOT devices, Apache Zookeeper, ApacheKafka, Apache Spark, Docker, Python, AWS Cloud, AWS Glue, AWS Athena, AWS IAM, AWS Redshift and finally PowerBI.
Clinical_dataset
A Clinical dataset sourced from Huggingface. The aim here is to test csv and parquet processing times and test out parseRDPR and ehrapy packages
Data-Integration-and-Business-Intelligence
This project involves creating a robust data warehouse to support the sales and purchasing operations of the AdventureWorks company. Utilizing multiple data sources from different database systems, the main goal is to provide a unified view that facilitates complex queries and reporting.
Diabetes-Streamlit-App
This Diabetes Prediction App aims to assess the likelihood of diabetes based on various health metrics provided by the user. The application leverages a Logistic Regression model, well-suited for binary classification tasks, to predict the onset of diabetes. It features a user-friendly web interface developed with Streamlit.
DS-ML-Notebooks
This repository is a showcase of my data science and machine learning projects. Each notebook is an independent project where I explore different datasets, apply various data processing techniques, and build machine learning models. The goal is to demonstrate my skills and share my learning journey with the community.
Real-Time-Stock-Market-Data-Processing
This project focuses on constructing a real-time data engineering pipeline for stock market data using Apache Kafka, Python, and various AWS services. The goal is to demonstrate an end-to-end implementation that collects, processes, stores, and queries stock market data in real-time.
Scalable-Data-Engineering-Pipeline-using-Apache-Kafka-Apache-Spark-and-Cassandra
An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Kafka, Apache Zookeeper, Apache Spark, and Cassandra. All components are containerized with Docker for easy deployment and scalability.
Uber-ETL-Data-Engineering-Project
The goal of this project is to perform data analytics on Uber data using various tools and technologies, including GCP Storage, Python, Compute Instance, Mage Data Pipeline Tool, BigQuery, and Looker Studio.
priyam-choksi's Repositories
priyam-choksi/aws-glue-cheat-sheet
priyam-choksi/cheat-sheets
This is my personal knowledge-base. Here you'll find code-snippets, technical documentation, and command reference for various tools, and technologies.
priyam-choksi/City-Real-Time-Streaming-Data-Pipeline
This project covers each phase from data ingestion to processing and finally storage. We'll utilize tools like IOT devices, Apache Zookeeper, ApacheKafka, Apache Spark, Docker, Python, AWS Cloud, AWS Glue, AWS Athena, AWS IAM, AWS Redshift and finally PowerBI.
priyam-choksi/Clinical_dataset
A Clinical dataset sourced from Huggingface. The aim here is to test csv and parquet processing times and test out parseRDPR and ehrapy packages
priyam-choksi/cracking-the-data-science-interview
A Collection of Cheatsheets, Books, Questions, and Portfolio For DS/ML Interview Prep
priyam-choksi/Data-Integration-and-Business-Intelligence
This project involves creating a robust data warehouse to support the sales and purchasing operations of the AdventureWorks company. Utilizing multiple data sources from different database systems, the main goal is to provide a unified view that facilitates complex queries and reporting.
priyam-choksi/Diabetes-Streamlit-App
This Diabetes Prediction App aims to assess the likelihood of diabetes based on various health metrics provided by the user. The application leverages a Logistic Regression model, well-suited for binary classification tasks, to predict the onset of diabetes. It features a user-friendly web interface developed with Streamlit.
priyam-choksi/DS-ML-Notebooks
This repository is a showcase of my data science and machine learning projects. Each notebook is an independent project where I explore different datasets, apply various data processing techniques, and build machine learning models. The goal is to demonstrate my skills and share my learning journey with the community.
priyam-choksi/INFO6105_DS
Repo for Uploading Assignments and Pet Projects.
priyam-choksi/Introduction-to-Machine-Learning
This repo will house all our course material and code snippets from the Introduction to Machine Learning Class
priyam-choksi/languagemodel
priyam-choksi/ML-Algorithms-from-Scratch
priyam-choksi/Real-Time-Stock-Market-Data-Processing
This project focuses on constructing a real-time data engineering pipeline for stock market data using Apache Kafka, Python, and various AWS services. The goal is to demonstrate an end-to-end implementation that collects, processes, stores, and queries stock market data in real-time.
priyam-choksi/Scalable-Data-Engineering-Pipeline-using-Apache-Kafka-Apache-Spark-and-Cassandra
An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Kafka, Apache Zookeeper, Apache Spark, and Cassandra. All components are containerized with Docker for easy deployment and scalability.
priyam-choksi/Uber-ETL-Data-Engineering-Project
The goal of this project is to perform data analytics on Uber data using various tools and technologies, including GCP Storage, Python, Compute Instance, Mage Data Pipeline Tool, BigQuery, and Looker Studio.
priyam-choksi/Machine-Learning-Visualizers
priyam-choksi/NBA-Injury-Prediction-Model
priyam-choksi/Personalized-Movie-Recommendation-Engine
priyam-choksi/pokemonGAN
priyam-choksi/priyam-choksi
priyam-choksi/RNN
priyam-choksi/streamlit-image-to-pixel
priyam-choksi/streamlitdiffusion