Pinned Repositories
chest-x-ray-classification
This study compares lung disease detection from X-ray scans across three datasets and three neural network architectures implemented in PyTorch, and performs an ablation study over learning rates. The learned representations are visualised using t-SNE, and Grad-CAM is used to visualise diseases in the X-ray scans.
credit-risk-modelling
The aim of the project is to create a robust machine learning model that predicts the likelihood that a bank's customers will default on their credit payments in the next month. The dataset contains information on 24028 customers across 26 variables, including whether the customer defaulted, credit limits, bill history, etc.
data-ingestion-pipeline
A Kubernetes microservice that ingests Steam data. It is deployed as a service that uses SQLite for storing processes, followed by an event-driven message queue managed by RabbitMQ. The message queue is also deployed as a service and is connected to Celery workers integrated with Elasticsearch and Redis for caching and ingestion.
Deep-LearningAI
🇦🇮 Deep Learning AI course on Coursera (Andrew Ng)
eeg-signal-analysis
EEG signals can be used to detect a person's mental state, which has applications in machine learning. This repo gives an introduction to EEG signal analysis using the MNE Python library and describes the setup, data preprocessing, data visualization, and segmentation into epochs.
employee-information-retrieval-system
An information retrieval system built with Flask that performs ETL operations while ingesting New York payroll data and provides a frontend user interface for manipulating filters and queries.
ipu-engineering-notes
This repository contains notes for CSE/IT students at IP University. They are organised in their respective folders along with some relevant PDFs.
malicious-network-detection
This project focuses on distinguishing malicious from non-malicious network connections using machine learning. By processing Snort intrusion detection logs, it applies algorithms such as Random Forest, AdaBoost, and XGBoost, achieving high accuracy in classifying network traffic and enhancing cybersecurity.
player-market-value-prediction
Intense speculation surrounds all major player transfers today. An important part of negotiations is predicting the fair market price for a player. This project therefore predicts a player's market value using data provided in CSV format.
pyspark-yelp-data-analysis
A comparative study of the computing efficiency of PySpark architectures versus Python-based distributed programming methodologies such as MPI, multi-threading, or multi-processing on the Yelp Kaggle dataset.
coderjolly's Repositories
coderjolly/ipu-engineering-notes
This repository contains notes for CSE/IT students at IP University. They are organised in their respective folders along with some relevant PDFs.
coderjolly/player-market-value-prediction
Intense speculation surrounds all major player transfers today. An important part of negotiations is predicting the fair market price for a player. This project therefore predicts a player's market value using data provided in CSV format.
coderjolly/data-ingestion-pipeline
A Kubernetes microservice that ingests Steam data. It is deployed as a service that uses SQLite for storing processes, followed by an event-driven message queue managed by RabbitMQ. The message queue is also deployed as a service and is connected to Celery workers integrated with Elasticsearch and Redis for caching and ingestion.
coderjolly/chest-x-ray-classification
This study compares lung disease detection from X-ray scans across three datasets and three neural network architectures implemented in PyTorch, and performs an ablation study over learning rates. The learned representations are visualised using t-SNE, and Grad-CAM is used to visualise diseases in the X-ray scans.
coderjolly/eeg-signal-analysis
EEG signals can be used to detect a person's mental state, which has applications in machine learning. This repo gives an introduction to EEG signal analysis using the MNE Python library and describes the setup, data preprocessing, data visualization, and segmentation into epochs.
coderjolly/health-analytics-using-ML-and-wearble-sensors
This project leverages machine learning to analyze health data from wearable sensors. It focuses on classifying obesity levels, age groups, and calorie consumption habits. With a dataset of physical, lifestyle & demographic features, we used techniques like Support Vector Machines, Random Forests and XGBoost.
coderjolly/news-recommender
This is a news recommender system that uses Beautiful Soup to scrape news articles, their categories, and descriptions to create a data dump. It then uses word embedding techniques such as TF-IDF and word2vec for content-based news recommendation, and LightRF and LightFM to explore hybrid and collaborative-filtering-based recommender models.
coderjolly/spotify-api-data-analysis
The project leverages Apache Airflow for automating Spotify API data analysis, focusing on user activity. Extracting, transforming, and loading data efficiently, it provides insights via PowerBI dashboards.
coderjolly/bird-call
There are already many projects underway to extensively monitor birds by continuously recording natural soundscapes over long periods. However, as many living and nonliving things make noise, the analysis of these datasets is often done manually by domain experts. These analyses are painstakingly slow, and results are often incomplete.
coderjolly/credit-risk-modelling
The aim of the project is to create a robust machine learning model that predicts the likelihood that a bank's customers will default on their credit payments in the next month. The dataset contains information on 24028 customers across 26 variables, including whether the customer defaulted, credit limits, bill history, etc.
coderjolly/employee-information-retrieval-system
An information retrieval system built with Flask that performs ETL operations while ingesting New York payroll data and provides a frontend user interface for manipulating filters and queries.
coderjolly/malicious-network-detection
This project focuses on distinguishing malicious from non-malicious network connections using machine learning. By processing Snort intrusion detection logs, it applies algorithms such as Random Forest, AdaBoost, and XGBoost, achieving high accuracy in classifying network traffic and enhancing cybersecurity.
coderjolly/processing-advertisements
Processing image advertisements to predict the context conveyed through them using CNNs. The images are further visualised using GradCAM to understand how the first and the last layers perceive the image dataset for the classification.
coderjolly/pyspark-yelp-data-analysis
A comparative study of the computing efficiency of PySpark architectures versus Python-based distributed programming methodologies such as MPI, multi-threading, or multi-processing on the Yelp Kaggle dataset.
coderjolly/azure-pipelining-DE
coderjolly/coderjolly
coderjolly/coderjolly.github.io
My portfolio, built with Jekyll, for maintaining my personal pages, projects, writings, and certifications.
coderjolly/customer-churn-prediction
A prediction strategy for understanding the factors required to retain customers. It showcases classical machine learning algorithms, such as SVMs, Decision Trees, and Random Forests, for analysing and understanding the telecom customer dataset.
coderjolly/customer-segmentation-cohort-analysis
coderjolly/Data-Science-Portofolio
coderjolly/Data_Science_Masters_Program_2021
coderjolly/findhelp
Java Spring Boot backend for a depression consultation portal where patients fill in an evaluation form, which a counsellor then reviews to decide whether counselling is required or whether the patient should be directed to a doctor for an online meeting.
coderjolly/hacker-scripts
Based on a true story
coderjolly/hackingtool
ALL IN ONE Hacking Tool For Hackers
coderjolly/healthcare-analytics
coderjolly/home-decor-classification-api
A timed company task to build and deploy an end-to-end deep-learning image classifier for furniture images using TensorFlow and Flask; it returns the predicted label when accessed through an API. The service is dockerized and can also be deployed on AWS.
coderjolly/ml-pipelining
coderjolly/sentiment-synthesis-LLMs
coderjolly/ufc-predictor-mma
coderjolly/utilisation-analysis
This provides a small glimpse of resource data from IISc's Supercomputer Education and Research Centre (SERC), and of how it was ingested and extracted to produce results for a comparison of actual and simulated resource utilisation.