Pinned Repositories
Artificial-Neural-Networks-for-Fraud-Detection-in-Supply-Chain-Analytics-MLPClassifier-and-Keras
In this study, we aimed to detect fraudulent activities in the supply chain through the use of neural networks. The study focused on building two machine learning models using the MLPClassifier algorithm from the scikit-learn library and a custom neural network using the Keras library in Python.
Chess-Queens--EDA-using-Plotly
Introduction About the Game - Chess is a two-player strategy board game played on a checkered board with 64 squares arranged in an 8×8 square grid. Governing Body - The International Chess Federation (FIDE) governs international chess competition. FIDE used Elo rating system for calculating the relative skill levels of players. Dataset Details - The dataset contains details of Top women chess players in the world sorted by their Standard FIDE rating (highest to lowest above 1800 Elo) as updated in August 2020. The data includes all active and inactive players which can be identified by the Inactive_flag column. Note: All ratings are updated as published by FIDE in August 2020.
climatrack-platform
ClimaTrack is a real-time weather monitoring and analytics platform.
Credit-EDA-Case-Study
This case study aims to give us an idea of applying EDA in a real business scenario. In this case study, we develop a basic understanding of risk analytics in banking and financial services and understand how data is used to minimise the risk of losing money while lending to customers.
Data-Cleaning-and-Preparation-of-Boston-Housing-Dataset---Python-Pandas
This project involves analysis of the Boston Housing Dataset using Python's Pandas library. Data cleaning is performed by dropping genuine outliers, resetting the index, and imputing missing values with the median of the columns. It is substituted with NaN for further analysis. The objective of this project is to clean and prepare the data
Data-Modelling-and-Dashboarding-with-IBM-Cognos-Analytics
Digital-Music-Store---Data-Analysis-using-SQL
Data Analysis project to help Chinook Digital Music Store to help how they can optimize their business opportunities and to help answering business related questions.
Exploring-The-Space-Missions--EDA-using-Plotly
Introduction This DataSet was scraped from https://nextspaceflight.com/launches/past/?page=1 and includes all the space missions since the beginning of Space Race (1957).
Flight-Delays-Prediction-Models-based-on-Na-ve-Bayes-Regression-Tree-and-Logistic-Regression-Algos
US Flight Delays Prediction Models based on Naïve Bayes, Regression Tree, and Logistic Regression Algorithms
subhanjandas
This is My Portfolio
subhanjandas's Repositories
subhanjandas/Artificial-Neural-Networks-for-Fraud-Detection-in-Supply-Chain-Analytics-MLPClassifier-and-Keras
In this study, we aimed to detect fraudulent activities in the supply chain through the use of neural networks. The study focused on building two machine learning models using the MLPClassifier algorithm from the scikit-learn library and a custom neural network using the Keras library in Python.
subhanjandas/subhanjandas
This is My Portfolio
subhanjandas/Chess-Queens--EDA-using-Plotly
Introduction About the Game - Chess is a two-player strategy board game played on a checkered board with 64 squares arranged in an 8×8 square grid. Governing Body - The International Chess Federation (FIDE) governs international chess competition. FIDE used Elo rating system for calculating the relative skill levels of players. Dataset Details - The dataset contains details of Top women chess players in the world sorted by their Standard FIDE rating (highest to lowest above 1800 Elo) as updated in August 2020. The data includes all active and inactive players which can be identified by the Inactive_flag column. Note: All ratings are updated as published by FIDE in August 2020.
subhanjandas/climatrack-platform
ClimaTrack is a real-time weather monitoring and analytics platform.
subhanjandas/Credit-EDA-Case-Study
This case study aims to give us an idea of applying EDA in a real business scenario. In this case study, we develop a basic understanding of risk analytics in banking and financial services and understand how data is used to minimise the risk of losing money while lending to customers.
subhanjandas/Data-Cleaning-and-Preparation-of-Boston-Housing-Dataset---Python-Pandas
This project involves analysis of the Boston Housing Dataset using Python's Pandas library. Data cleaning is performed by dropping genuine outliers, resetting the index, and imputing missing values with the median of the columns. It is substituted with NaN for further analysis. The objective of this project is to clean and prepare the data
subhanjandas/Data-Modelling-and-Dashboarding-with-IBM-Cognos-Analytics
subhanjandas/Digital-Music-Store---Data-Analysis-using-SQL
Data Analysis project to help Chinook Digital Music Store to help how they can optimize their business opportunities and to help answering business related questions.
subhanjandas/Exploring-The-Space-Missions--EDA-using-Plotly
Introduction This DataSet was scraped from https://nextspaceflight.com/launches/past/?page=1 and includes all the space missions since the beginning of Space Race (1957).
subhanjandas/Flight-Delays-Prediction-Models-based-on-Na-ve-Bayes-Regression-Tree-and-Logistic-Regression-Algos
US Flight Delays Prediction Models based on Naïve Bayes, Regression Tree, and Logistic Regression Algorithms
subhanjandas/IBM-SPSS---A-Comprehensive-Guide-to-Data-Analysis-and-Data-Modeling
This repository provides a comprehensive guide to IBM SPSS Modeler, including an overview of its key features and functionality.
subhanjandas/IMDb-Movie-Data-Visualisation
Python codes for IMDb movie data visualisation.
subhanjandas/MoneyBall-Sports-Predictive-Analytics-using-Excel
Maximized Expected Return to Risk Ratio by building a baseball team from a catalogue of 3,000+ with several complex constraints including salary
subhanjandas/Predicting-Housing-Prices-Using-Multiple-Linear-Regression-and-k-NearestNeighbours-kNN
The goal was to accurately estimate the value of real estate and uncover relevant factors that directly influence property prices. The study used two different models, multiple linear regression and k-Nearest Neighbours (kNN), to predict housing prices.
subhanjandas/Predictive-Modeling-using-SAS-Enterprise-Miner---Decision-Trees
subhanjandas/provenance-article-collection-and-workflow-management-system
Provenance is an application designed to collect and store articles from infoq.com using a Netflix Conductor-esque architecture. The application showcases a common architecture used for data collection and workflow management, demonstrating the integration of background workers, data gateways, and RESTful endpoints.
subhanjandas/RDBMS-to-GraphDB---Big-Data-Analytics-using-Neo4j
This project involves migration from a traditional RDBMS to Neo4j for big data analytics. Using graph database technology, various business-critical questions are addressed, including identifying the employees who sold Tofu, the products sold with Tofu, the total number of products, top 5 products by sales, and the category with the highest sales.
subhanjandas/simple-aged-cache
This project presents my implementation of a simple cache system that supports automatic expiration of entries.
subhanjandas/soccer-match-predictor-end-to-end
The Match Predictor is a web application that predicts the outcomes of soccer matches using various machine learning models. The backend is written in Python with Flask, and the frontend is built using TypeScript and React.
subhanjandas/Supermarket-Organic-Product-Purchase-Prediction---Data-Mining-and-Modeling-with-SAS
This project aims to predict which customers are likely to purchase the new line of organic products offered by a supermarket. The supermarket has a customer loyalty program and collected data through a coupon incentive program for organic products. The analysis was performed using SAS Enterprise Miner.
subhanjandas/User-Occupation-and-Movies-Ratings-Data-Exploration-using-Apache-Hive
In this project, the objective was to analyze the "User, Occupation, Movies, and Ratings" dataset using Apache Hive. The data was processed and analyzed using Hive's SQL-like query language and MapReduce framework, making it easier to handle large datasets. The focus of the analysis was to provide a comprehensive breakdown of the data
subhanjandas/Worldwide-Sales-Data-Analysis-and-Exploration-using-Zeppelin-HDFS-and-Spark
This project aimed to analyze and understand worldwide sales data through the use of Zeppelin and HDFS. The primary objective was to utilize Spark's basic Scala commands and SQL to query and manipulate the data, providing valuable insights and findings for the customer.