amitkedia007
A data enthusiast at heart, I am a Data Scientist with a knack for uncovering insights from a sea of data. My joy lies in working in Python and R.
Brunel University LondonLondon, United Kingdom
Pinned Repositories
Analysis-of-AirBnB-data-Hadoop-Mapreduce
This repo explains the implementation of Map-Reduce Algorithm on the AirBnb data to understand the consumer satisfaction region and country wise. This is the effective use of parallel distributed computing to resolve the big data problems
Australian-Accident-Tableau-Analysis
This project presents a Tableau dashboard built on Australian road crash data. From research question formulation to final implementation, it provides insights into improving road safety. The journey from prototype to the final dashboard, and the learning experience, is shared.
Blueberry-Yield-Prediction
This project is an end-to-end machine learning solution for predicting blueberry yield based on various environmental and biological factors. Using Python and Flask for the back-end and Bootstrap for the front-end, it incorporates data ingestion, transformation, model training, and prediction stages. The prediction model is powered by CatBoost Algo
Cat-Breed-Data-Analysis
This repository contains a comprehensive analysis of a dataset about cat breeds. The project was part of the CS5802 module, a course under the Computer Science department of the College of Engineering, Design & Physical Sciences (CEDPS), Brunel University London.
Consumer-Behavior-Analysis-Using-DBT-PostgreSQL
Financial-Fraud-Detection-Using-LLMs
The aim of this dissertation is to assess the effectiveness of LLMs such as FinBERT and GPT-2 in detecting fraudulent activities in financial reports and statements. This repo provides the code for implementing LLMs, traditional machine learning and deep learning models on the labelled dataset
Online-News-Popularity-Prediction
This project focuses on predicting the popularity of online news articles based on a variety of features such as the article's title length, the number of images, the number of videos, and more. The dataset used in this project is derived from the UCI Machine Learning Repository's Online News Popularity dataset.
School_ERP_System
Walmart-Sales-Tableau-Dashboard
This repository contains the Tableau Dashboard of Walmart sales data. It also contains the detailed analysis and usage of the dashboard.
Whatsapp-chat-analyzer
This project is a social media chat analyzer built with Python and Streamlit. The application provides various analyses on a chat log, including top statistics, activity timelines, activity maps, word cloud, most common words, emoji analysis, and sentiment analysis. The analysis can be done for a specific user or for the overall chat.
amitkedia007's Repositories
amitkedia007/Financial-Fraud-Detection-Using-LLMs
The aim of this dissertation is to assess the effectiveness of LLMs such as FinBERT and GPT-2 in detecting fraudulent activities in financial reports and statements. This repo provides the code for implementing LLMs, traditional machine learning and deep learning models on the labelled dataset
amitkedia007/Whatsapp-chat-analyzer
This project is a social media chat analyzer built with Python and Streamlit. The application provides various analyses on a chat log, including top statistics, activity timelines, activity maps, word cloud, most common words, emoji analysis, and sentiment analysis. The analysis can be done for a specific user or for the overall chat.
amitkedia007/Blueberry-Yield-Prediction
This project is an end-to-end machine learning solution for predicting blueberry yield based on various environmental and biological factors. Using Python and Flask for the back-end and Bootstrap for the front-end, it incorporates data ingestion, transformation, model training, and prediction stages. The prediction model is powered by CatBoost Algo
amitkedia007/Analysis-of-AirBnB-data-Hadoop-Mapreduce
This repo explains the implementation of Map-Reduce Algorithm on the AirBnb data to understand the consumer satisfaction region and country wise. This is the effective use of parallel distributed computing to resolve the big data problems
amitkedia007/Australian-Accident-Tableau-Analysis
This project presents a Tableau dashboard built on Australian road crash data. From research question formulation to final implementation, it provides insights into improving road safety. The journey from prototype to the final dashboard, and the learning experience, is shared.
amitkedia007/Cat-Breed-Data-Analysis
This repository contains a comprehensive analysis of a dataset about cat breeds. The project was part of the CS5802 module, a course under the Computer Science department of the College of Engineering, Design & Physical Sciences (CEDPS), Brunel University London.
amitkedia007/Consumer-Behavior-Analysis-Using-DBT-PostgreSQL
amitkedia007/Online-News-Popularity-Prediction
This project focuses on predicting the popularity of online news articles based on a variety of features such as the article's title length, the number of images, the number of videos, and more. The dataset used in this project is derived from the UCI Machine Learning Repository's Online News Popularity dataset.
amitkedia007/School_ERP_System
amitkedia007/Walmart-Sales-Tableau-Dashboard
This repository contains the Tableau Dashboard of Walmart sales data. It also contains the detailed analysis and usage of the dashboard.
amitkedia007/YoutubeChannelAnalysis
All the Data Analysis related projects are uploaded here.
amitkedia007/AI-Ethics-Research-Healthcare
amitkedia007/amitkedia007
amitkedia007/APTAttribution
Code for Benchmarking two ML Approaches performing Authorship Attribution
amitkedia007/APTMalware
APT Malware Dataset Containing over 3,500 State-Sponsored Malware Samples
amitkedia007/Credit-Card-Analysis-Using-Map-Reduce
amitkedia007/FreeCodeCamp-Pandas-Real-Life-Example
amitkedia007/github-slideshow
A robot powered training repository :robot:
amitkedia007/House-Price-Prediction-Analysis
This repository contains the materials and codes for the course project. The main goal of this project is to clean, analyze, and model a dataset of housing prices. The analysis is done in R and the codes are presented in an R Notebook..
amitkedia007/LEARNING_TENSORFLOW
amitkedia007/Predictive-Analysis-of-Student-Outcomes-in-Higher-Education
This project is a comprehensive data analysis endeavor aimed at uncovering the key factors influencing student dropout and completion rates in higher education. Using a blend of Python and R, the project delves into the complexities of educational data, offering insights into student success and retention.
amitkedia007/Text-Summerizer-using-NLP
amitkedia007/TitanicPredictionModelling
amitkedia007/Tkinter-ML-Prediction-App