Pinned Repositories
-Natural-Language-Processing
Natural Language Processing (NLP) is a branch of data science that consists of systematic processes for analyzing, understanding, and deriving information from the text data in a smart and efficient manner
Apriori-Algorithm-Machine-Learning
The Apriori algorithm is a classic algorithm used in data mining for learning association rules. It is designed to identify frequent itemsets in a transactional database and generate association rules from these itemsets
Data-Engineering-POC
This repository contains a beginner-level Data Engineering Proof of Concept (POC) project designed for practice. The objective is to provide hands-on experience with data engineering concepts, including data extraction, transformation, loading (ETL), and basic data analysis. This project is ideal for those looking to build foundational skills in da
Decision-Tree-Random-Forest-Algorithm-Data-Science-
This repository contains an implementation of decision tree and random forest algorithms from scratch in Python. Decision trees and random forests are popular machine learning algorithms used for classification and regression tasks. The goal of this project is to provide a clear and understandable implementation of these algorithms
False-Alarm-Detection
The data was first pre-processed and analysis libraries like Numpy and Pandas were used to make it ready to be utilized by a machine learning algorithm.Problems like standard scaling, categorical data and missing values were handled with appropriate techniques. Then, we used LogisticRegression model.
Healthcare-Data-Collection-and-Analysis
This repository contains a project focused on collecting healthcare data from the web, storing it in a structured format, and performing comprehensive analysis. The objective is to gather valuable health-related information, process and clean the data, and derive insights to support healthcare research and decision-making.
K-Means-Clustering-Data-Science-
K-means clustering is a popular unsupervised machine learning algorithm used for partitioning a dataset into a pre-defined number of clusters. The goal is to group similar data points together and discover underlying patterns or structures within the data.
Swiggy-Restaurant-Analysis
This repository contains data and analysis related to restaurants listed on Swiggy, one of India's largest online food ordering and delivery platforms. The objective is to explore restaurant trends, customer reviews, pricing strategies, and delivery metrics to gain insights into the food delivery industry.
web-scraping-forbes
The 22nd annual Global 2000 ranking of the world’s largest public companies.The Global 2000 ranks the largest companies in the world using four metrics: sales, profits, assets and market value. As a group, the companies on the 2023 list account for $51.7 trillion in sales, $4.5 trillion in profits, $238 trillion in assets and $88 trillion in market
World-Population-Analysis-1990-2023-
This repository contains data and analysis related to the world population from 1990 to 2023. The objective is to explore population trends, identify patterns, and visualize demographic changes across different countries and continents over the past few decades.
LJadhav25's Repositories
LJadhav25/-Natural-Language-Processing
Natural Language Processing (NLP) is a branch of data science that consists of systematic processes for analyzing, understanding, and deriving information from the text data in a smart and efficient manner
LJadhav25/Apriori-Algorithm-Machine-Learning
The Apriori algorithm is a classic algorithm used in data mining for learning association rules. It is designed to identify frequent itemsets in a transactional database and generate association rules from these itemsets
LJadhav25/Data-Engineering-POC
This repository contains a beginner-level Data Engineering Proof of Concept (POC) project designed for practice. The objective is to provide hands-on experience with data engineering concepts, including data extraction, transformation, loading (ETL), and basic data analysis. This project is ideal for those looking to build foundational skills in da
LJadhav25/Decision-Tree-Random-Forest-Algorithm-Data-Science-
This repository contains an implementation of decision tree and random forest algorithms from scratch in Python. Decision trees and random forests are popular machine learning algorithms used for classification and regression tasks. The goal of this project is to provide a clear and understandable implementation of these algorithms
LJadhav25/Django-Data-Analyzer
Django Data Analyzer is a web application built using the Django framework, designed to streamline data analysis tasks. Users can upload CSV files containing data for analysis. The application utilizes the powerful data manipulation capabilities of Python libraries like pandas and numpy to perform various analyses on the uploaded data.
LJadhav25/False-Alarm-Detection
The data was first pre-processed and analysis libraries like Numpy and Pandas were used to make it ready to be utilized by a machine learning algorithm.Problems like standard scaling, categorical data and missing values were handled with appropriate techniques. Then, we used LogisticRegression model.
LJadhav25/Healthcare-Data-Collection-and-Analysis
This repository contains a project focused on collecting healthcare data from the web, storing it in a structured format, and performing comprehensive analysis. The objective is to gather valuable health-related information, process and clean the data, and derive insights to support healthcare research and decision-making.
LJadhav25/K-Means-Clustering-Data-Science-
K-means clustering is a popular unsupervised machine learning algorithm used for partitioning a dataset into a pre-defined number of clusters. The goal is to group similar data points together and discover underlying patterns or structures within the data.
LJadhav25/Swiggy-Restaurant-Analysis
This repository contains data and analysis related to restaurants listed on Swiggy, one of India's largest online food ordering and delivery platforms. The objective is to explore restaurant trends, customer reviews, pricing strategies, and delivery metrics to gain insights into the food delivery industry.
LJadhav25/web-scraping-forbes
The 22nd annual Global 2000 ranking of the world’s largest public companies.The Global 2000 ranks the largest companies in the world using four metrics: sales, profits, assets and market value. As a group, the companies on the 2023 list account for $51.7 trillion in sales, $4.5 trillion in profits, $238 trillion in assets and $88 trillion in market
LJadhav25/World-Population-Analysis-1990-2023-
This repository contains data and analysis related to the world population from 1990 to 2023. The objective is to explore population trends, identify patterns, and visualize demographic changes across different countries and continents over the past few decades.
LJadhav25/Hangman-Game
This repository contains a simple yet fun implementation of the classic Hangman game using Python. The objective is to guess the hidden word by suggesting letters within a certain number of guesses. This project is designed to demonstrate basic programming concepts, game logic, and provide an engaging way to learn Python.
LJadhav25/KNN-Algorithm-Data-Science-
This repository contains a project demonstrating the implementation and application of the K-Nearest Neighbors (K-NN) algorithm in Data Science. The objective is to provide a comprehensive understanding of the K-NN algorithm, including data preprocessing, model training, evaluation, and visualization of results. This project is ideal for beginners
LJadhav25/Linear_regression_data_science
Linear regression analysis is used to predict the value of a variable based on the value of another variable. The variable you want to predict is called the dependent variable. The variable you are using to predict the other variable's value is called the independent variable.
LJadhav25/Logistic-Regression-Data-Science-
Logistic regression estimates the probability of an event occurring, such as voted or didn’t vote, based on a given data set of independent variables.
LJadhav25/lokesh.github.io
Portfolio Website
LJadhav25/portfolio-new
LJadhav25/Principle-Component-Analysis-PCA---Machine-Learning
Principal Component Analysis (PCA) is a statistical technique used in machine learning and data science for dimensionality reduction. The main goal of PCA is to reduce the number of variables in a dataset while preserving as much information as possible.
LJadhav25/Sales-Dashboard-with-Excel
This repository provides a comprehensive analysis of sales and customer data to offer insights into various aspects of sales performance, customer behavior, and regional distribution. The analysis includes evaluations of total sales revenue, product categories, top-selling products, customer sales, regional performance, and profit margins.
LJadhav25/Support-Vector-Machine-Data-Science-
Support Vector Machines (SVM) are supervised learning models used for classification and regression analysis. They are particularly effective in high-dimensional spaces and situations where the number of dimensions exceeds the number of samples. SVMs are also memory efficient as they use a subset of training points called support vectors.