Anas1108
Data Scientist with a Strong Foundation in Machine learning, Deep learning, Data Analysis, Building predictive models, Flask/Django Front-end Dev.
National University OF Computer And Emerging Sciences G11/3 Islamabad
Pinned Repositories
Amazon-Product-Scraper
This project implements an automated bot (Autobot) using Python and Selenium to scrape product data from Amazon. It extracts details such as product title, price, image URL, and reviews, storing the data in JSON files. Additionally, it provides a FastAPI-based web API for accessing the scraped product data.
CandyCrush_Game
A C++ implementation of the popular Candy Crush game using object-oriented programming & SFML library for graphics. Matches candies to score, swap adjacent candies. Game ends when score reaches a certain level or out of moves. User input via mouse clicks. No saved progress.
Daraz-Product-Sentiment-Analysis
This project aims to conduct sentiment analysis on product reviews collected from Daraz website. The user can choose the desired product category to scrape data from. The collected data will be cleaned and preprocessed, and then the sentiment of the reviews will be analyzed
Exploratory-Data-Analysis-using-Pandas-and-Matplotlib
In this project, we will perform Exploratory Data Analysis (EDA) on three datasets, `ufo`, `u.user` and `movies`. We will use the Python library Pandas for data cleaning, transforming, and manipulation and Matplotlib for data visualization.
Genetic_Algorithm_for_Feature_Selection
Implements a genetic algorithm to select the most impactful features in a dataset to improve classifier performance. Written in Jupyter Notebook using pandas, numpy, scikit-learn. Results displayed with accuracy, precision, recall, F1 score comparison to using all features.
Heart_Attack_Prediction
Classify Heart Attack dataset using 3+ ML models and perform Exploratory Data Analysis for insights. Preprocess data, apply majority voting for final prediction, aim for accuracy & F-score over 65%. Use Numpy, Pandas, Sklearn, Matplotlib. Final report & insights on methodology & results expected. Run code in Jupyter Notebook.
Particle-_Swarm_Optimization-PSO-_for_Feature_Selection
PSO feature selection improves classifier performance. Implemented in Jupyter Notebook with pandas, numpy, scikit-learn. PSO done from scratch. Results compared using accuracy, precision, recall, F1 score. Improves results compared to using all features. Can be applied to various classification problems.
Poetry-Generation-in-Roman-Urdu
This project aims to generate poetry in Roman Urdu using a dataset of poems by famous Urdu poets such as Allam Iqbal and Ghalib.
Predicting-Mortality-and-Cardiovascular-Diseases
A dataset containing 12 features for predicting mortality by heart failure caused by Cardiovascular Diseases (CVDs) is analyzed. The dataset includes demographic information such as gender, age, and presence of risk factors like diabetes, anemia, high blood pressure, and smoking habits.
Scrap-Millions-of-Emails
This project is focused on scraping millions of emails dynamically from thousands of web pages automatically from the website [fredmiranda.com](https://fredmiranda.com/). The goal of this project is to create a dataset of email addresses that can be used for various purposes.
Anas1108's Repositories
Anas1108/Amazon-Product-Scraper
This project implements an automated bot (Autobot) using Python and Selenium to scrape product data from Amazon. It extracts details such as product title, price, image URL, and reviews, storing the data in JSON files. Additionally, it provides a FastAPI-based web API for accessing the scraped product data.
Anas1108/Daraz-Product-Sentiment-Analysis
This project aims to conduct sentiment analysis on product reviews collected from Daraz website. The user can choose the desired product category to scrape data from. The collected data will be cleaned and preprocessed, and then the sentiment of the reviews will be analyzed
Anas1108/Poetry-Generation-in-Roman-Urdu
This project aims to generate poetry in Roman Urdu using a dataset of poems by famous Urdu poets such as Allam Iqbal and Ghalib.
Anas1108/Road-Condition-Detection-using-Accelerometer-and-Gyro-Sensors
A research project on road condition detection using accelerometer and gyro sensors. The goal is to determine the accuracy of detecting road conditions by augmenting accelerometer readings with gyro sensor data.
Anas1108/Scraping-Google-Scholar-Data-of-University-Professors
This project is aimed at scraping the data of university professors from Google Scholar, including their citation count, h-index, and other relevant information. The scraped data will be stored in a CSV file for further analysis.
Anas1108/Transliteration-RomantoUrdu-And-ViceVersa
This project aims to develop a program that can perform transliteration between Roman and Urdu scripts using the `urduhack` library. Transliteration is the process of converting text from one script to another while preserving the pronunciation of the words.
Anas1108/Auto-Messaging-on-Facebook
This project is aimed at automating the process of sending messages on Facebook using the Python Selenium library. It can be used to send messages to friends or anyone on Facebook.
Anas1108/Auto-Messaging-on-Whatsapp
This project aims to automate the process of messaging on Whatsapp. Using Selenium in Python, the script will open Whatsapp Web, search for the desired contact and send the pre-defined message.
Anas1108/Exploratory-Data-Analysis-using-Pandas-and-Matplotlib
In this project, we will perform Exploratory Data Analysis (EDA) on three datasets, `ufo`, `u.user` and `movies`. We will use the Python library Pandas for data cleaning, transforming, and manipulation and Matplotlib for data visualization.
Anas1108/Predicting-Mortality-and-Cardiovascular-Diseases
A dataset containing 12 features for predicting mortality by heart failure caused by Cardiovascular Diseases (CVDs) is analyzed. The dataset includes demographic information such as gender, age, and presence of risk factors like diabetes, anemia, high blood pressure, and smoking habits.
Anas1108/Scrap-Millions-of-Emails
This project is focused on scraping millions of emails dynamically from thousands of web pages automatically from the website [fredmiranda.com](https://fredmiranda.com/). The goal of this project is to create a dataset of email addresses that can be used for various purposes.
Anas1108/Auto-Searching-on-YouTube
This project uses selenium to automatically search for videos on YouTube. It is implemented from scratch in python and is executed in a jupyter notebook (.ipynb) file.
Anas1108/Circle-Overlap-Detection-in-Animation-Video
A project that plays an animation video and pauses the video where the center of a green circle and a blue circle exactly overlap (or very close to each other).
Anas1108/Exploratory-Data-Analysis-with-Pandas
This project is focused on performing Exploratory Data Analysis (EDA) on a dataset using Pandas. The goal of EDA is to get insights about the data, identify patterns and relationships, and prepare the data for further analysis or modeling.
Anas1108/Finding-the-Wave-with-Most-Frequencies
This repository contains an implementation of finding the wave with the highest number of frequencies (peaks) from an image, and then drawing a bounding rectangle around it.
Anas1108/Grammy-Award-Analysis
This project aims to analyze the Grammy award data to understand the trends and patterns in the award distribution among young and old artists. The objective is to determine if the claim that judges are inclined to give the Grammy to old artists is true or false.
Anas1108/Hand-Boundary-Detection-with-OpenCV
This repository contains a script that utilizes OpenCV to find the boundary of a hand in an image and display it on the original image.
Anas1108/Hotel-Management-System
A website frontend created using HTML, CSS, and JavaScript for hotel management purposes.
Anas1108/house-price-prediction
Predicting house prices using Linear Regression and GBR
Anas1108/Image-Processing-Finding-Shape-Corners-and-Area
This repository contains a solution to find the corners of an object in an image using the Sobel Edge Detector and Thresholding, and then use these corners to calculate the area of the object.
Anas1108/Image-Processing-Overlapping-Area-between-Two-Objects
This repository contains an implementation of finding the overlapping area between two objects using OpenCV.
Anas1108/Internet-Usage-in-Denmark-and-Belarus-Analysis
This project aims to compare the adoption of the internet in Denmark and Belarus and determine if income level has an impact on the speed of adoption. The data used for this analysis is from the World Bank Data (1990-present) and is stored in the file "WorldBankData.csv".
Anas1108/Latin-Word-Scrapper
A Python program to scrape Latin words and related information from the Dickinson College Commentaries website using BeautifulSoup, Requests, and other tools.
Anas1108/MlopsA2
Anas1108/news-paper
Anas1108/Payroll-Analysis-in-Los-Angeles-City
Los Angeles City employee payroll data analyzed using hypothesis testing to answer 5 questions on bonus pay, department, overtime and health cost comparison. Results based on examination of provided dataset with empty fields handled.
Anas1108/Scraping-Top-50-Psychology-Universities-Faculty-Data
The purpose of this project is to scrape faculty data (such as name, qualification, etc.) of the top 50 psychology universities and save it in a CSV file. This project uses Beautiful Soup and the requests library in Python to accomplish this task.
Anas1108/Spell-Correction-for-Roman-Urdu-using-Noisy-Channel-Model
This project develops an effective spell correction system for Roman Urdu using the Noisy Channel model. 4 components: language model, error model, candidate generation, and selection model. Suggests the most likely correction for a given incorrect word using probabilistic approach.
Anas1108/Traffic-Analysis
This project aims to develop a program that can accurately count the number of vehicles moving on a road in a given video and classify the traffic as high, low, or medium based on the count.
Anas1108/Zebra-Crossing-Detection-and-Counting
A project locates the zebra crossing in an image and draws a bounding rectangle around it while counting the number of white strips.