preprocessing-data

There are 232 repositories under preprocessing-data topic.

vanderschaarlab/hyperimpute
A framework for prototyping and benchmarking imputation methods
Language:Python195 5 916
Unstructured-IO/community
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
29 23 298
imyjk729/Memristor
In-sensor reservoir computing for language learning via two-dimensional memristors
Language:Jupyter Notebook23 1 04
ELHoussineT/AutoDataCleaner
Simple and automatic data cleaning in one line of code! It performs one-hot encoding, date & time casting to datetime dtype, detects binary columns, safely convert non-numeric columns to numeric dtypes, cleaning dirty/empty values, normalizing values and removing unwanted columns all in one line of code. Get your data ready for model training and fitting quickly.
Language:Python20 2 04
dlite-tools/NLPiper
NLPiper is a package that agglomerates different NLP tools and applies their transformations in the target document.
Language:Python19 4 111
weiglszonja/meeg-tools
EEG/MEG data preprocessing and analyses framework
Language:Jupyter Notebook12 1 35
data-analyst-praktikum/Projects
Jupyter Notebook Praktikum Projects. This is repository with data analyst educational projects from Yandex.Praktikum.
Language:HTML10 0 08
cecivieira/cotas-genero-eleicoes-e-proposicoes-legislativas
Análise de dados sobre cotas de gênero e seu impacto nas eleições e proposições legislativas da Câmara dos Deputados Federais entre 1934 e 2021. Parte do TCC da pós-graduação em Inteligência Artificial e Aprendizado de Máquina na @pucminas
Language:Jupyter Notebook9 4 00
UniFeat/unifeat
An open-source tool for performing feature selection process in different areas of research
Language:Java9 4 23
tuanio/backend-recommender-system-book
Flask REST API for Recommender System Book App on Android
Language:Jupyter Notebook7 2 11
ArthurMangussi/pymdatagen
A Python Library for the Generation of Artificial Missing Data
Language:Python6 1 23
bharadwaj-chukkala/Data-driven-motion-planning-using-various-machine-learning-algorithms
ENPM808A: Introduction to Machine Learning Final Project
Language:Jupyter Notebook6 1 01
Brokttv/food101-preprocessing
A clean and modular pipeline for preprocessing the Food-101 dataset using both folder-based and CSV-based workflows.
Language:Python5
ChristianGoueguel/specProc
The specProc package is a collection of preprocessing tools for spectroscopy data analysis.
Language:R5 1 01
FaezehAbedi2023/Statistical-Analysis-in-Sensor-Data-Processing-with-Machine-Learning-Models
This project develops an activity recognition model for a mobile fitness app using statistical analysis and machine learning. By processing smartphone sensor data, it extracts features to train models that accurately recognize user activities.
Language:Jupyter Notebook5 1 00
subhadipsinha722133/Multiple-Disease-Prediction
🤖This is an interactive Streamlit web application that predicts the likelihood of multiple diseases(Diabetes Prediction, Heart Disease Prediction, Parkinson's Disease Prediction) using Machine Learning models.
Language:Jupyter Notebook5
courtois-neuromod/ds_prep
All the scripts to prepare the Courtois-Neuromod dataset
Language:Python4 6 94
msche81/2-Jedha_Fullstack
450h Data Scientist training - Collect and store large amounts of data - Build prediction models in Machine Learning and Deep Learning - Deploy your models in real conditions
Language:Jupyter Notebook4 1 00
RafiQamar/HR-Analytics-Project
Cleaned and processed HR data using Python for analysis and visualization. Analyzed employee trends and performance using SQL and Python. Built an interactive Power BI dashboard connected to MySQL for dynamic insights.
Language:Jupyter Notebook4 1 0
CCaribe9/AdaptStdEPF
Code and experiments related to the paper: 'An adaptive standardisation methodology for Day-Ahead electricity price forecasting'
Language:Jupyter Notebook3 1 00
damaniayesh/Cognifyz_Internship_Tasks
The project provides Four Tasks which is given by Cognifyz Technology.
Language:Jupyter Notebook3 1 00
drleniaw/Analysis_Sentiment_Twitter_Free_Sex_In_Indonesian
Analysis Sentiment on Twitter Free Sex In Indonesia
Language:Jupyter Notebook3 1 00
fezzibasma/Speed-Dating-Experiment
What attributes influence the selection of a romantic partner?
Language:Jupyter Notebook3 2 00
functorism/snapcrop
CLI for crop/resize of large amounts of images with configurable resolutions
Language:Rust3 2 00
kkmk11/BLIGHT-VISION
This is a ML based Web App that aims to detect the presence of late blight or early blight on potato leaves, which are the primary causes of crop damage. Additionally, the system recommends appropriate precautions and pesticides to help farmers eliminate the blight and protect their crops and increasing their yields.
Language:PureBasic3 1 00
Navaneeth-Sharma/Speech_Recognition_of_Digits
This project of recognizing digit and converting it to text uses Signal processing techniques such as MFCC and other Advanced Signal Processing techniques for the preprocessing of the data. Then the Preprocessed data is used by the Neural Network algorithms to learn the pattern or structure of the sound.
Language:Jupyter Notebook3 1 00
rifkyahmadsaputra/Hollywood-Movies-Visualizations-and-Recommender-System
In this project, I do some analysis, visualizations, and then create movie recommender system on imdb data. I do that because I want to know more about movies, especially Hollywood movies. Therefore, I do analysis and visualization on imdb data which is contain informations about movies, e.g. who is produced, when the movies release, rating movies, budget and income, etc. After that, I create movie recommender system, which is the system will recommend top 10 similar movies based on the movie that has been input by the user.
Language:Jupyter Notebook3 1 00
Shaheer-khan-github/Natural-Language-Processing-in-Python-DataCamp
Language:Jupyter Notebook3 1 00
XuanyiJennyMa/pupil_cloud_data_preprocessing_Phase_1
Scripts for pre-processing eye-tracker data from pupil cloud
Language:Python3 2 00
ALEXUSCR-27/Amazon-Books-Genre-Classifier
This classifier predicts the genre of books based on titles or descriptions using a Machine Learning model trained on an Amazon books dataset.
Language:Jupyter Notebook2 1 00
AlwaysDhruv/Images-Preprocessing
Hi their, My self Dhruv. So this repository are fully work on the images preprocessing.
Language:C++2
lawl2/object-detection-and-spatial-relation
Language:Python2 2 00
LuisFelipePoma/Machine_Learning
Learning about the algorithms used in machine learning, along with techniques for training and testing models.
Language:Jupyter Notebook2 1 0
Mohammed061/Transportation-and-logistics-Challenge
Analyzing logistics data to optimize shipment efficiency, reduce delays, and enhance supply chain visibility using Power BI. Insights include top routes, delays, supplier trends, and peak shipments.
Language:Jupyter Notebook2
nlqthinh/WeaviateAnime
Explore your favorite anime with this interactive search app! 🚀 This project leverages Weaviate for vector search and Gradio for a seamless user interface. Using embeddings from a custom anime dataset, you can perform quick and accurate similarity searches for anime titles
Language:Python2
RafiQamar/IMDb-Movie-Analysis
This project involves web scraping, data preprocessing, database storage and visualization of IMDb movie data from the last decade (2014-2024). The dataset includes details of 10,000 movies such as name, release year, genre, ratings, metascore and more. The project culminates in an interactive Power BI dashboard for in-depth insights and reporting.
Language:Jupyter Notebook2

preprocessing-data

vanderschaarlab/hyperimpute

Unstructured-IO/community

imyjk729/Memristor

ELHoussineT/AutoDataCleaner

dlite-tools/NLPiper

weiglszonja/meeg-tools

data-analyst-praktikum/Projects

cecivieira/cotas-genero-eleicoes-e-proposicoes-legislativas

UniFeat/unifeat

tuanio/backend-recommender-system-book

ArthurMangussi/pymdatagen

bharadwaj-chukkala/Data-driven-motion-planning-using-various-machine-learning-algorithms

Brokttv/food101-preprocessing

ChristianGoueguel/specProc

FaezehAbedi2023/Statistical-Analysis-in-Sensor-Data-Processing-with-Machine-Learning-Models

subhadipsinha722133/Multiple-Disease-Prediction

courtois-neuromod/ds_prep

msche81/2-Jedha_Fullstack

RafiQamar/HR-Analytics-Project

CCaribe9/AdaptStdEPF

damaniayesh/Cognifyz_Internship_Tasks

drleniaw/Analysis_Sentiment_Twitter_Free_Sex_In_Indonesian

fezzibasma/Speed-Dating-Experiment

functorism/snapcrop

kkmk11/BLIGHT-VISION

Navaneeth-Sharma/Speech_Recognition_of_Digits

rifkyahmadsaputra/Hollywood-Movies-Visualizations-and-Recommender-System

Shaheer-khan-github/Natural-Language-Processing-in-Python-DataCamp

XuanyiJennyMa/pupil_cloud_data_preprocessing_Phase_1

ALEXUSCR-27/Amazon-Books-Genre-Classifier

AlwaysDhruv/Images-Preprocessing

lawl2/object-detection-and-spatial-relation

LuisFelipePoma/Machine_Learning

Mohammed061/Transportation-and-logistics-Challenge

nlqthinh/WeaviateAnime

RafiQamar/IMDb-Movie-Analysis