data-classification
There are 54 repositories under data-classification topic.
mthh/jenkspy
Compute Natural Breaks in Python (Fisher-Jenks algorithm)
openraven/mockingbird
A toolset to test data classification engines that generates mock data in various file formats, sizes and data profiles.
chgl16/data-mining-algorithm
:bar_chart: 数据挖掘常用算法:关联分析Apriori算法,数据分类决策树算法,数据聚类K-means算法
nightfallai/nightfall-python-sdk
Python Data Loss Prevention (DLP) SDK - Nightfall Developer Platform
xinglab-ai/genomap
Cartography of Genomic Interactions Enables Deep Analysis of Single-Cell Expression Data (Nature Communications, 2023)
exploropleth/binguru
BinGuru is an open-source Typescript package to bin/classify data using 18 established binning methods, including a new method, resiliency.
nightfallai/sensitive-data-scanner
Scan directories, exports, and backups for sensitive data (like PII and API keys) with Nightfall's data loss prevention (DLP) APIs. Discover what lives at-rest in your data silos.
AvaAvarai/VKD_Demo_Suite
Visual Knowledge Discovery demo tools for interactively visualizing, exploring, and identifying complex n-D data patterns in multivariate CSV data, to visualize machine learning classifier models.
debmalya92/telecom-churn-prediction
Two differrent approach to predict Churn customers and finding out important variables that drives churn
AvaAvarai/Dynamic_Coordinates_Vis_System
Build visual machine learning models with multidimensional general line coordinate visualizations by interactive classification and synthetic data generation tools.
bhataparnak/Neural-Network-small-projects
Neural Network Deep learning specialization course offered via Coursera
MelvinMo/ROPAC-Rule-OPtimized-Aggregation-Classifier
Discover ROPAC, a novel rule-based classifier we proposed. Here, you'll find the code, data, and original paper detailing this data classification algorithm.
AvaAvarai/Java_Tabular_Vis_Toolkit
Cross-platform tool for Computational Interactive Visual Learning using lossless General Line Coordinate data visualizations and human-in-the-loop guided classification by eight classifier algorithms to find, test, and boost robust machine learning models with a goal of high case to parameter ratio.
gabfr/truck-data-wrangler
ELT (Extract, Load, Transform) process of accelerometer/gyroscope events with Apache Spark (w/ Structured Streaming) and TimescaleDB
AvaAvarai/DataAnalysisNotebook
This data analysis notebook demonstrates lossless, lossy visualizations techinques, and classification methods. We demonstrate analysis of scientific data on hot-swappable datasets.
AvaAvarai/ML_Classifier_Comparison_Tool
Machine Learning classifier comparison GUI application. Choose 21 classifiers, evaluation data (optional for evaluation of synthetic data), hyperparameters, cross-validation splits, and rng seed; tabulates, and visualizes in Parallel Coordinates: best, worst, average, and standard deviation of Accuracy/F1/Recall.
JeffWang0325/Microsoft-DAT275X-Principles-of-Machine-Learning-Python-Edition
In this data science course, you will be given clear explanations of machine learning theory combined with practical scenarios and hands-on experience building, validating, and deploying machine learning models. You will learn how to build and derive insights from these models using Python, and Azure Notebooks.
lorival/image-classification-by-cifar-10
This project classify images from the CIFAR-10 dataset. The dataset consists of airplanes, dogs, cats, and other objects.
mrseanryan/data-type-predictor
Given the name of a property or attribute like 'BrandName' or 'AmountReceived', try to predict a data type like String, Boolean, Integer...
sanjaalcorps/NepaliDataClassifiers
Nepali Data Classifiers
slerpyyy/vscope
Command Line Data Visualizer
ddolgushin/stats
Классы для статистической обработки данных.
debmalya92/credit-card-defaulter-prediction
The model predicts for the next month credit card defaulter based on demographic and last six months behavioral data
exploropleth/resiliency-app
Resiliency is an ensemble binning method that considers how frequently a geographic entity (e.g., county) falls in a particular bin across multiple comparable data binning methods. This application helps users visualize and interact with the outputs of Resiliency on a variety of datasets.
lucylow/ML_adversarial_images
Old ML Project - Create adversarial images to fool a MNIST classifier using TensorFlow.
mthh/classif
Library for one-dimensional data classification and simple statistics in Rust
qeeqbox/data-classification
Data classification defines and categorizes data according to its type, sensitivity, and value
Subhajit0210/Customer_Segmentation_using_Machine_Learning
This project repository helps in segment mall customers, analyzing demographics and spending to identify groups for targeted marketing.
Tanguy9862/AI-Powered-FDA-Drug-Scraper
Developed a Python-based web scraper leveraging generative AI with LangChain and GPT-4o-mini to extract and classify FDA drug approval data. Processed over 1,770 records, dynamically categorizing medications and treatment areas using LLMs to simplify complex medical information into actionable insights.
gulcihanglmz/image-classification-data-split
📂 Splits image datasets into training and testing sets for classification tasks. Useful for preparing data for machine learning models.
MelvinMo/RUMC-RUle-Mutation-Classifier
Explore RUMC, a new rule-based classifier inspired by evolutionary concepts. Access the code, data, and the preprint here.
rubyyy1118/Machine_Learning_Optimization_Study
The Learning From Data - Assignment in my MSc Business Analytics course
SamJoeSilvano/Password_Strength_Prediction_using_NLP
Developed a predictive model to categorize passwords as Strong, Good, or Weak, enhancing security and reducing breach risks. The project involves cleaning and analyzing data from an SQL database, using the TF-IDF technique for transformation, and implementing a Logistic Regression model to achieve accurate classifications.
Adit-Mugdha-das/Planar-Data-Classification-with-One-Hidden-Layer
This involves building a shallow neural network with one hidden layer to classify non-linearly separable 2D data using NumPy from scratch. This repository contains my implementation of the Week 3 programming assignment from the Deep Learning Specialization (Course 1: Neural Networks and Deep Learning) by Andrew Ng.
exploropleth/exploropleth
Exploratory Analysis of Data Binning Methods in Choropleth Maps
natgluons/contextual-RAG
A collection of notebooks/recipes showcasing some fun and effective ways of using Claude.