data-classification

There are 54 repositories under data-classification topic.

  • mthh/jenkspy

    Compute Natural Breaks in Python (Fisher-Jenks algorithm)

    Language:Python23272828
  • mockingbird

    openraven/mockingbird

    A toolset to test data classification engines that generates mock data in various file formats, sizes and data profiles.

    Language:Python43656
  • chgl16/data-mining-algorithm

    :bar_chart: 数据挖掘常用算法:关联分析Apriori算法,数据分类决策树算法,数据聚类K-means算法

    Language:Python25007
  • nightfallai/nightfall-python-sdk

    Python Data Loss Prevention (DLP) SDK - Nightfall Developer Platform

    Language:Python255113
  • xinglab-ai/genomap

    Cartography of Genomic Interactions Enables Deep Analysis of Single-Cell Expression Data (Nature Communications, 2023)

    Language:Python19313
  • exploropleth/binguru

    BinGuru is an open-source Typescript package to bin/classify data using 18 established binning methods, including a new method, resiliency.

    Language:TypeScript16
  • nightfallai/sensitive-data-scanner

    Scan directories, exports, and backups for sensitive data (like PII and API keys) with Nightfall's data loss prevention (DLP) APIs. Discover what lives at-rest in your data silos.

    Language:Python11302
  • AvaAvarai/VKD_Demo_Suite

    Visual Knowledge Discovery demo tools for interactively visualizing, exploring, and identifying complex n-D data patterns in multivariate CSV data, to visualize machine learning classifier models.

    Language:Python8101
  • debmalya92/telecom-churn-prediction

    Two differrent approach to predict Churn customers and finding out important variables that drives churn

    Language:Python8208
  • AvaAvarai/Dynamic_Coordinates_Vis_System

    Build visual machine learning models with multidimensional general line coordinate visualizations by interactive classification and synthetic data generation tools.

    Language:Python6102
  • bhataparnak/Neural-Network-small-projects

    Neural Network Deep learning specialization course offered via Coursera

    Language:Jupyter Notebook5104
  • MelvinMo/ROPAC-Rule-OPtimized-Aggregation-Classifier

    Discover ROPAC, a novel rule-based classifier we proposed. Here, you'll find the code, data, and original paper detailing this data classification algorithm.

    Language:Python4100
  • AvaAvarai/Java_Tabular_Vis_Toolkit

    Cross-platform tool for Computational Interactive Visual Learning using lossless General Line Coordinate data visualizations and human-in-the-loop guided classification by eight classifier algorithms to find, test, and boost robust machine learning models with a goal of high case to parameter ratio.

    Language:Java3101
  • gabfr/truck-data-wrangler

    ELT (Extract, Load, Transform) process of accelerometer/gyroscope events with Apache Spark (w/ Structured Streaming) and TimescaleDB

    Language:Jupyter Notebook3101
  • AvaAvarai/DataAnalysisNotebook

    This data analysis notebook demonstrates lossless, lossy visualizations techinques, and classification methods. We demonstrate analysis of scientific data on hot-swappable datasets.

    Language:Jupyter Notebook2100
  • AvaAvarai/ML_Classifier_Comparison_Tool

    Machine Learning classifier comparison GUI application. Choose 21 classifiers, evaluation data (optional for evaluation of synthetic data), hyperparameters, cross-validation splits, and rng seed; tabulates, and visualizes in Parallel Coordinates: best, worst, average, and standard deviation of Accuracy/F1/Recall.

    Language:Python2101
  • JeffWang0325/Microsoft-DAT275X-Principles-of-Machine-Learning-Python-Edition

    In this data science course, you will be given clear explanations of machine learning theory combined with practical scenarios and hands-on experience building, validating, and deploying machine learning models. You will learn how to build and derive insights from these models using Python, and Azure Notebooks.

    Language:Jupyter Notebook2201
  • lorival/image-classification-by-cifar-10

    This project classify images from the CIFAR-10 dataset. The dataset consists of airplanes, dogs, cats, and other objects.

    Language:Jupyter Notebook2101
  • mrseanryan/data-type-predictor

    Given the name of a property or attribute like 'BrandName' or 'AmountReceived', try to predict a data type like String, Boolean, Integer...

    Language:Python2240
  • slerpyyy/vscope

    Command Line Data Visualizer

    Language:C2101
  • ddolgushin/stats

    Классы для статистической обработки данных.

    Language:C#1100
  • debmalya92/credit-card-defaulter-prediction

    The model predicts for the next month credit card defaulter based on demographic and last six months behavioral data

    Language:Python1102
  • exploropleth/resiliency-app

    Resiliency is an ensemble binning method that considers how frequently a geographic entity (e.g., county) falls in a particular bin across multiple comparable data binning methods. This application helps users visualize and interact with the outputs of Resiliency on a variety of datasets.

    Language:TypeScript1
  • lucylow/ML_adversarial_images

    Old ML Project - Create adversarial images to fool a MNIST classifier using TensorFlow.

    Language:Jupyter Notebook1201
  • mthh/classif

    Library for one-dimensional data classification and simple statistics in Rust

    Language:Rust121
  • data-classification

    qeeqbox/data-classification

    Data classification defines and categorizes data according to its type, sensitivity, and value

  • Subhajit0210/Customer_Segmentation_using_Machine_Learning

    This project repository helps in segment mall customers, analyzing demographics and spending to identify groups for targeted marketing.

    Language:Jupyter Notebook1100
  • Tanguy9862/AI-Powered-FDA-Drug-Scraper

    Developed a Python-based web scraper leveraging generative AI with LangChain and GPT-4o-mini to extract and classify FDA drug approval data. Processed over 1,770 records, dynamically categorizing medications and treatment areas using LLMs to simplify complex medical information into actionable insights.

    Language:Python1100
  • gulcihanglmz/image-classification-data-split

    📂 Splits image datasets into training and testing sets for classification tasks. Useful for preparing data for machine learning models.

    Language:Python0100
  • MelvinMo/RUMC-RUle-Mutation-Classifier

    Explore RUMC, a new rule-based classifier inspired by evolutionary concepts. Access the code, data, and the preprint here.

    Language:Jupyter Notebook00
  • rubyyy1118/Machine_Learning_Optimization_Study

    The Learning From Data - Assignment in my MSc Business Analytics course

    Language:Jupyter Notebook0100
  • SamJoeSilvano/Password_Strength_Prediction_using_NLP

    Developed a predictive model to categorize passwords as Strong, Good, or Weak, enhancing security and reducing breach risks. The project involves cleaning and analyzing data from an SQL database, using the TF-IDF technique for transformation, and implementing a Logistic Regression model to achieve accurate classifications.

    Language:Jupyter Notebook0100
  • Adit-Mugdha-das/Planar-Data-Classification-with-One-Hidden-Layer

    This involves building a shallow neural network with one hidden layer to classify non-linearly separable 2D data using NumPy from scratch. This repository contains my implementation of the Week 3 programming assignment from the Deep Learning Specialization (Course 1: Neural Networks and Deep Learning) by Andrew Ng.

  • exploropleth/exploropleth

    Exploratory Analysis of Data Binning Methods in Choropleth Maps

    Language:TypeScript
  • natgluons/contextual-RAG

    A collection of notebooks/recipes showcasing some fun and effective ways of using Claude.

    Language:Jupyter Notebook