stratified-sampling
There are 43 repositories under stratified-sampling topic.
MasashiSode/mcs_kfold
mcs_kfold stands for "monte carlo stratified k fold". This library attempts to achieve equal distribution of discrete/categorical variables in all folds. The greatest advantage of this method is that it can be applied to multi-dimensional targets.
aktgpt/onlinetripletmining
Fast Online Triplet mining in Pytorch
MsTao-68/Debt-Churn-Data-Analysis
使用比赛方提供的脱敏数据,进行客户信贷流失预测。
saeedt/CFS_Sampling
An optimal stratified sample design for Commodity Flow Survey (CFS) based on Simulated Annealing and Genetic Algorithm. A script in Procedural PostgreSQL is used to generate a frame with 100,000 records based on publicly available data.
rochitasundar/Twitter-Sentiment-Analysis
Data consists of tweets scrapped using Twitter API. Objective is sentiment labelling using a lexicon approach, performing text pre-processing (such as language detection, tokenisation, normalisation, vectorisation), building pipelines for text classification models for sentiment analysis, followed by explainability of the final classifier
StarlangSoftware/Sampling-Py
Data sampling library
dataditya/US-Airlines-Delay-Analysis-2023
The objective is to analyze flight delays in the United States. Data from airlines, airports, and runways will be collected and processed. Machine learning models will be built using logistic regression, decision trees, and XGB classifiers. Visualizations will be created in Tableau, and Excel dashboards and SQL queries will be used for analysis.
saminens/Women-in-Data-Science-2020
WiDS Datathon 2020 on patient health through data from MIT’s GOSSIS (Global Open Source Severity of Illness Score) initiative.
StarlangSoftware/Sampling
Data sampling library
StarlangSoftware/Sampling-CPP
Data sampling library
anthonyli01/Advanced-Simulation-Methods
This project focuses on applying advanced simulation methods for derivatives pricing. It includes Monte-Carlo, Variance Reduction Techniques, Distribution Sampling Methods, Euler Schemes, and Milstein Schemes.
jesussantana/Sampling
Perform Data Sampling with Python
kristoffhernan/ProfessorSurvey
Web scraper to get professor information, and a mass emailer that sends a website with a survey.
langthom/sirasac
A C library with Python bindings for efficient stratified random sampling from binary buffers or files.
Lefteris-Souflas/Business-Analytics-Case-Studies
Three business analytics case studies were undertaken, encompassing market basket analysis, customer segmentation, and campaign management. SAS Visual Data Mining and Machine Learning on SAS Viya was utilized to explore data and provide insights. A comprehensive report addressing both technical and business aspects was delivered.
Nikhilkohli1/Natural-Language-Processing
This repository contains Natural Language Processing Projects like Sarcasm Detection, Quora Insincere Questions Classification & Edgar Sentiment Analysis
shreyasbhatia09/Google-Analytics-Customer-Revenue-Prediction
Kaggle Challenge
anthonyli01/R-Derivatives-Pricing
University Project: simulation techniques to price derivatives. It will involve Monte-Carlo, variance-reduction techniques, and advanced simulation methods.
Crossed-finger/Credit-risk-analysis
CSCI316 Group assignment 1
david-garza/Credit_Risk_Analysis
Supervised machine learning model to classify loan applicants into high and low risk categories
imane-ayouni/California-Housing-Price-Predictions
Regression algorithms to predict the median house prices in California districts
m-guseva/balanced-group-assignment
This code assigns participants to an experimental group and ensures balanced physical attributes without knowing the participants in advance.
OMahmoodi/imbalanced_data
This notebook will walk you through the steps for dealing with an imbalanced dataset using an example of a real project that I recently completed.
rrfsantos/Projeto-Redes-Neurais-OCT-Images
BI Master - Automated methods to detect and classify human diseases from medical images. Convolutional Neural Network, Data Augmentation, Transfer Learning, Tensorflow, Keras, Xception, ImageNet, StratifiedKFold.
shivtosh/Stroke_prediction
Models implemented for stroke prediction amongst individuals
zca21/Statistical_Consultancy
Code to help the presentation to the client. The Sampling code.Rmd file contains the code performing the sampling method and produces the visualisations and diagnostics seen in the presentation.
aarushijain-24/Sampling-Methods
Credit card fraud detection using various sampling methods and machine learning algorithms.
arbasher/straSplit
Stratification of multi-label datasets
hunaiva-kintan/Sampling-Technique
This was a project that aimed implement sampling techniques. The sampling technique used was clustered random sampling and stratified random sampling.
joao-vitor-souza/prever-aluguel
(77,86% R) Floresta Aleatória aprimorada para a previsão de aluguel.
MatthewFound/ML-algo-collection
Repository contains from-scratch python functions for machine learning ranging from preprocessing to full classifier objects
pagoma3/Sampling
Sprint 6, Task 1
RimTouny/Image-Classification-using-Chars74K-dataset
Employing advanced techniques, the project seamlessly integrates binary and multiclass classifiers for character classification. It offers a comprehensive analysis and adeptly addresses challenges in the realm of computer vision.This project was part of my uOttawa Master's in Computer Vision course (2023).
StarlangSoftware/Sampling-Js
Data Sampling Library