Pinned Repositories
course_python_introduction
Python course setup together with Joris Vandenbossche
data_extraction
Scripts for downloading data on csv format
datasets
Download and denormalization scripts for dirtycat datasets.
ecml-pkdd-2018
Scripts for ECML PKDD 2018 article: Similarity encoding for learning with dirty categorical variables
gamma_poisson_factorization
Implementation of an online Gamma Poisson matrix factorization
kdd2018
Generalizing categorical encodings: an n-gram kernel for morphological variants
scikit-learn
scikit-learn: machine learning in Python
string_categorical_encoders
Scripts for paper "Encoding high-cardinality string categorical variables"
pcerda's Repositories
pcerda/string_categorical_encoders
Scripts for paper "Encoding high-cardinality string categorical variables"
pcerda/ecml-pkdd-2018
Scripts for ECML PKDD 2018 article: Similarity encoding for learning with dirty categorical variables
pcerda/kdd2018
Generalizing categorical encodings: an n-gram kernel for morphological variants
pcerda/gamma_poisson_factorization
Implementation of an online Gamma Poisson matrix factorization
pcerda/scikit-learn
scikit-learn: machine learning in Python
pcerda/data_extraction
Scripts for downloading data on csv format
pcerda/datasets
Download and denormalization scripts for dirtycat datasets.
pcerda/course_python_introduction
Python course setup together with Joris Vandenbossche
pcerda/data
Data and code behind the articles and graphics at FiveThirtyEight
pcerda/datacleaning-benchmark
pcerda/dirty_cat
Encoding methods for dirty categorical variables
pcerda/ds3_kernel_testing
Material for the practical of the DS3 course on "Representing and comparing probabilities with kernels"
pcerda/faiss
A library for efficient similarity search and clustering of dense vectors.
pcerda/scikit-learn-extra
scikit-learn contrib estimators