A curated list of resources dedicated to Feature Engineering Techniques for Machine Learning
Maintainers - Andrei Khobnia
This page is licensed under Creative Commons Attribution-Noncommercial-ShareAlike 3.0 Unported License
Please feel free to create pull requests.
- scipy.stats.boxcox
np.log (x + const)
- Featuretools
- sklearn.preprocessing.PolynomialFeatures
- Divisions
- Other interactions
- Bag-of-words model
- A Gentle Introduction to the Bag-of-Words Model
- sklearn.feature_extraction.text.CountVectorizer
- sklearn.feature_extraction.DictVectorizer
- sklearn.feature_extraction.FeatureHasher
- Word embedding
- GloVe: Global Vectors for Word Representation
- Gensim: models.word2vec – Word2vec embeddings
- fastText
- Word2Vec and FastText Word Embedding with Gensim
- Do Pretrained Embeddings Give You The Extra Edge?
- ClearTK - Feature Extraction Tutorial
- Regular Expressions
- Part-of-Speech_Tagging
- NLTK Categorizing and Tagging Words
- How to use PoS features in scikit learn classfiers
- Feature extraction and similar image search with OpenCV for newbies
- OpenCV -- Feature Detection and Description
- SimpleCV.Features package
- Scikit-image feature module
- Keras pre-trained models feature extraction
- Using Keras’ Pre-trained Models for Feature Extraction in Image Clustering
- Why One-Hot Encode Data in Machine Learning?
- How to One Hot Encode Sequence Data in Python
- sklearn.preprocessing.OneHotEncoder
- Keras - to_categorical
- Likelihood encoding of categorical features
- Python target encoding for categorical features
- Adding variance column when mean encoding
- Feature Hashing on Wikipedia
- Feature hashing and Extraction in VowpalWabbit
- Feature hashing in scikit-learn