feature-engineering

There are 3,909 repositories under the feature-engineering topic.

  • microsoft/nni

    An open-source AutoML toolkit for automating the machine learning lifecycle, including feature engineering, neural architecture search, model compression, and hyperparameter tuning.

    Language: Python, 14.3k stars
  • EpistasisLab/tpot

    A Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.

    Language: Jupyter Notebook, 10k stars
  • alteryx/featuretools

    An open-source Python library for automated feature engineering.

    Language: Python, 7.5k stars
  • alibaba/Alink

    Alink is a machine learning algorithm platform based on Flink, developed by the PAI team of the Alibaba computing platform.

    Language: Java, 3.6k stars
  • mljar/mljar-supervised

    Python package for AutoML on Tabular Data with Feature Engineering, Hyper-Parameters Tuning, Explanations and Automatic Documentation

    Language: Python, 3.2k stars
  • apachecn/fe4ml-zh

    📖 [Translation] Feature Engineering for Machine Learning

    Language: JavaScript, 2.6k stars
  • Visualize-ML/Book6_First-Course-in-Data-Science

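    Book 6, "数据有道" (First Course in Data Science) | Iris Book series (鸢尾花书): from basic arithmetic to machine learning. Criticism and corrections are welcome! Readers who report many errors will receive a free copy as thanks.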

    Language: Jupyter Notebook, 2.4k stars
  • salesforce/TransmogrifAI

    TransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library for building modular, reusable, strongly typed machine learning workflows on Apache Spark with minimal hand-tuning

    Language: Scala, 2.3k stars
  • apache/hamilton

    Apache Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows that encode lineage/tracing and metadata. Runs and scales everywhere Python does.

    Language: Jupyter Notebook, 2.3k stars
  • metarank/metarank

    A low-code machine learning personalized ranking service for articles, listings, search results, and recommendations that boosts user engagement. A friendly learn-to-rank engine.

    Language: Scala, 2.2k stars
  • rorysroes/SGX-Full-OrderBook-Tick-Data-Trading-Strategy

    Provides solutions for high-frequency trading (HFT) strategies using data science approaches (machine learning) on full order book tick data.

    Language: Jupyter Notebook, 2.1k stars
  • feature-engine/feature_engine

    Feature engineering package with sklearn-like functionality.

    Language: Python, 2.1k stars
  • featureform/featureform

    The Virtual Feature Store. Turn your existing data infrastructure into a feature store.

    Language: Go, 1.9k stars
  • feathr-ai/feathr

    Feathr – A scalable, unified data and AI engineering platform for enterprise

    Language: Scala, 1.9k stars
  • 4paradigm/OpenMLDB

    OpenMLDB is an open-source machine learning database that provides a feature platform computing consistent features for training and inference.

    Language: C++, 1.7k stars
  • ClimbsRocks/auto_ml

    [UNMAINTAINED] Automated machine learning for analytics & production

    Language: Python, 1.7k stars
  • LastAncientOne/Deep_Learning_Machine_Learning_Stock

    Deep Learning and Machine Learning stocks represent promising opportunities for both long-term and short-term investors and traders.

    Language: Jupyter Notebook, 1.6k stars
  • Yimeng-Zhang/feature-engineering-and-feature-selection

    A Guide for Feature Engineering and Feature Selection, with implementations and examples in Python.

    Language: Jupyter Notebook, 1.6k stars
  • asavinov/intelligent-trading-bot

    Intelligent Trading Bot: Automatically generating signals and trading based on machine learning and feature engineering

    Language: Python, 1.5k stars
  • winedarksea/AutoTS

    Automated Time Series Forecasting

    Language: Python, 1.3k stars
  • logicalclocks/hopsworks

    Hopsworks - Data-Intensive AI platform with a Feature Store

    Language: Java, 1.3k stars
  • DeepWisdom/AutoDL

    Automated Deep Learning without ANY human intervention. 1st-place solution for the AutoDL challenge at NeurIPS.

    Language: Python, 1.2k stars
  • functime-org/functime

    Time-series machine learning at scale. Built with Polars for embarrassingly parallel feature extraction and forecasts on panel data.

    Language: Python, 1.1k stars
  • NVIDIA-Merlin/NVTabular

    NVTabular is a feature engineering and preprocessing library for tabular data, designed to quickly and easily manipulate terabyte-scale datasets used to train deep-learning-based recommender systems.

    Language: Python, 1.1k stars
  • fraunhoferportugal/tsfel

    An intuitive library to extract features from time series.

    Language: Python, 1k stars
  • sberbank-ai-lab/LightAutoML

    LAMA - automatic model creation framework

    Language: Python, 925 stars
  • stitchfix/hamilton

    A scalable general purpose micro-framework for defining dataflows. THIS REPOSITORY HAS BEEN MOVED TO www.github.com/dagworks-inc/hamilton

    Language: Python, 860 stars
  • alteryx/evalml

    EvalML is an AutoML library written in Python.

    Language: Python, 826 stars
  • pixeltable/pixeltable

    Pixeltable — AI Data infrastructure providing a declarative, incremental approach for multimodal workloads.

    Language: Python, 818 stars
  • abhayspawar/featexp

    Feature exploration for supervised learning

    Language: Jupyter Notebook, 762 stars
  • aniketpotabatti/Data-Science-EBooks

    Welcome to the Data Science EBooks repository! This collection offers a variety of high-quality ebooks on Data Science, Machine Learning, and AI. Perfect for both beginners and advanced learners, explore these resources to deepen your knowledge and skills.

  • jeongyoonlee/Kaggler

    Code for Kaggle Data Science Competitions

    Language: Python, 752 stars
  • ashishpatel26/Amazing-Feature-Engineering

    Feature engineering is the process of using domain knowledge to extract features from raw data via data mining techniques. These features can be used to improve the performance of machine learning algorithms. Feature engineering can be considered as applied machine learning itself.

    Language: Jupyter Notebook, 746 stars
  • HouJP/kaggle-quora-question-pairs

    Kaggle: Quora Question Pairs, 4th/3396 (https://www.kaggle.com/c/quora-question-pairs)

    Language: Python, 731 stars
  • HunterMcGushion/hyperparameter_hunter

    Easy hyperparameter optimization and automatic result saving across machine learning algorithms and libraries

    Language: Python, 708 stars
  • google/temporian

    Temporian is an open-source Python library for preprocessing ⚡ and feature engineering 🛠 temporal data 📈 for machine learning applications 🤖

    Language: Python, 701 stars