bigdatamatta
Artificial Intelligence | Big Data | IoT | Data Science | Robotic Process Automation | GEOINT| Digital Transformation Expert
Pinned Repositories
AutoBazaar
AutoBazaar: An AutoML System from the Machine Learning Bazaar
autoimpute
Python package for Imputation Methods
Automated-Exploratory-Data-Analysis--EDA--
EDA is one part of the Machine Learning. The aims of EDA is to summarize a characteristiccs of the dataset. It's usually showed on visual method.
CategoricalEmbedder
Categorical Embedder is a python package that let's you convert your categorical variables into numeric via Neural Networks
ChaLearn_Automatic_Machine_Learning_Challenge_2015
Data-Science--Cheat-Sheet
Cheat Sheets
DS_Production
ETL-pipeline-Redshift-and-S3
We will acreate a data warehouse by using AWS and build an ETL pipeline for a database hosted on Redshift. In this project we will need to load data from S3 to staging tables on Redshift and execute SQL statements that create the analytics tables from these staging tables.
lifetime_value
MALSAR
Multi-task learning via Structural Regularization
bigdatamatta's Repositories
bigdatamatta/ETL-pipeline-Redshift-and-S3
We will acreate a data warehouse by using AWS and build an ETL pipeline for a database hosted on Redshift. In this project we will need to load data from S3 to staging tables on Redshift and execute SQL statements that create the analytics tables from these staging tables.
bigdatamatta/lifetime_value
bigdatamatta/Advanced-Natural-Language-Processing-with-TensorFlow-2
Advanced Natural Language Processing with TensorFlow 2, published by Packt
bigdatamatta/AICrowd-Insurance-Pricing-Game
Top 100 among 1400 participants in insurance claims prediction leaderboard submission to the AICrowd Insurance Pricing Game
bigdatamatta/atspy
AtsPy: Automated Time Series Models in Python (by @firmai)
bigdatamatta/Auto_TS
Automatically build ARIMA, SARIMAX, VAR, FB Prophet and XGBoost Models on Time Series data sets with a Single Line of Code. Now updated with Dask to handle millions of rows.
bigdatamatta/Auto_ViML
Automatically Build Multiple ML Models with a Single Line of Code. Created by Ram Seshadri. Collaborators Welcome. Permission Granted upon Request.
bigdatamatta/autocalibration
bigdatamatta/AutoGL
An autoML framework & toolkit for machine learning on graphs.
bigdatamatta/autogluon
AutoGluon: AutoML for Text, Image, and Tabular Data
bigdatamatta/automate-insurance-pricing
bigdatamatta/aws-step-functions-etl-pipeline-pattern
bigdatamatta/Data-Pre-processing
bigdatamatta/DCC
Deep Code Curation
bigdatamatta/EDA-Spark-Pydeequ
EDA pyspark + pydeequ
bigdatamatta/Explainable-Automated-Medical-Coding
Implementation and demo of explainable coding of clinical notes with Hierarchical Label-wise Attention Networks (HLAN)
bigdatamatta/FLAML
A fast and lightweight AutoML library.
bigdatamatta/gabak
GABAK Warehouse Layout Optimizer is a software program developed in C# to optimize, test, compare warehouse layouts for order picking operations.
bigdatamatta/GTS
Discrete Graph Structure Learning for Forecasting Multiple Time Series, ICLR 2021.
bigdatamatta/hctsa
Highly comparative time-series analysis
bigdatamatta/Kats
Kats, a kit to analyze time series data, a lightweight, easy-to-use, generalizable, and extendable framework to perform time series analysis, from understanding the key statistics and characteristics, detecting change points and anomalies, to forecasting future trends.
bigdatamatta/Machine-Learning-Automation-with-TPOT
Machine Learning Automation with TPOT, published by Packt
bigdatamatta/neural_prophet
NeuralProphet - a Neural Network based Time-Series model
bigdatamatta/predict-insurance-charges-with-autoai
Create an end-to-end AI solution that will help predict insurance premium cost with IBM Watson Studio and AutoAI.
bigdatamatta/PubLayNet
bigdatamatta/Semantic-Similarity-of-Sentences
Methods used: Cosine Similarity with Glove, Smooth Inverse Frequency, Word Movers Difference, Sentence Embedding Models (Infersent and Google Sentence Encoder), ESIM with pre-trained FastText embedding. Best performing method on Quora Question pair dataset was an Ensemble method with 0.27 log-loss.
bigdatamatta/transformers
bigdatamatta/Transformers-Tutorials
This repository contains demos I made with the Transformers library by HuggingFace.
bigdatamatta/wav2vec2-huggingface-demo
Speech to Text with self-supervised learning based on wav2vec 2.0 framework using Hugging Face's Transformer
bigdatamatta/wav2vec2_transformers
Transcribing audio files using Hugging Face's implementation of Wav2Vec2 + "chain-linking" NLP tasks to combine speech-to-text with downstream tasks like translation and summarisation.