Pinned Repositories
Amazon-Sagemaker-Algorithm-Examples
Some of the best ML algorithms handpicked from AWS
amazon-sagemaker-examples
Example notebooks that show how to apply machine learning, deep learning and reinforcement learning in Amazon SageMaker
BERT
Research and experiments on the state-of-the-art language model for NLP
Data-Analysis-with-R
Using ggplot2, tidyr, dplyr, ggmap, choroplethr, shiny, logistic regression, clustering models and more
model-selection
An efficient way of selecting the right machine learning model
Natural-Language-Processing-in-Python
Natural Language Processing in Python (Article)
NLP-with-Python
scikit-learn, NLTK, spaCy, Gensim, TextBlob and more
Real-Time-Speech2Insights
Real-time speech-to-insights analytics, with an autopilot mode that harvests data via NLP
Text-to-Text-transfer-transformers
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
XGboost
How to win customer confidence using XGBoost
thecodemasterk's Repositories
thecodemasterk/SMS-Message-Spam-Detector
A simple Flask API to detect spam or ham using Python and Sklearn
thecodemasterk/DataScienceAndR
R Language Flipped Classroom
thecodemasterk/stock-prediction
Stock price prediction with a recurrent neural network. The data comes from the Chinese stock market.
thecodemasterk/data-science-roadmap
thecodemasterk/Data-Analysis-with-R
Using ggplot2, tidyr, dplyr, ggmap, choroplethr, shiny, logistic regression, clustering models and more
thecodemasterk/AnalyticsVidhya_DataSupremacy
3rd Place Solution for Data Supremacy Competition on Analytics Vidhya (https://datahack.analyticsvidhya.com/contest/the-data-supremacy/)
thecodemasterk/Duke
Duke is a fast and flexible deduplication engine written in Java
thecodemasterk/RDocumentation
R package to integrate rdocumentation.org into your R workflow
thecodemasterk/machine-learning-101
A repository related to datasets for Machine Learning.
thecodemasterk/dplyr-tutorial
Tutorials for the dplyr package in R
thecodemasterk/Movie_ratings_2016_17
Contains a dataset with movie ratings for some of the most popular movies for 2016 and 2017 (IMDB, Fandango, Metacritic, Rotten Tomatoes)
thecodemasterk/urban-data-science
Course materials, Jupyter notebooks, tutorials, guides, and demos for a Python-based urban data science course.
thecodemasterk/susanli2016.github.io
My data analysis blog, based on Minimal Mistakes theme
thecodemasterk/Deep-Learning-with-deeplearning.ai
Programming assignments for the Deep Learning Specialization at Coursera by Andrew Ng
thecodemasterk/Scraped_dataset_movie_ratings
I scraped over 2000 IMDB and Metacritic ratings using Python. The script I used is available in full as a walkthrough in this blog post.
thecodemasterk/data
Data and code behind the stories and interactives at FiveThirtyEight
thecodemasterk/Python-Code
Python functions and code snippets that come in handy from time to time
thecodemasterk/harvard-cs50
Problem sets and projects for Harvard CS50: Introduction to Computer Science
thecodemasterk/flexdashboard-talk
Slides and resources for flexdashboard talk at UseR! 2016
thecodemasterk/dedupe-examples
Examples for using the dedupe library
thecodemasterk/fortune500
Analyses of top US companies
thecodemasterk/knitr
A general-purpose tool for dynamic report generation in R
thecodemasterk/damshiny
Data Analysis Menu using Shiny