masumrumi
I am a Data Scientist//Data Engineer/Software engineer/learner, and Independent Consultant. I enjoy documenting and writing about technology.
MerckNew York, NY
Pinned Repositories
A-Data-Scientists-Arsenal
Data Science is an ever-growing field with lots of resources to learn. However, it can be really hard to choose from those resources. Therefore, I have created this repository with bunch of resources to help my fellow data scientists.
aws-glue-samples
AWS Glue code samples
code_snippets
COVID-19
Novel Coronavirus (COVID-19) Cases, provided by JHU CSSE
imdb_rotten_web-scraping
In this project, I am going to start learning how to do web scraping. I am planning to web scrape iMDB and Rotten Tomato to get info about each movie. My aim is to collect as much data as possible. This will be a continuous project with the implementation of machine learning and NLP. We will start with machine learning section and try to predict the movie score based on all the info we collect from those two websites. Let's get started.
Kaggle_Projects
This is a repository of all the kaggle projects
Machine-Learning-Series
SPOTIFY_API
This repository is all about accessing Spotify data using Python's "Spotipy" module and saving it in a local database.
tutorial_codes
This repository contains code used for YouTube tutorials.
masumrumi's Repositories
masumrumi/Kaggle_Projects
This is a repository of all the kaggle projects
masumrumi/SPOTIFY_API
This repository is all about accessing Spotify data using Python's "Spotipy" module and saving it in a local database.
masumrumi/imdb_rotten_web-scraping
In this project, I am going to start learning how to do web scraping. I am planning to web scrape iMDB and Rotten Tomato to get info about each movie. My aim is to collect as much data as possible. This will be a continuous project with the implementation of machine learning and NLP. We will start with machine learning section and try to predict the movie score based on all the info we collect from those two websites. Let's get started.
masumrumi/A-Data-Scientists-Arsenal
Data Science is an ever-growing field with lots of resources to learn. However, it can be really hard to choose from those resources. Therefore, I have created this repository with bunch of resources to help my fellow data scientists.
masumrumi/aws-glue-samples
AWS Glue code samples
masumrumi/code_snippets
masumrumi/COVID-19
Novel Coronavirus (COVID-19) Cases, provided by JHU CSSE
masumrumi/Machine-Learning-Series
masumrumi/tutorial_codes
This repository contains code used for YouTube tutorials.
masumrumi/cv
masumrumi/data-science-on-aws
AI and Machine Learning with Kubeflow, Amazon EKS, and SageMaker
masumrumi/deploying-machine-learning-models
Example Repo for the Udemy Course "Deployment of Machine Learning Models"
masumrumi/docs.getdbt.com
The code behind docs.getdbt.com
masumrumi/dragon-book-exercise-answers
Compilers Principles, Techniques, & Tools (purple dragon book) second edition exercise answers. 编译原理(紫龙书)第2版习题答案。
masumrumi/Line-of-Therapy-Algorithm
This is the Line of Therapy Algorithm, as described in the paper "Temporal phenotyping by mining healthcare data to derive lines of therapy for cancer" pending submission in the Journal of Biomedical Informatics.
masumrumi/masumrumi
Config files for my GitHub profile.
masumrumi/Parser
This is simple parser written in python to do simple calculations.
masumrumi/pdtools2
masumrumi/pyspark_titanic
masumrumi/temp_dir
masumrumi/Tindog
masumrumi/tindog-final
masumrumi/vim-config
my vim config to share amongst my machines