tanaymukherjee

Data science enthusiast and digital marketing expert. Past experience in analytics, research, and strategy. Academics: Computer Science Engineering and MS in Statistics and Data Science.

IBM | Ogilvy | Maersk | CUNY | Tesla
New York

Pinned Repositories

Case-Study-Predicting-Bankruptcy
Using available bank data and parameters, identify the most influential variables and predict bankruptcy for the given financial model.
Language: R
Complex-SQL-Exercise
SQL queries of all kinds, collected in a single repository.
Deep-Learning-with-PyTorch
PyTorch is an open source machine learning library based on the Torch library, used for applications such as computer vision and natural language processing, primarily developed by Facebook's AI Research lab. It is free and open-source software released under the Modified BSD license.
Language: Jupyter Notebook
Dimensionality-Reduction
In statistics, machine learning, and information theory, dimensionality reduction or dimension reduction is the process of reducing the number of random variables under consideration by obtaining a set of principal variables. Approaches can be divided into feature selection and feature extraction.
Language: Jupyter Notebook
Dissecting-Yelp-Dataset
This dataset is a subset of Yelp's businesses, reviews, and user data. It was originally put together for the Yelp Dataset Challenge which is a chance for students to conduct research or analysis on Yelp's data and share their discoveries. In the dataset you'll find information about businesses across 11 metropolitan areas in four countries.
Language: Jupyter Notebook
Exploring-SQL-with-R
The idea is to apply SQL skills in R by loading data from text files into a relational database and then running SQL queries to filter it.
Language: R
Google-Analytics-with-R
How to automate a reporting suite from Google Analytics (GA) to R, so that data can be pulled at will without interacting with the Google Analytics interface. There are various things one can do, and we will cover each of them.
Language: R
Investigating-NYC-Parking-Violations
For this project, we will analyze millions of NYC parking violations issued since January 2016.
Language: Python
Natural-Language-Processing
Natural language processing is a subfield of linguistics, computer science, information engineering, and artificial intelligence concerned with the interactions between computers and human languages, in particular how to program computers to process and analyze large amounts of natural language data.
Language: Jupyter Notebook
Time-Series-Modeling
A time series is a series of data points indexed in time order. Most commonly, a time series is a sequence taken at successive equally spaced points in time. Thus it is a sequence of discrete-time data.
Language: Jupyter Notebook

tanaymukherjee's Repositories

tanaymukherjee/Dissecting-Yelp-Dataset
This dataset is a subset of Yelp's businesses, reviews, and user data. It was originally put together for the Yelp Dataset Challenge which is a chance for students to conduct research or analysis on Yelp's data and share their discoveries. In the dataset you'll find information about businesses across 11 metropolitan areas in four countries.
Language: Jupyter Notebook
tanaymukherjee/A-B-Testing-in-R
A/B testing (or split-testing) is a randomized experiment with two variants A and B. It includes application of statistical hypothesis testing (or two-sample hypothesis testing), as used in the field of statistics. A/B testing is a way to compare two versions of a single variable, typically by testing a subject's response to variant A against variant B, and determining which of the two variants is more effective.
Language: R
tanaymukherjee/Deep-Learning-with-PyTorch
PyTorch is an open source machine learning library based on the Torch library, used for applications such as computer vision and natural language processing, primarily developed by Facebook's AI Research lab. It is free and open-source software released under the Modified BSD license.
Language: Jupyter Notebook
tanaymukherjee/Investigating-NYC-Parking-Violations
For this project, we will analyze millions of NYC parking violations issued since January 2016.
Language: Python
tanaymukherjee/Shapley-Value
Language: Jupyter Notebook
tanaymukherjee/Spoken-Language-Processing-in-Python
Language: Jupyter Notebook
tanaymukherjee/Natural-Language-Processing
Natural language processing is a subfield of linguistics, computer science, information engineering, and artificial intelligence concerned with the interactions between computers and human languages, in particular how to program computers to process and analyze large amounts of natural language data.
Language: Jupyter Notebook
tanaymukherjee/CIS_9440_Project_YouTube-and-Netflix-Viewership-Analysis
This is a repository to put together all the work for the final project from CIS 9440 - Data Warehousing and Analytics
Language: Jupyter Notebook
tanaymukherjee/Data-Science-Hacks-in-Python-Part-2
Simple hacks to speed up your data analysis.
Language: Jupyter Notebook
tanaymukherjee/Debugging-NY-Times-library
This is a web scraping project in which I am trying to gather information from the NY Times using its APIs.
Language: Jupyter Notebook
tanaymukherjee/Epileptic-Seizure-Recognition
Language: R
tanaymukherjee/Flow-in-R
Language: R
tanaymukherjee/HackerRank-Challenges
Language: Jupyter Notebook
tanaymukherjee/Humana-Mays-Healthcare-Analytics-Case-Competition-2020
Mays Business School, in partnership with Humana, presents the fourth annual Humana-Mays Healthcare Analytics Case Competition. The competition will be held virtually and offers an opportunity for U.S. master's students to showcase their analytical skills and solve a real-world business problem for Humana using real data.
Language: Jupyter Notebook
tanaymukherjee/Kaggle-Competition-Santander-Customer-Transaction-Prediction
https://www.kaggle.com/c/santander-customer-transaction-prediction
Language: Jupyter Notebook
tanaymukherjee/Learning-Kafka
tanaymukherjee/Linear-Regression-in-SQL
In this exercise, we will learn how to implement linear regression using only SQL.
Language: TSQL
tanaymukherjee/Machine-Learning-Fall-2020
This repo includes all the work/assignments I did as part of my coursework in Fall 2020 under the subject code STA 9891 with Prof. Rad.
Language: R
tanaymukherjee/ML-in-Bioinformatics
Bioinformatics is a subdiscipline of biology and computer science concerned with the acquisition, storage, analysis, and dissemination of biological data, most often DNA and amino acid sequences.
Language: Jupyter Notebook
tanaymukherjee/Network-Analysis
The promise of network analysis is the placement of significance on the relationships between actors, rather than seeing actors as isolated entities. The emphasis on complexity, along with the creation of a variety of algorithms to measure various aspects of networks, makes network analysis a central tool for digital humanities.
Language: R
tanaymukherjee/NLP-Class-Fall-2020
Language: Jupyter Notebook
tanaymukherjee/No-SQL-in-Python
Language: Jupyter Notebook
tanaymukherjee/OOP-in-Python
Demystifying the world of object-oriented programming in Python.
Language: Python
tanaymukherjee/PB_Challenge_2021
In this exercise, we try to predict from the given information whether a device will fail in the next 7 days.
Language: Jupyter Notebook
tanaymukherjee/Real-and-Fake-News-Analysis
Language: Jupyter Notebook
tanaymukherjee/SQL-Exercise-2
In this exercise, we will try to answer a specific data requirement.
Language: SQLPL
tanaymukherjee/Tableau-Dashboards
This repository is a showcase of all the Tableau dashboards I have built so far.
tanaymukherjee/tanaymukherjee
tanaymukherjee/Useful-Python-libraries-for-Data-Science
In this repository, I am trying to compile some useful Python libraries for data science tasks other than the commonly used ones like pandas, scikit-learn, matplotlib, etc. My idea is to regularly update the kernel to include some awesome Python libraries that can really come in handy for data analysis and machine learning tasks.
Language: Jupyter Notebook
tanaymukherjee/Working-With-Python-Functions
Language: Jupyter Notebook