cansumericli

Data Engineer

New York City

Pinned Repositories

BreastCancer-PCA
Principal component analysis (PCA) is a technique used for identification of a smaller number of uncorrelated variables known as principal components from a larger set of data. The technique is widely used to emphasize variation and capture strong patterns in a data set.
Language:Jupyter Notebook1 0 00
ClickOnAd
In this project I will be working with a fake advertising data set, indicating whether or not a particular internet user clicked on an Advertisement on a company website. We will try to create a model that will predict whether or not they will click on an ad based off the features of that user with using Logistic Regression.
Language:Jupyter Notebook1 0 00
CrimeRate
Crime Rate Analysis with Python ML
Language:Jupyter Notebook1 0 00
ETH_Cryptocurrency
Ethereum token (also known as Ether, ETH) is the second largest cryptocurrency by market capitalization. It is the native token of the decentralized Ethereum platform that has an ambition to become the largest platform for decentralized applications and smart contracts. The primary purpose of the Ethereum token is to be used for the platform itself, particularly with the design and execution of its smart contracts and decentralized applications.
Language:Jupyter Notebook1 0 00
IBM_Capstone
This repository contains files of IBM's Applied Data Science Capstone Project.
Language:Jupyter Notebook1 1 00
MovieRecommenderSystem
I will create a basic recommendation system by suggesting items that are most similar to a particular item, in this case, movies. This is not a true robust recommendation system, to describe it more accurately, it just tells you what movies/items are most similar to your movie choice.
Language:Jupyter Notebook2 0 00
Python_DeepLearning
This repo contains Python Deep Learning notebooks for educational purpose.
Language:Jupyter Notebook0 0 00
Python_ML
This repository contains statistical modeling with Python Machine Learning
Language:Jupyter Notebook1 1 00
R_DataAnalysis
This repository contains data analysis instructions for R language.
Language:R0 1 00
YELPreviews
In this NLP project I will be attempting to classify Yelp Reviews into 1 star or 5 star categories based off the text content in the reviews. This will be a simpler procedure than the lecture, since we will utilize the pipeline methods for more complex tasks. I will use the Yelp Review Data Set from Kaggle. Each observation in this dataset is a review of a particular business by a particular user. The "stars" column is the number of stars (1 through 5) assigned by the reviewer to the business. (Higher stars is better.) In other words, it is the rating of the business by the person who wrote the review. The "cool" column is the number of "cool" votes this review received from other Yelp users. All reviews start with 0 "cool" votes, and there is no limit to how many "cool" votes a review can receive. In other words, it is a rating of the review itself, not a rating of the business. The "useful" and "funny" columns are similar to the "cool" column.
Language:Jupyter Notebook0 0 00

cansumericli's Repositories

cansumericli/MovieRecommenderSystem
I will create a basic recommendation system by suggesting items that are most similar to a particular item, in this case, movies. This is not a true robust recommendation system, to describe it more accurately, it just tells you what movies/items are most similar to your movie choice.
Language:Jupyter Notebook2 0 00
cansumericli/BreastCancer-PCA
Principal component analysis (PCA) is a technique used for identification of a smaller number of uncorrelated variables known as principal components from a larger set of data. The technique is widely used to emphasize variation and capture strong patterns in a data set.
Language:Jupyter Notebook1 0 00
cansumericli/ClickOnAd
In this project I will be working with a fake advertising data set, indicating whether or not a particular internet user clicked on an Advertisement on a company website. We will try to create a model that will predict whether or not they will click on an ad based off the features of that user with using Logistic Regression.
Language:Jupyter Notebook1 0 00
cansumericli/CrimeRate
Crime Rate Analysis with Python ML
Language:Jupyter Notebook1 0 00
cansumericli/ETH_Cryptocurrency
Ethereum token (also known as Ether, ETH) is the second largest cryptocurrency by market capitalization. It is the native token of the decentralized Ethereum platform that has an ambition to become the largest platform for decentralized applications and smart contracts. The primary purpose of the Ethereum token is to be used for the platform itself, particularly with the design and execution of its smart contracts and decentralized applications.
Language:Jupyter Notebook1 0 00
cansumericli/IBM_Capstone
This repository contains files of IBM's Applied Data Science Capstone Project.
Language:Jupyter Notebook1 1 00
cansumericli/Python_ML
This repository contains statistical modeling with Python Machine Learning
Language:Jupyter Notebook1 1 00
cansumericli/Python_DeepLearning
This repo contains Python Deep Learning notebooks for educational purpose.
Language:Jupyter Notebook0 0 00
cansumericli/R_DataAnalysis
This repository contains data analysis instructions for R language.
Language:R0 1 00
cansumericli/YELPreviews
In this NLP project I will be attempting to classify Yelp Reviews into 1 star or 5 star categories based off the text content in the reviews. This will be a simpler procedure than the lecture, since we will utilize the pipeline methods for more complex tasks. I will use the Yelp Review Data Set from Kaggle. Each observation in this dataset is a review of a particular business by a particular user. The "stars" column is the number of stars (1 through 5) assigned by the reviewer to the business. (Higher stars is better.) In other words, it is the rating of the business by the person who wrote the review. The "cool" column is the number of "cool" votes this review received from other Yelp users. All reviews start with 0 "cool" votes, and there is no limit to how many "cool" votes a review can receive. In other words, it is a rating of the review itself, not a rating of the business. The "useful" and "funny" columns are similar to the "cool" column.
Language:Jupyter Notebook0 0 00

cansumericli

Pinned Repositories

BreastCancer-PCA

ClickOnAd

CrimeRate

ETH_Cryptocurrency

IBM_Capstone

MovieRecommenderSystem

Python_DeepLearning

Python_ML

R_DataAnalysis

YELPreviews

cansumericli's Repositories

cansumericli/MovieRecommenderSystem

cansumericli/BreastCancer-PCA

cansumericli/ClickOnAd

cansumericli/CrimeRate

cansumericli/ETH_Cryptocurrency

cansumericli/IBM_Capstone

cansumericli/Python_ML

cansumericli/Python_DeepLearning

cansumericli/R_DataAnalysis

cansumericli/YELPreviews