Pinned Repositories
BreastCancer-PCA
Principal component analysis (PCA) is a technique used for identification of a smaller number of uncorrelated variables known as principal components from a larger set of data. The technique is widely used to emphasize variation and capture strong patterns in a data set.
ClickOnAd
In this project I will be working with a fake advertising data set, indicating whether or not a particular internet user clicked on an Advertisement on a company website. We will try to create a model that will predict whether or not they will click on an ad based off the features of that user with using Logistic Regression.
CrimeRate
Crime Rate Analysis with Python ML
ETH_Cryptocurrency
Ethereum token (also known as Ether, ETH) is the second largest cryptocurrency by market capitalization. It is the native token of the decentralized Ethereum platform that has an ambition to become the largest platform for decentralized applications and smart contracts. The primary purpose of the Ethereum token is to be used for the platform itself, particularly with the design and execution of its smart contracts and decentralized applications.
IBM_Capstone
This repository contains files of IBM's Applied Data Science Capstone Project.
MovieRecommenderSystem
I will create a basic recommendation system by suggesting items that are most similar to a particular item, in this case, movies. This is not a true robust recommendation system, to describe it more accurately, it just tells you what movies/items are most similar to your movie choice.
Python_DeepLearning
This repo contains Python Deep Learning notebooks for educational purpose.
Python_ML
This repository contains statistical modeling with Python Machine Learning
R_DataAnalysis
This repository contains data analysis instructions for R language.
YELPreviews
In this NLP project I will be attempting to classify Yelp Reviews into 1 star or 5 star categories based off the text content in the reviews. This will be a simpler procedure than the lecture, since we will utilize the pipeline methods for more complex tasks. I will use the Yelp Review Data Set from Kaggle. Each observation in this dataset is a review of a particular business by a particular user. The "stars" column is the number of stars (1 through 5) assigned by the reviewer to the business. (Higher stars is better.) In other words, it is the rating of the business by the person who wrote the review. The "cool" column is the number of "cool" votes this review received from other Yelp users. All reviews start with 0 "cool" votes, and there is no limit to how many "cool" votes a review can receive. In other words, it is a rating of the review itself, not a rating of the business. The "useful" and "funny" columns are similar to the "cool" column.
cansumericli's Repositories
cansumericli/MovieRecommenderSystem
I will create a basic recommendation system by suggesting items that are most similar to a particular item, in this case, movies. This is not a true robust recommendation system, to describe it more accurately, it just tells you what movies/items are most similar to your movie choice.
cansumericli/BreastCancer-PCA
Principal component analysis (PCA) is a technique used for identification of a smaller number of uncorrelated variables known as principal components from a larger set of data. The technique is widely used to emphasize variation and capture strong patterns in a data set.
cansumericli/ClickOnAd
In this project I will be working with a fake advertising data set, indicating whether or not a particular internet user clicked on an Advertisement on a company website. We will try to create a model that will predict whether or not they will click on an ad based off the features of that user with using Logistic Regression.
cansumericli/CrimeRate
Crime Rate Analysis with Python ML
cansumericli/ETH_Cryptocurrency
Ethereum token (also known as Ether, ETH) is the second largest cryptocurrency by market capitalization. It is the native token of the decentralized Ethereum platform that has an ambition to become the largest platform for decentralized applications and smart contracts. The primary purpose of the Ethereum token is to be used for the platform itself, particularly with the design and execution of its smart contracts and decentralized applications.
cansumericli/IBM_Capstone
This repository contains files of IBM's Applied Data Science Capstone Project.
cansumericli/Python_ML
This repository contains statistical modeling with Python Machine Learning
cansumericli/Python_DeepLearning
This repo contains Python Deep Learning notebooks for educational purpose.
cansumericli/R_DataAnalysis
This repository contains data analysis instructions for R language.
cansumericli/YELPreviews
In this NLP project I will be attempting to classify Yelp Reviews into 1 star or 5 star categories based off the text content in the reviews. This will be a simpler procedure than the lecture, since we will utilize the pipeline methods for more complex tasks. I will use the Yelp Review Data Set from Kaggle. Each observation in this dataset is a review of a particular business by a particular user. The "stars" column is the number of stars (1 through 5) assigned by the reviewer to the business. (Higher stars is better.) In other words, it is the rating of the business by the person who wrote the review. The "cool" column is the number of "cool" votes this review received from other Yelp users. All reviews start with 0 "cool" votes, and there is no limit to how many "cool" votes a review can receive. In other words, it is a rating of the review itself, not a rating of the business. The "useful" and "funny" columns are similar to the "cool" column.