Pinned Repositories
big-data-challenge
Many of Amazon's shoppers depend on product reviews to make a purchase. Amazon makes these datasets publicly available. However, they are quite large and can exceed the capacity of local machines to handle. One dataset alone contains over 1.5 million rows; with over 40 datasets, this can be quite taxing on the average local computer. First goal for this assignment will be to perform the ETL process completely in the cloud and upload a DataFrame to an RDS instance. The second goal will be to use PySpark or SQL to perform a statistical analysis of selected data.
deep-learning-challenge
Create an algorithm to predict whether or not applicants for funding will be successful. With knowledge of machine learning and neural networks, use the features in the provided dataset to create a binary classifier that is capable of predicting whether applicants will be successful if funded.
geoml_workshops
GFPGAN
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
Houston_real_estate
Project analyses asking price for residential homes in Houston versus their tax value.
javascript-challenge
The extra-terrestrial menace has come to Earth and we here at ALIENS-R-REAL have collected all of the eye-witness reports we could to prove it! All we need to do now is put this information online for the world to see and then the matter will finally be put to rest.
leaflet-challenge
Your first task is to visualize an earthquake data set using leaflet and javascript.
matplotlib-challenge
Compare the performance of drug of interest versus the other treatment regimens on laboratory mice using MatPlotLib.
pandas-challenge
Pandas application to real data
Pet_Pals
RunTimeTerrorFootball app deployed on Heroku
sgkuzmin's Repositories
sgkuzmin/Pet_Pals
RunTimeTerrorFootball app deployed on Heroku
sgkuzmin/big-data-challenge
Many of Amazon's shoppers depend on product reviews to make a purchase. Amazon makes these datasets publicly available. However, they are quite large and can exceed the capacity of local machines to handle. One dataset alone contains over 1.5 million rows; with over 40 datasets, this can be quite taxing on the average local computer. First goal for this assignment will be to perform the ETL process completely in the cloud and upload a DataFrame to an RDS instance. The second goal will be to use PySpark or SQL to perform a statistical analysis of selected data.
sgkuzmin/deep-learning-challenge
Create an algorithm to predict whether or not applicants for funding will be successful. With knowledge of machine learning and neural networks, use the features in the provided dataset to create a binary classifier that is capable of predicting whether applicants will be successful if funded.
sgkuzmin/Unsupervised-machine-learning-challenge
Fit unsupervised machine learning models to cryptocurrencies data.
sgkuzmin/Supervised-Machine-Learning-challenge
Build a machine learning model that attempts to predict whether a loan from LendingClub will become high risk or not.
sgkuzmin/GFPGAN
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
sgkuzmin/tableau_challenge
Tableau study of citi bikes data
sgkuzmin/leaflet-challenge
Your first task is to visualize an earthquake data set using leaflet and javascript.
sgkuzmin/plotly-challenge
An interactive dashboard to explore the Belly Button Biodiversity dataset, which catalogs the microbes that colonize human navels.
sgkuzmin/javascript-challenge
The extra-terrestrial menace has come to Earth and we here at ALIENS-R-REAL have collected all of the eye-witness reports we could to prove it! All we need to do now is put this information online for the world to see and then the matter will finally be put to rest.
sgkuzmin/Houston_real_estate
Project analyses asking price for residential homes in Houston versus their tax value.
sgkuzmin/geoml_workshops
sgkuzmin/web-scraping-challenge
A web application that scrapes various websites for data related to the Mission to Mars and displays the information in a single HTML page
sgkuzmin/Web-Design-Challenge
Using HTML and CSS to create a dashboard showing off the analysis done on the weather data.
sgkuzmin/sqlalchemy-challenge
Climate analysis using sql-alchemy and flask
sgkuzmin/sgkuzmin.github.io
sgkuzmin/sql-challenge
Data engineering and data analysis of employee database
sgkuzmin/shockwave
Earthquake project to determine if frequency and magnitude have been increasing since the 1970's.
sgkuzmin/python-api-challenge
Evaluation of the weather patterns as one approaches equator
sgkuzmin/matplotlib-challenge
Compare the performance of drug of interest versus the other treatment regimens on laboratory mice using MatPlotLib.
sgkuzmin/pandas-challenge
Pandas application to real data
sgkuzmin/python-challenge
PyPoll and PyBank Python scripts analyzing the financial records of the company and creates a vote counting process.
sgkuzmin/VBA-challenge
Stock market data analysis in excel using VBA script
sgkuzmin/StartUp_data_analysis_in-excel
sgkuzmin/pyGeoPressure
Pore pressure prediction using seismic velocity and well log data