Pinned Repositories
Movie-Scraper-App
A simple web app scraping data from popular movie webpage "Filmweb" and visualizing basic statistics based on scraped data using Python (including bottle library) JavaScript for visualization.
association-rules
Asscociation Rules implemented on the transaction dataset
cinema-booking-app
dimension-reduction-pca
Dimension reduction using principal component analysis (PCA) on the medical dataset
econometrics-data-processing-bachelor-thesis
Automated data collection and processing with Python in order to create econometric OLS model. Model constructed and tested with R programming language
image-clustering
Image clustering with a help of CLARA algorithm. Calculating the percentage of Warsaw’s green space
ml-classification-knn
Drug use prediction using multiple machine learning algorithms (Random Forrest, k-Nearest Neighbors Classifier, Logistic Regression, Support Vector Classifier). Data manipulation and feature engineering. Balancing data with SMOTE+Tomek in order to reduce recall and increase precision. Cross Validation used for parameter tuning
ml-deep-learning-neural-network
Comparison of neural network performanc with other ML methods for two different types of data (image classification and regression)
ml-regression-random-forest
Traffic prediction using multiple machine learning algorithms (Linear, Lasso, Ridge, and Elastic Net Regression, Random Forest, KNN regressorm, SVM). Data manipulation and feature engineering
monte-carlo-covid-prediction-shinyapp
szymonsocha's Repositories
szymonsocha/association-rules
Asscociation Rules implemented on the transaction dataset
szymonsocha/cinema-booking-app
szymonsocha/dimension-reduction-pca
Dimension reduction using principal component analysis (PCA) on the medical dataset
szymonsocha/econometrics-data-processing-bachelor-thesis
Automated data collection and processing with Python in order to create econometric OLS model. Model constructed and tested with R programming language
szymonsocha/image-clustering
Image clustering with a help of CLARA algorithm. Calculating the percentage of Warsaw’s green space
szymonsocha/ml-classification-knn
Drug use prediction using multiple machine learning algorithms (Random Forrest, k-Nearest Neighbors Classifier, Logistic Regression, Support Vector Classifier). Data manipulation and feature engineering. Balancing data with SMOTE+Tomek in order to reduce recall and increase precision. Cross Validation used for parameter tuning
szymonsocha/ml-deep-learning-neural-network
Comparison of neural network performanc with other ML methods for two different types of data (image classification and regression)
szymonsocha/ml-regression-random-forest
Traffic prediction using multiple machine learning algorithms (Linear, Lasso, Ridge, and Elastic Net Regression, Random Forest, KNN regressorm, SVM). Data manipulation and feature engineering
szymonsocha/monte-carlo-covid-prediction-shinyapp
szymonsocha/logit-probit-models
Prediction of student dropout using Logit and Probit models. Hypothesis testing to find the best model. Comparing Logit and Probit models performance. Interpreting the results and making conclusions
szymonsocha/machine-learning-university-project
Compairing the performance of various Machine Learning algorithms on the given medical dataset (disease prediction)
szymonsocha/master-thesis
szymonsocha/szymonsocha.github.io
Github.io Portfolio
szymonsocha/text-mining-nlp