andiosika
CSR and operations specialist who fell in love with philanthropy and more so with data. Learning how to use it as a superpower for do-gooding!
https://andiosika.wixsite.com/mysite/blog
Phoenix, AZ
Pinned Repositories
-git_practice
andiosika.github.io
Non-profit and operations analyst learning as much as she can about data science theory and application in hopes of one day using her superpowers for good.
applying-gradient-descent-data-science-intro-000
applying-gradient-descent-lab-data-science-intro-000
Binomial-Classification-Ranom-Forest-hyper-imbalance
Using binomial classification to predict COVID-19 infection on a large dataset (>618K samples) with extreme imbalance, where the minority class (0.13% of samples) is the target. The final iteration is a manually tuned random forest classifier with >95% accuracy and >64% recall.
Example_Tableau
Examples of Tableau dashboards that have been created using various datasets
Flatiron_Capstone
Sentiment analysis using Natural Language Processing (NLP) with multi-class support vector modeling, plus clustering / segmentation using Latent Dirichlet Allocation (LDA).
Multiple-linear-regression-for-predicting-home-prices
Using 21 categorical and numeric features in a multivariate linear regression to find that 79% of a home's price can be positively affected by a combination of certain features, such as location, square footage, condition, and age of the home.
NLP-to-identify-toxic-or-abusive-language-for-online-conversation-using-Keras-Deep-Learning-Models
Natural Language Processing: A multi-headed model capable of detecting different types of online discussion toxicity like threats, obscenity, insults, and identity-based hate using Keras RNN LSTM and focal loss to address a hyper-imbalanced dataset.
SQL_Hypothesis_Testing_Workflow
SQL & A/B / Hypothesis Testing to inform business intelligence