Pinned Repositories
2016-US-President-Election-Primary-Results-Analysis
Correlation analysis between candidates and county facts based on 2016 US President Election Primary Results by county
Analyzing-sequences
Auto-Insurance-Risk-Classification-and-Claim-Prediction
XGB model and feature importance to predict At Fault Auto Claims
AWS-Sage-Maker-Machine-Learning-Experiments-Automation
Machine Learning experiments automation with the help of AWS Sage Maker using XGBoost Classification and Insurance Property data
Boto3-Demo
Examples of dynamic creation and use of VPC, EC2, Load Balancer, Auto Scaling group, Launch Configuration, Redshift cluster, S3, SQS, SNS
dbt_scd2_plus
Slowly Changing Dimension Type 2 (scd2) custom materialization
Mini-ETL-Tool
Mini ETL Tool is a Python module. It allows to run SQL and CLI commands in parallel or sequential mode, set up preconditions, dependencies and notifications
TweetsAutorshipAttributionModelsEvaluation
In this notebook I work on the question whether the author of a tweet (very short text) can be successfully identified. I try to choose the best classification method its parameters set and features
KaterynaD's Repositories
KaterynaD/Boto3-Demo
Examples of dynamic creation and use of VPC, EC2, Load Balancer, Auto Scaling group, Launch Configuration, Redshift cluster, S3, SQS, SNS
KaterynaD/TweetsAutorshipAttributionModelsEvaluation
In this notebook I work on the question whether the author of a tweet (very short text) can be successfully identified. I try to choose the best classification method its parameters set and features
KaterynaD/Auto-Insurance-Risk-Classification-and-Claim-Prediction
XGB model and feature importance to predict At Fault Auto Claims
KaterynaD/Mini-ETL-Tool
Mini ETL Tool is a Python module. It allows to run SQL and CLI commands in parallel or sequential mode, set up preconditions, dependencies and notifications
KaterynaD/2016-US-President-Election-Primary-Results-Analysis
Correlation analysis between candidates and county facts based on 2016 US President Election Primary Results by county
KaterynaD/Analyzing-sequences
KaterynaD/AWS-Sage-Maker-Machine-Learning-Experiments-Automation
Machine Learning experiments automation with the help of AWS Sage Maker using XGBoost Classification and Insurance Property data
KaterynaD/aws_data_pipeline_samples
Few AWS Data Pipeline samples to demo export from MS SQL to a file in S3 bucket, load a DynamoDB table to Redshift, multiple dependencies in the flow
KaterynaD/BartScraper
The application collects real time train departures from Bart API
KaterynaD/Data-Feeds
Advanced SQL
KaterynaD/data-pipeline-samples
This repository hosts sample pipelines
KaterynaD/Dimensional-Modeling---PolicyTransactions
KaterynaD/eva.ru
What Russian women talk about - Natural Language Processing (NLP) research of Russian women eva.ru forum
KaterynaD/Growing-Stocks
KaterynaD/HelpDesk-Tickets-Forecasting
KaterynaD/HelpDesk-Tickets-Surveys-Summarization
Snowflake Cortex based
KaterynaD/Insurance-Data-Pipelines
Pentaho Data Integration ETL and Matillion ELT
KaterynaD/Insurance-Data-Warehouse
Data Warehouse Modeling
KaterynaD/KaterynaD
Config files for my GitHub profile.
KaterynaD/KaterynaD.github.io
Personal site
KaterynaD/Load-Fundamental-Data
KaterynaD/Redshift_DW_Deployment
CI/CD pipeline to deploy DW schema changes in Redshift based on AWS CodePipeline, CodeBuild, FlyWay and JUnit for testing
KaterynaD/Snowflake_dbt_DW_deployment
CI/CD GitHub actions FlyWay Snowflake schema changes and dbt pipeline
KaterynaD/TechcrunchPostsMulticlassPostsClassification
In this notebook I search the best classifier and its parameters for posts multi-class classifications based on authorship attributes
KaterynaD/TweetsListener
Collects tweets and performs sentiment analysis based on emoticons and NLP (TextBlob)
KaterynaD/Water-Peril-Claims-Research-with-XGBoost-and-GLM-models