/Credit_Card_Fraud

Credit Card Fraud Detection with Kaggle Dataset

Primary LanguageJupyter Notebook

Credit_Card_Fraud

Data science project:

  • EDA on the dataset with visually appealing graphs
  • Model built:
    1. SVM with grid search for hyperparameters tuning
    1. SVM with feature selection
    1. SVM with SMOTE upsampling on the frauds
    1. Random Forest with grid search for hyperparameter tuning
    1. 3-layer ANN with Keras
  • Metrics used: AUC, Precision, Recall
  • Sample model results (for 3-layer ANN): AUC 98%, Precision 93%, Recall 74%

File organization

  • Data: folder containing credit card transactions by Kaggle https://www.kaggle.com/dalpozz/creditcardfraud
  • ProgressCheck: preliminary results of proposal, EDAs, inferential stats
  • FinalResults: code and graphs wrapped together in jupyterbooks: Part1 for EDA and inferential stats, Part2 for predictive modeling, and a separate report

Important packages

  • [Sklearn]
  • [Tensorflow]
  • [Keras]
  • [statsmodel]
  • [matplotlib]
  • [seaborn]
  • [pandas]
  • [numpy]
  • [scipy]

Contact

Acknowledgments

  • Special thanks to Amir Ziai, Jenny Hung for feedbacks on the project