Datasets:
- fraud_detection_data: https://www.kaggle.com/datasets/rohitrox/healthcare-provider-fraud-detection-analysis/discussion
- claim_prediction_data: https://www.kaggle.com/datasets/easonlai/sample-insurance-claim-prediction-dataset
Hidden datasets (within processed_data/):
- icd9_diagnosis.csv, obtained from running icd9_scraper.py
- icd9_procedure.csv, obtained from running icd9_scraper.py
- final_training_set.csv, obtained from running preprocessing_train.ipynb
- final_test_set.csv, obtained from running preprocessing_test.ipynb
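
Since the files in processed_data/ have to be generated locally, a quick check of which ones are still missing can save a failed run later. The snippet below is a minimal sketch, not part of the repository: the helper name `missing_files` is hypothetical, and it assumes processed_data/ is resolved relative to the current working directory. The file names and the scripts that produce them are taken from the list above.

```python
from pathlib import Path

# Expected file -> script or notebook that produces it (taken from the list above).
EXPECTED = {
    "icd9_diagnosis.csv": "icd9_scraper.py",
    "icd9_procedure.csv": "icd9_scraper.py",
    "final_training_set.csv": "preprocessing_train.ipynb",
    "final_test_set.csv": "preprocessing_test.ipynb",
}


def missing_files(data_dir="processed_data"):
    """Return {file name: producing script} for each expected file absent from data_dir.

    Hypothetical helper; the default directory location is an assumption.
    """
    root = Path(data_dir)
    return {name: src for name, src in EXPECTED.items() if not (root / name).is_file()}


if __name__ == "__main__":
    # Report each missing file together with the step that generates it.
    for name, src in missing_files().items():
        print(f"missing {name}: run {src}")
```

Run from the repository root, this prints one line per file that still needs to be generated and prints nothing once all four are present.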