Machine Learning-Based Individualized Survival Prediction Model for Prognosis in Osteosarcoma: Data From the SEER Database

Data preprocessing is handled by R code: data_preprocessing.R, which imports codes for data cleaning from preprocessing.R

Model construction, hyperparameters tuning and evaluation are handled with pysurvival, scikit-learn and lifelines packages: ModelDevelopment-Without_tunning_output.ipynb

Web application based on streamlit package: app.py

The original data read in R code is not provided in this repository and needs to be extracted in the SEER database according to inclusion criteria (AYA site = 4.1 Osteosarcoma)

The data after data preprocessing is provided. To reproduce this study, first run the following codes to install packages:

git clone https://github.com/WHUH-ML/Osteosarcoma.git
pip install -r requirements.txt

Then run ModelDevelopmentWithoutTuningOutput.ipynb in Jupyter Notebook.

Run streamlit run app.py in terminal to open the web application locally.

Online web application

Paper link(To be updated)

Jaqen00/Osteosarcoma

Machine Learning-Based Individualized Survival Prediction Model for Prognosis in Osteosarcoma: Data From the SEER Database