Predicting the Effect of Mutations on Clinical Drug Response using Graph Representation and Transformer Encoders
- Python == 3.9
- Pytorch == 1.12
- Scikit-learn == 1.1
We provide the test dataset used in this study, you can use test_data.pt in pytorch format or testset.csv in .csv format to evaluate our method. The trained model weights can be downloaded from OneDrive url.
python evaluation.py
- HH-suite for generating HHblits files from protein sequences (with the file suffix of .hhm)
- Alphafold2 for generating PDB files from protein sequences (with the file suffix of .pdb)
- DSSP for generating DSSP files from pdb files (with the file suffix of .dssp)
cd /src
python prepareData.py
Note: Due to the hardware limitation, the server version of Emden do not contain features generated by Alphafold2(dssp). We strongly recommend that you use this locally installed version.