Computational Biophysics Term Project
Instructions for Running the Scraper:
- Install all required modules with: pip install -r requirements.txt
- Download a suitable ChromeDriver version from 'https://chromedriver.chromium.org/downloads'.
- Change the path to chromedriver in imgt_scrape.py to the path where ChromeDriver is present.
- Run the imgt_scrape.py file to scrape all the data for humans & pMH1 from the IMGT-3Dstructure-DB website.
- Run the process_imgt.py file to get the fasta files from the scraped data.
- Perform multiple sequence alignments for all the fasta files and save the results in the aligned_hla folder.
- Run the process_aligned_hla.py file to get the consensus paratope for each MHC ligand and the fasta file corresponding to that.
- Perform multiple sequence alignment for that fasta file and save the result in hla_msa.txt in the format '{MHC Ligand 4 digit Name} {Paratope sequence}'.
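
The consensus and output-format steps above can be illustrated with a small sketch. This is not the code in process_aligned_hla.py. It assumes that "consensus" means the most common non-gap residue in each alignment column, that gaps are written as '-', and that ligand names contain no spaces. The function names (consensus, format_record, parse_record) and the example name 'A*02:01' are hypothetical and chosen only for illustration.

```python
from collections import Counter


def consensus(aligned_seqs, gap="-"):
    """Column-wise majority residue of equal-length aligned sequences.

    Assumption: majority vote over non-gap characters; the project's
    actual consensus rule may differ.
    """
    if not aligned_seqs:
        raise ValueError("no sequences given")
    length = len(aligned_seqs[0])
    if any(len(seq) != length for seq in aligned_seqs):
        raise ValueError("aligned sequences must all have the same length")

    result = []
    for column in zip(*aligned_seqs):
        residues = [c for c in column if c != gap]
        if residues:
            # Most frequent residue in this column.
            result.append(Counter(residues).most_common(1)[0][0])
        else:
            # A column made up only of gaps stays a gap.
            result.append(gap)
    return "".join(result)


def format_record(name, sequence):
    """One hla_msa.txt line: '{name} {sequence}' (hypothetical helper)."""
    return f"{name} {sequence}"


def parse_record(line):
    """Split a line on the first run of whitespace into (name, sequence)."""
    name, sequence = line.strip().split(maxsplit=1)
    return name, sequence


if __name__ == "__main__":
    # Toy alignment, not real IMGT data.
    toy = ["GSHSMRY", "GSHSLRY", "GSH-MRY"]
    print(format_record("A*02:01", consensus(toy)))
```

Splitting on the first whitespace keeps the parser tolerant of extra spacing between the name and the sequence, but it would break if a ligand name itself contained a space.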