patrickbryant1/Umol

limit in protein length or msa sequence number?

Closed this issue · 3 comments

hi as a test case i was using a protein of 539 residue lenth and msa of 25 protein sequences.
but i am getting memory exhaustion error. Is colab limited to protein length or msa size?

Hi, the limit is on protein length. A statistical representation is created from the MSA and the size of this thereby doesn't matter. However, 25 sequences is probably too shallow to get good results.

This depends on the available RAM. Umol has no limit. I don't know what you have available in Colab, but recommend to run it locally for bigger proteins. We have not trained on phosphorylation sites.