Questtion about novelty calculation

Question

Questtion about novelty calculation

Wangchentong opened this issue a year ago · 3 comments

Thanks for you authors share this great work, i wonder how the pdb dataset is curated for the calculation of novelty? do you split each chain in the pdb database or just based on the training single chain or some thing else?

Answer 1 · 2023-12-04T13:22:52.000Z

i guess you use default foldseek PDBdatabse?

Answer 2 · 2023-12-08T02:56:04.000Z

Hi, we take the whole PDB dataset as instructed in foldseek's documentation. I believe this is all the single sequences. We use the following flags to run novelty calculations.

-alignment-type 1 --format-output query,target,alntmscore,lddt --tmscore-threshold 0.0 --exhaustive-search --max-seqs 10000000000

Answer 3 · 2023-12-15T06:38:42.000Z

Thanks jason!