proteinshake_release


This repository contains the code for releasing proteinshake datasets. It also computes the random, sequence, and structure splits.

To release, first install proteinshake, foldseek, and cd-hit.
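
Before running the release, it can help to confirm that the three prerequisites are actually visible from your environment. The following is a minimal sketch, not part of this repository. It assumes the command-line tools are exposed on PATH as foldseek and cd-hit and that the Python package imports as proteinshake; the helper names missing_tools and missing_modules are hypothetical.

```python
import importlib.util
import shutil

# Assumed executable and module names; adjust if your installation differs.
REQUIRED_TOOLS = ["foldseek", "cd-hit"]
REQUIRED_MODULES = ["proteinshake"]


def missing_tools(tools):
    """Return the executables from `tools` that are not found on PATH."""
    return [tool for tool in tools if shutil.which(tool) is None]


def missing_modules(modules):
    """Return the top-level modules from `modules` that cannot be located."""
    return [name for name in modules if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_tools(REQUIRED_TOOLS) + missing_modules(REQUIRED_MODULES)
    if missing:
        print("Missing prerequisites:", ", ".join(missing))
    else:
        print("All prerequisites found.")
```

An empty result only shows that the names resolve; it does not check versions.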

Adjust the TAG, SCRATCH, DESTINATION, and NJOBS variables in release.py according to your system.
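
As an illustration only, the four variables might be set along these lines. Every value below is a hypothetical placeholder, and the meaning given in each comment is an assumption inferred from the variable name, not something confirmed by release.py; check the script itself for the expected types and semantics.

```python
import os

# All values are hypothetical placeholders, not defaults from release.py.
TAG = "example-tag"                            # assumed: label identifying this release
SCRATCH = "/path/to/scratch"                   # assumed: working directory for intermediate files
DESTINATION = "/path/to/release"               # assumed: directory receiving the finished datasets
NJOBS = max(1, (os.cpu_count() or 1) - 1)      # assumed: number of parallel worker processes
```

Leaving one core free in NJOBS is a design choice of this sketch, so the machine stays responsive during a long run.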

Then run: python release.py

Note that this repository is licensed under GPLv3, due to dependency licenses.