songlab-cal/tape

Filtered Remote Homology Pretraining Dataset

cutecows opened this issue · 0 comments

In the supplementary section, it says that the filtered dataset used for supervised pretraining on the remote homology task is available in the repository. I'm not able to find this dataset. Unzipping the remote homology data only gives me the train, val, and test sets. Could you potentially clarify?