It is desirable to have one-line annotations for newly annotated proteins, but InterProScan outputs a TSV with multiple lines per protein, often with redundant annotations. This tool collapses each protein's annotations into a single, human-readable line. #Python
ggavelis/condense_InterProScan_annots
InterProScan is useful, but its annotations span multiple lines per sequence and are often redundant. This tool collapses them into a single, human-readable line for each annotated sequence.
Python
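
As a rough illustration of the idea (not the repository's actual code), the collapsing step might be sketched as follows. It assumes the standard InterProScan TSV layout, where column 1 is the protein accession, column 6 is the signature description, and the optional column 13 is the InterPro description, with "-" used as a placeholder for missing values. The function name `condense_annotations` and the "; " separator are hypothetical choices made for this example.

```python
def condense_annotations(tsv_lines):
    """Collapse InterProScan TSV rows into one description string per protein.

    Assumes the standard column order: protein accession in column 1,
    signature description in column 6, InterPro description in column 13
    (optional). This is an illustrative sketch, not the repository's method.
    """
    descriptions = {}  # protein ID -> list of unique descriptions, in first-seen order
    for line in tsv_lines:
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 6:
            continue  # skip blank or malformed rows
        seen = descriptions.setdefault(fields[0], [])
        candidates = [fields[5]]
        if len(fields) > 12:
            candidates.append(fields[12])
        for desc in candidates:
            desc = desc.strip()
            # "-" is treated as a missing value; exact duplicates are dropped
            if desc and desc != "-" and desc not in seen:
                seen.append(desc)
    return {protein: "; ".join(descs) for protein, descs in descriptions.items()}


if __name__ == "__main__":
    import sys

    # Hypothetical usage: python condense.py interproscan_output.tsv
    with open(sys.argv[1]) as handle:
        for protein, summary in condense_annotations(handle).items():
            print(f"{protein}\t{summary}")
```

Deduplication here is by exact string match, so near-identical descriptions from different member databases would still appear separately; a real tool may well merge them more aggressively.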