r2dt-bio/R2DT

Add FAIL and NO_HIT to metadata.tsv

Opened this issue · 1 comments

The results file metadata.tsv should record the status of each sequence, even if it crashed or had no 2D.

An example of how this plays out is:

[rnaprod@hl-codon-22-03 49bba251b34645cc922cdd76617007]$ grep URS00006D37C6 output/results/tsv/metadata.tsv
URS00006D37C6	E_Glu	GtRNAdb
[rnaprod@hl-codon-22-03 49bba251b34645cc922cdd76617007]$ find output/results/ -name 'URS00006D37C6*'
output/results/fasta/URS00006D37C6.fasta

In effect we only get some of the required files. Currently, RNAcentral verfies all results and then treats any incomplete ones as having no 2D and warns about it. It would be much easier to know from the results file if it worked or not instead of having to check.