Add FAIL and NO_HIT to metadata.tsv
Opened this issue · 1 comments
AntonPetrov commented
The results file metadata.tsv should record the status of each sequence, even if it crashed or had no 2D.
blakesweeney commented
An example of how this plays out is:
[rnaprod@hl-codon-22-03 49bba251b34645cc922cdd76617007]$ grep URS00006D37C6 output/results/tsv/metadata.tsv
URS00006D37C6 E_Glu GtRNAdb
[rnaprod@hl-codon-22-03 49bba251b34645cc922cdd76617007]$ find output/results/ -name 'URS00006D37C6*'
output/results/fasta/URS00006D37C6.fasta
In effect we only get some of the required files. Currently, RNAcentral verfies all results and then treats any incomplete ones as having no 2D and warns about it. It would be much easier to know from the results file if it worked or not instead of having to check.