ablab/quast

wrong format of all_alignments_*.tsv file in contigs_reports

LuJiansen opened this issue · 1 comments

Hi,
It looks like the TSV format of the all_alignments_*.tsv files in contigs_reports is wrong, as shown below:

$ head all_alignments_scaffolds_FINAL.tsv | cat -A
S1^IE1^IS2^IE2^IReference^IContig^IIDY^IAmbiguous^IBest_group$
115297652^I115395629^I57^I98004^Ichr5^Iscaffold_1^I99.75^I^ITrue$
indel: indel (> 5bp)$
115404026^I115421305^I106323^I123581^Ichr5^Iscaffold_1^I99.66^I^ITrue$
local misassembly$

In every line except the header, there are two tabs between the IDY and Ambiguous columns and an unexpected end-of-line after the Ambiguous column.

Sorry, I misunderstood this file.
But if anyone wants to extract just the alignment location of each contig block, like me, you may use this command:

cat <(head -n 1 all_alignments_file.tsv | awk '{print $1,$2,$3,$4,$5,$6,$7,$8}' OFS='\t') <(sed 1d all_alignments_file.tsv | grep '^[1-9]' | sed 's/\t\t/\t/g')
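
A variant that may be easier to adapt, as a minimal sketch: it assumes the double tab in the sample above is simply an empty Ambiguous field, so it keeps all nine columns rather than collapsing them, and it drops annotation lines such as "local misassembly". The function name extract_alignments is hypothetical, not part of QUAST.

```shell
# Hypothetical helper (not part of QUAST): print the header plus the
# alignment rows of an all_alignments_*.tsv file.
# Assumption: alignment rows are the lines whose first tab-separated
# field (S1) is purely numeric; every other line is an annotation.
extract_alignments() {
    awk -F'\t' 'NR == 1 || $1 ~ /^[0-9]+$/' "$1"
}

# Usage (file name taken from the example above):
#   extract_alignments all_alignments_scaffolds_FINAL.tsv > alignments_only.tsv
```

Because the fields are split on tabs only, an empty Ambiguous value stays in its own column and the header still lines up with the data.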