eldariont/svim

SV types in output VCF

Closed this issue · 3 comments

Hi,

In your WIKI you describe five different SVTYPEs: DEL, INV, DUP:TANDEM, DUP:INT, and INS:NOVEL.

I get all the above plus:

]chr19:107955]N
N[chr19:107969[
N[chr19:111124[
N[chr6:69267[
]chr1:278176]N

all SVTYPE=BND

What are these? Rubbish to filter out?

Liam

Hi Liam,

sorry for the delay. I was out of the office for a week.
The records you mention are breakends reported in the format that is described in the VCF specification. Have a look at http://samtools.github.io/hts-specs/VCFv4.2.pdf and particularly at section 5.4.

Best
David

Hi David,

I actually have same question.
I think what Liam meant to say is that svim doesn't report BND type of SV according to your wiki explanation.
Did I misunderstand because I did find svim report translocations?
I check a couple of translocations reported by svim by visualising the read alignment with ribbon and they seem to be true.

Best wishes,
Zih-Hua

Hi Zih-Hua, hi Liam,

yes, you are right. I should mention breakends (BNDs) in the wiki to avoid confusion. I did not mention breakends before because they do not represent one SV type but rather a mix of different things: Some represent translocations, many of them are caused by spurious split-read alignments (often close to telomeres and centromeres) while others could be part of more complex rearrangements. The best definition I have for BNDs is that they represent one or more reads that connect two distant genomic loci on the same or different chromosomes.

So every translocation will be detected as a BND variant by SVIM but not every BND variant is a translocation. I hope that makes it a bit clearer what they are :) Thanks for making me aware that I should be more explicit about this in the wiki.

Cheers
David