Illumina/paragraph

Error processing an INV

jjfarrell opened this issue · 6 comments

This error below occurs when processing this INV called by MANTA. Any suggestions? The variant record is supplying a REF. ALT, SVLEN and an SVINSEQ.

chr6 2893187 MantaINV:4:20498:20498:4:0:0;MantaINV:61660:0:0:1:0:0 C <INV> 548 PASS END=2893191;SVTYPE=INV;SVLEN=4;CIPOS=0,4;CIEND=-4,0;HOMLEN=4;HOMSEQ=TATA;SVINSLEN=51;SVINSSEQ=ACGTATATATATACGTATATATAATATATATATTATATATACGTATATATA;INV5;AC=2;AN=9578;FIBC_P=-0.000417444;HWE_SLP_P=-0.195618;FIBC_I=-0.000417444;HWE_SLP_I=-0.195618;MAX_IF=0.749985;MIN_IF=0.749985;LLK0=-281075;BETA_IF=0.49997,-3.8073e-06,1.89345e-06,3.9655e-06,2.17351e-06;ANN=<INV>|intron_variant|MODIFIER|SERPINB9|ENSG00000170542|transcript|ENST00000380698.4|protein_coding|5/6|c.567+220_567+223inv||||||;NS=4789;AF=0.000208812;MAF=0.000208812;AC_Het=2;AC_Hom=0;AC_Hemi=0;HWE=1;ExcHet=0.999896

It would be nice to get the chromosome in the below error message when running when trouble shooting.

2020-03-09 10:26:55,417 ERROR    Exception when running vcf2paragraph on /scratch/tmppmn0a0d6.vcf.gz
2020-03-09 10:26:55,421 ERROR    Traceback (most recent call last):
2020-03-09 10:26:55,422 ERROR      File "/share/pkg.7/paragraph/2.4a/install/lib/python3/grm/vcf2paragraph/__init__.py", line 286, in run_vcf2paragraph    alt_paths=params["alt_paths"])
2020-03-09 10:26:55,422 ERROR      File "/share/pkg.7/paragraph/2.4a/install/lib/python3/grm/vcf2paragraph/__init__.py", line 86, in convert_vcf    ref, indexed_vcf.name, ins_info_key, chrom, start, end, ref_node_padding, allele_graph)
2020-03-09 10:26:55,422 ERROR      File "/share/pkg.7/paragraph/2.4a/install/lib/python3/grm/vcfgraph/vcfgraph.py", line 128, in create_from_vcf    graph.add_record(record, allele_graph, varId, ins_info_key)
2020-03-09 10:26:55,422 ERROR      File "/share/pkg.7/paragraph/2.4a/install/lib/python3/grm/vcfgraph/vcfgraph.py", line 202, in add_record    self.add_alt(vcf.pos, vcf.stop, ref_sequence, alt_sequence, alt_samples, refSamples)
2020-03-09 10:26:55,423 ERROR      File "/share/pkg.7/paragraph/2.4a/install/lib/python3/grm/vcfgraph/vcfgraph.py", line 296, in add_alt    raise Exception("{}:{} missing REF or ALT sequence.".format(start, end))
2020-03-09 10:26:55,423 ERROR    Exception: 2893187:2893191 missing REF or ALT sequence.
2020-03-09 10:26:55,506 ERROR    VCF to JSON conversion failed.
2020-03-09 10:26:55,509 ERROR    multiprocessing.pool.RemoteTraceback: """Traceback (most recent call last):  File "/share/pkg.7/python3/3.6.9/install/lib/python3.6/multiprocessing/pool.py", line 119, in worker    result = (True, func(*args, **kwds))  File "/share/pkg.7/python3/3.6.9/install/lib/python3.6/multiprocessing/pool.py", line 44, in mapstar    return list(map(*args))  File "/share/pkg.7/paragraph/2.4a/install/lib/python3/grm/vcf2paragraph/__init__.py", line 286, in run_vcf2paragraph    alt_paths=params["alt_paths"])  File "/share/pkg.7/paragraph/2.4a/install/lib/python3/grm/vcf2paragraph/__init__.py", line 86, in convert_vcf    ref, indexed_vcf.name, ins_info_key, chrom, start, end, ref_node_padding, allele_graph)  File "/share/pkg.7/paragraph/2.4a/install/lib/python3/grm/vcfgraph/vcfgraph.py", line 128, in create_from_vcf    graph.add_record(record, allele_graph, varId, ins_info_key)  File "/share/pkg.7/paragraph/2.4a/install/lib/python3/grm/vcfgraph/vcfgraph.py", line 202, in add_record    self.add_alt(vcf.pos, vcf.stop, ref_sequence, alt_sequence, alt_samples, refSamples)  File "/share/pkg.7/paragraph/2.4a/install/lib/python3/grm/vcfgraph/vcfgraph.py", line 296, in add_alt    raise Exception("{}:{} missing REF or ALT sequence.".format(start, end))Exception: 2893187:2893191 missing REF or ALT sequence."""
2020-03-09 10:26:55,509 ERROR    The above exception was the direct cause of the following exception:
2020-03-09 10:26:55,510 ERROR    Traceback (most recent call last):
2020-03-09 10:26:55,510 ERROR      File "/share/pkg.7/paragraph/2.4a/install/bin/multigrmpy.py", line 52, in load_graph_description    header, records, event_list = convert_vcf_to_json(args, alt_paths=True)
2020-03-09 10:26:55,510 ERROR      File "/share/pkg.7/paragraph/2.4a/install/lib/python3/grm/vcf2paragraph/__init__.py", line 156, in convert_vcf_to_json    variants = pool.map(run_vcf2paragraph, zip(to_process, itertools.repeat(params)))
2020-03-09 10:26:55,510 ERROR      File "/share/pkg.7/python3/3.6.9/install/lib/python3.6/multiprocessing/pool.py", line 266, in map    return self._map_async(func, iterable, mapstar, chunksize).get()
2020-03-09 10:26:55,510 ERROR      File "/share/pkg.7/python3/3.6.9/install/lib/python3.6/multiprocessing/pool.py", line 644, in get    raise self._value
2020-03-09 10:26:55,511 ERROR    Exception: 2893187:2893191 missing REF or ALT sequence.
2020-03-09 10:26:55,511 ERROR    multiprocessing.pool.RemoteTraceback: """Traceback (most recent call last):  File "/share/pkg.7/python3/3.6.9/install/lib/python3.6/multiprocessing/pool.py", line 119, in worker    result = (True, func(*args, **kwds))  File "/share/pkg.7/python3/3.6.9/install/lib/python3.6/multiprocessing/pool.py", line 44, in mapstar    return list(map(*args))  File "/share/pkg.7/paragraph/2.4a/install/lib/python3/grm/vcf2paragraph/__init__.py", line 286, in run_vcf2paragraph    alt_paths=params["alt_paths"])  File "/share/pkg.7/paragraph/2.4a/install/lib/python3/grm/vcf2paragraph/__init__.py", line 86, in convert_vcf    ref, indexed_vcf.name, ins_info_key, chrom, start, end, ref_node_padding, allele_graph)  File "/share/pkg.7/paragraph/2.4a/install/lib/python3/grm/vcfgraph/vcfgraph.py", line 128, in create_from_vcf    graph.add_record(record, allele_graph, varId, ins_info_key)  File "/share/pkg.7/paragraph/2.4a/install/lib/python3/grm/vcfgraph/vcfgraph.py", line 202, in add_record    self.add_alt(vcf.pos, vcf.stop, ref_sequence, alt_sequence, alt_samples, refSamples)  File "/share/pkg.7/paragraph/2.4a/install/lib/python3/grm/vcfgraph/vcfgraph.py", line 296, in add_alt    raise Exception("{}:{} missing REF or ALT sequence.".format(start, end))Exception: 2893187:2893191 missing REF or ALT sequence."""
2020-03-09 10:26:55,511 ERROR    The above exception was the direct cause of the following exception:
2020-03-09 10:26:55,511 ERROR    Traceback (most recent call last):
2020-03-09 10:26:55,512 ERROR      File "/share/pkg.7/paragraph/2.4a/install/bin/multigrmpy.py", line 261, in run    graph_files = load_graph_description(args)
2020-03-09 10:26:55,512 ERROR      File "/share/pkg.7/paragraph/2.4a/install/bin/multigrmpy.py", line 52, in load_graph_description    header, records, event_list = convert_vcf_to_json(args, alt_paths=True)
2020-03-09 10:26:55,512 ERROR      File "/share/pkg.7/paragraph/2.4a/install/lib/python3/grm/vcf2paragraph/__init__.py", line 156, in convert_vcf_to_json    variants = pool.map(run_vcf2paragraph, zip(to_process, itertools.repeat(params)))
2020-03-09 10:26:55,512 ERROR      File "/share/pkg.7/python3/3.6.9/install/lib/python3.6/multiprocessing/pool.py", line 266, in map    return self._map_async(func, iterable, mapstar, chunksize).get()
2020-03-09 10:26:55,512 ERROR      File "/share/pkg.7/python3/3.6.9/install/lib/python3.6/multiprocessing/pool.py", line 644, in get    raise self._value
2020-03-09 10:26:55,513 ERROR    Exception: 2893187:2893191 missing REF or ALT sequence.
multiprocessing.pool.RemoteTraceback:
"""
Traceback (most recent call last):
  File "/share/pkg.7/python3/3.6.9/install/lib/python3.6/multiprocessing/pool.py", line 119, in worker
    result = (True, func(*args, **kwds))
  File "/share/pkg.7/python3/3.6.9/install/lib/python3.6/multiprocessing/pool.py", line 44, in mapstar
    return list(map(*args))
  File "/share/pkg.7/paragraph/2.4a/install/lib/python3/grm/vcf2paragraph/__init__.py", line 286, in run_vcf2paragraph
    alt_paths=params["alt_paths"])
  File "/share/pkg.7/paragraph/2.4a/install/lib/python3/grm/vcf2paragraph/__init__.py", line 86, in convert_vcf
    ref, indexed_vcf.name, ins_info_key, chrom, start, end, ref_node_padding, allele_graph)
  File "/share/pkg.7/paragraph/2.4a/install/lib/python3/grm/vcfgraph/vcfgraph.py", line 128, in create_from_vcf
    graph.add_record(record, allele_graph, varId, ins_info_key)
  File "/share/pkg.7/paragraph/2.4a/install/lib/python3/grm/vcfgraph/vcfgraph.py", line 202, in add_record
    self.add_alt(vcf.pos, vcf.stop, ref_sequence, alt_sequence, alt_samples, refSamples)
  File "/share/pkg.7/paragraph/2.4a/install/lib/python3/grm/vcfgraph/vcfgraph.py", line 296, in add_alt
    raise Exception("{}:{} missing REF or ALT sequence.".format(start, end))
Exception: 2893187:2893191 missing REF or ALT sequence.
"""

For , Paragraph inverts REF sequences as ALT. Here, the REF sequence is C (padding base) + ATAT. After reverse complement, ATAT is still ATAT. There is a trimming step in graph building phase that eliminates same bases between REF and ALT. In this variant, all bases will be trimmed because they're the same...That's why you see this error message.

Actually this representation is confusing. It is a but it also has a INS sequence. @ctsa , what is the supposed ALT of this variant?

Thanks for reporting! v2.5 release will have a better error message on this situation.

There is also a dup occurring at that site which may be complicating things:

chr6    2893187 MantaDUP:TANDEM:1:7410:7410:0:0:0;MantaDUP:TANDEM:8:21094:21094:1:0:0;MantaDUP:TANDEM:129143:0:0:0:0:0;MantaDUP:TANDEM:6:14973:14973:1:0:0;MantaDUP:TANDEM:98545:0:0:1:0:0;MantaDUP:TANDEM:62990:0:0:0:0:0;MantaDUP:TANDEM:66816:0:0:0:0:0;MantaDUP:TANDEM:2:21984:21984:1:0:0;MantaDUP:TANDEM:80362:0:0:0:0:0;MantaDUP:TANDEM:98083:0:0:0:0:0;MantaDUP:TANDEM:0:14436:14436:1:0:0;MantaDUP:TANDEM:7:10192:10192:0:0:0;MantaDUP:TANDEM:64160:0:0:0:0:0;MantaDUP:TANDEM:1:7753:7753:1:0:0;MantaDUP:TANDEM:4:10949:10949:1:0:0;MantaDUP:TANDEM:7:18966:18966:1:0:0;MantaDUP:TANDEM:7:12669:12669:1:0:0;MantaDUP:TANDEM:92715:0:0:1:0:0;MantaDUP:TANDEM:4:20498:20498:2:0:0;MantaDUP:TANDEM:75085:0:0:0:0:0;MantaDUP:TANDEM:0:13648:13648:1:0:0;MantaDUP:TANDEM:4:27357:27357:2:0:0;MantaDUP:TANDEM:75213:0:0:1:0:0;MantaDUP:TANDEM:0:5270:5270:1:0:0;MantaDUP:TANDEM:1:19103:19103:0:0:0;MantaDUP:TANDEM:2:15517:15517:1:0:0;MantaDUP:TANDEM:9:21216:21216:1:0:0;MantaDUP:TANDEM:5:13582:13582:1:0:0;MantaDUP:TANDEM:10:22557:22557:1:0:0;MantaDUP:TANDEM:0:13311:13311:1:0:0;MantaDUP:TANDEM:5:17416:17416:1:0:0;MantaDUP:TANDEM:63471:0:0:0:0:0;MantaDUP:TANDEM:5:21672:21672:1:0:0;MantaDUP:TANDEM:72641:0:0:0:0:0;MantaDUP:TANDEM:98874:0:0:1:0:0;MantaDUP:TANDEM:6:657:657:1:0:0;MantaDUP:TANDEM:64844:0:0:1:0:0;MantaDUP:TANDEM:69361:0:0:1:0:0;MantaDUP:TANDEM:8:18459:18459:1:0:0;MantaDUP:TANDEM:72455:0:0:0:0:0;MantaDUP:TANDEM:0:15177:15177:0:0:0;MantaDUP:TANDEM:82543:0:0:3:0:0;MantaDUP:TANDEM:100101:0:0:1:0:0;MantaDUP:TANDEM:3:34491:34491:2:0:0;MantaDUP:TANDEM:5:453:453:1:0:0;MantaDUP:TANDEM:71920:0:0:0:0:0;MantaDUP:TANDEM:75375:0:0:1:0:0;MantaDUP:TANDEM:8:22540:22540:1:0:0;MantaDUP:TANDEM:64714:0:0:0:0:0;MantaDUP:TANDEM:7:22858:22858:1:0:0;MantaDUP:TANDEM:68345:0:0:1:0:0;MantaDUP:TANDEM:64591:0:0:1:0:0;MantaDUP:TANDEM:7:9560:9560:1:0:0;MantaDUP:TANDEM:7:13563:13563:1:0:0;MantaDUP:TANDEM:64147:0:0:0:0:0;MantaDUP:TANDEM:71934:0:0:1:0:0;MantaDUP:TANDEM:2:15250:15250:1:0:0;MantaDUP:TANDEM:99981:0:0:1:0:0;MantaDUP:TANDEM:9:17467:17467:1:0:0;MantaDUP:TANDEM:0:22764:22764:1:0:0;MantaDUP:TANDEM:62283:0:0:0:0:0;MantaDUP:TANDEM:63120:0:0:2:0:0;MantaDUP:TANDEM:71271:0:0:1:0:0;MantaDUP:TANDEM:67500:0:0:1:0:0;MantaDUP:TANDEM:2:16398:16398:2:0:0;MantaDUP:TANDEM:6:13162:13162:0:0:0;MantaDUP:TANDEM:1:14161:14161:1:0:0;MantaDUP:TANDEM:3:24085:24085:1:0:0;MantaDUP:TANDEM:7:12967:12967:1:0:0;MantaDUP:TANDEM:0:9569:9569:2:0:0;MantaDUP:TANDEM:0:14270:14270:0:0:0;MantaDUP:TANDEM:7:17630:17630:0:0:0;MantaDUP:TANDEM:6:18095:18095:1:0:0;MantaDUP:TANDEM:6:23905:23905:1:0:0;MantaDUP:TANDEM:0:22231:22231:1:0:0;MantaDUP:TANDEM:0:486:486:0:0:0;MantaDUP:TANDEM:8:14872:14872:1:0:0;MantaDUP:TANDEM:65293:0:0:0:0:0;MantaDUP:TANDEM:2:22295:22295:0:0:0;MantaDUP:TANDEM:58519:0:0:0:0:0;MantaDUP:TANDEM:72661:0:0:1:0:0;MantaDUP:TANDEM:3:10030:10030:0:0:0;MantaDUP:TANDEM:12:23318:23318:0:0:0;MantaDUP:TANDEM:68906:0:0:0:0:0;MantaDUP:TANDEM:2:23725:23725:1:0:0;MantaDUP:TANDEM:0:15831:15831:0:0:0;MantaDUP:TANDEM:0:14511:14511:1:0:0;MantaDUP:TANDEM:72897:0:0:0:0:0;MantaDUP:TANDEM:74685:0:0:1:0:0;MantaDUP:TANDEM:77171:0:0:0:0:0;MantaDUP:TANDEM:59940:0:0:1:0:0;MantaDUP:TANDEM:3:8655:8655:0:0:0;MantaDUP:TANDEM:121919:0:0:1:0:0;MantaDUP:TANDEM:3:25904:25904:1:0:0;MantaDUP:TANDEM:182664:0:0:0:0:0;MantaDUP:TANDEM:175535:0:0:1:0:0;MantaDUP:TANDEM:62179:0:0:0:0:0;MantaDUP:TANDEM:8:16507:16507:1:0:0;MantaDUP:TANDEM:72402:0:0:0:0:0;MantaDUP:TANDEM:2:23155:23155:1:0:0;MantaDUP:TANDEM:1:14600:14600:1:0:0;MantaDUP:TANDEM:83098:0:0:2:0:0;MantaDUP:TANDEM:9:14467:14467:1:0:0;MantaDUP:TANDEM:80811:0:0:1:1:0;MantaDUP:TANDEM:86188:0:0:0:0:0;MantaDUP:TANDEM:10:22971:22971:0:0:0;MantaDUP:TANDEM:79300:0:0:1:0:0;MantaDUP:TANDEM:0:7401:7401:1:0:0;MantaDUP:TANDEM:45452:2:2:0:0:0;MantaDUP:TANDEM:3:15870:15870:1:0:0;MantaDUP:TANDEM:3:11374:11374:2:0:0;MantaDUP:TANDEM:72609:0:0:1:0:0;MantaDUP:TANDEM:137152:0:0:1:0:0;MantaDUP:TANDEM:0:9487:9487:1:0:0;MantaDUP:TANDEM:82080:0:0:2:0:0;MantaDUP:TANDEM:7:9696:9696:1:0:0;MantaDUP:TANDEM:3:14348:14348:1:0:0;MantaDUP:TANDEM:9:20435:20435:2:0:0;MantaDUP:TANDEM:2:24123:24123:1:0:0;MantaDUP:TANDEM:3:22140:22140:1:0:0;MantaDUP:TANDEM:0:16235:16235:1:0:0;MantaDUP:TANDEM:4:14045:14045:1:0:0;MantaDUP:TANDEM:10:15659:15659:1:0:0;MantaDUP:TANDEM:1:18638:18638:1:0:0;MantaDUP:TANDEM:3:16875:16875:1:0:0;MantaDUP:TANDEM:4:25043:25043:0:0:0;MantaDUP:TANDEM:0:9348:9348:1:0:0;MantaDUP:TANDEM:4:17754:17754:1:0:0;MantaDUP:TANDEM:70281:0:0:0:0:0;MantaDUP:TANDEM:163067:0:0:1:0:0;MantaDUP:TANDEM:100520:0:0:1:0:0;MantaDUP:TANDEM:70090:0:0:1:0:0;MantaDUP:TANDEM:159967:0:0:0:0:0;MantaDUP:TANDEM:5:12184:12184:0:0:0;MantaDUP:TANDEM:75650:0:0:1:0:0;MantaDUP:TANDEM:2:23861:23861:1:0:0;MantaDUP:TANDEM:69456:0:0:2:0:0;MantaDUP:TANDEM:23549:0:0:1:1:0;MantaDUP:TANDEM:68231:0:0:0:0:0;MantaDUP:TANDEM:72959:0:0:1:0:0;MantaDUP:TANDEM:72122:0:0:0:0:0;MantaDUP:TANDEM:61304:0:0:1:0:0;MantaDUP:TANDEM:55396:0:0:1:0:0;MantaDUP:TANDEM:57607:0:0:0:0:0;MantaDUP:TANDEM:15:23890:23890:2:0:0;MantaDUP:TANDEM:65646:0:0:1:0:0;MantaDUP:TANDEM:73940:0:0:1:0:0;MantaDUP:TANDEM:64731:0:0:1:0:0;MantaDUP:TANDEM:59342:0:0:1:0:0;MantaDUP:TANDEM:63461:0:0:0:0:0;MantaDUP:TANDEM:60535:0:0:0:0:0;MantaDUP:TANDEM:63063:0:0:1:0:0;MantaDUP:TANDEM:63281:0:0:1:0:0;MantaDUP:TANDEM:61343:0:0:1:0:0;MantaDUP:TANDEM:61130:0:0:1:0:0;MantaDUP:TANDEM:59978:0:0:0:0:0;MantaDUP:TANDEM:10:34233:34233:2:0:0;MantaDUP:TANDEM:66151:0:0:1:0:0;MantaDUP:TANDEM:58924:0:0:0:0:0;MantaDUP:TANDEM:63717:0:0:2:0:0;MantaDUP:TANDEM:66734:0:0:0:0:0;MantaDUP:TANDEM:8:30345:30345:0:1:0;MantaDUP:TANDEM:63098:0:0:0:0:0;MantaDUP:TANDEM:61660:0:0:0:0:0;MantaDUP:TANDEM:8:28796:28796:0:0:0;MantaDUP:TANDEM:3:11653:11653:0:0:0;MantaDUP:TANDEM:63674:0:0:1:0:0;MantaDUP:TANDEM:65859:0:0:0:0:0;MantaDUP:TANDEM:67949:0:0:0:0:0;MantaDUP:TANDEM:59436:0:0:1:0:0;MantaDUP:TANDEM:65757:0:0:0:0:0;MantaDUP:TANDEM:63077:0:0:1:0:0;MantaDUP:TANDEM:63016:0:0:2:1:0;MantaDUP:TANDEM:60566:0:0:1:0:0;MantaDUP:TANDEM:67776:0:0:1:0:0;MantaDUP:TANDEM:63475:0:0:1:0:0;MantaDUP:TANDEM:7:21517:21517:1:0:0;MantaDUP:TANDEM:63181:0:0:1:0:0;MantaDUP:TANDEM:65564:0:0:1:0:0;MantaDUP:TANDEM:61845:0:0:0:0:0;MantaDUP:TANDEM:71188:0:0:0:0:0;MantaDUP:TANDEM:65551:0:0:1:0:0;MantaDUP:TANDEM:69657:0:0:1:0:0;MantaDUP:TANDEM:8:19230:19230:0:0:0;MantaDUP:TANDEM:55093:0:0:0:0:0;MantaDUP:TANDEM:64871:0:0:1:0:0;MantaDUP:TANDEM:0:4703:4703:1:0:0;MantaDUP:TANDEM:65280:0:0:0:0:0;MantaDUP:TANDEM:75149:0:0:1:0:0;MantaDUP:TANDEM:68448:0:0:1:0:0;MantaDUP:TANDEM:54878:0:0:0:0:0;MantaDUP:TANDEM:0:21068:21068:1:0:0;MantaDUP:TANDEM:69793:0:0:0:0:0;MantaDUP:TANDEM:71422:0:0:1:0:0;MantaDUP:TANDEM:5:18792:18792:1:0:0;MantaDUP:TANDEM:65960:0:0:1:0:0;MantaDUP:TANDEM:68768:0:0:1:0:0;MantaDUP:TANDEM:61647:0:0:1:0:0;MantaDUP:TANDEM:2:20092:20092:1:0:0;MantaDUP:TANDEM:58980:0:0:0:0:0;MantaDUP:TANDEM:62613:0:0:1:1:0;MantaDUP:TANDEM:61240:0:0:0:0:0;MantaDUP:TANDEM:59156:0:0:0:0:0;MantaDUP:TANDEM:6:21712:21712:1:0:0;MantaDUP:TANDEM:63856:0:0:0:0:0;MantaDUP:TANDEM:60431:0:0:0:0:0;MantaDUP:TANDEM:59602:0:0:1:0:0;MantaDUP:TANDEM:58669:0:0:1:0:0;MantaDUP:TANDEM:60393:0:0:1:0:0;MantaDUP:TANDEM:6:26846:26846:0:0:0;MantaDUP:TANDEM:2:4072:4072:1:0:0;MantaDUP:TANDEM:58768:0:0:1:0:0;MantaDUP:TANDEM:58525:0:0:0:0:0;MantaDUP:TANDEM:60602:0:0:1:0:0  C       <DUP:TANDEM>    999     PASS    END=2893231;SVTYPE=DUP;SVLEN=44;SVINSLEN=73;SVINSSEQ=ATTATATATACGTATATATATACGTATATATAATATATATATTATATATACGTATATATAATATATATATATT;AC=217;AN=9578;FIBC_P=-0.0433043;HWE_SLP_P=-42.1212;FIBC_I=-0.0433133;HWE_SLP_I=-42.1553;MAX_IF=0.759877;MIN_IF=0.748001;LLK0=-268836;BETA_IF=0.505578,0.0461633,0.0721978,0.0701312,0.0841303;ANN=<DUP:TANDEM>|intron_variant|MODIFIER|SERPINB9|ENSG00000170542|transcript|ENST00000380698.4|protein_coding|5/6|c.567+180_567+223dup||||||;NS=4789;AF=0.0226561;MAF=0.0226561;AC_Het=211;AC_Hom=6;AC_Hemi=0;HWE=0.736495;ExcHet=0.771593
chr6    2893187 MantaINV:4:20498:20498:4:0:0;MantaINV:61660:0:0:1:0:0   C       <INV>   548     PASS    END=2893191;SVTYPE=INV;SVLEN=4;CIPOS=0,4;CIEND=-4,0;HOMLEN=4;HOMSEQ=TATA;SVINSLEN=51;SVINSSEQ=ACGTATATATATACGTATATATAATATATATATTATATATACGTATATATA;INV5;AC=2;AN=9578;FIBC_P=-0.000417444;HWE_SLP_P=-0.195618;FIBC_I=-0.000417444;HWE_SLP_I=-0.195618;MAX_IF=0.749985;MIN_IF=0.749985;LLK0=-281075;BETA_IF=0.49997,-3.8073e-06,1.89345e-06,3.9655e-06,2.17351e-06;ANN=<INV>|intron_variant|MODIFIER|SERPINB9|ENSG00000170542|transcript|ENST00000380698.4|protein_coding|5/6|c.567+220_567+223inv||||||;NS=4789;AF=0.000208812;MAF=0.000208812;AC_Het=2;AC_Hom=0;AC_Hemi=0;HWE=1;ExcHet=0.999896

Let me try to check with Chris internally. For now I recommend skipping such ambiguous records.

I got the same problem with the following entry

scaf000253	514459	MantaINV:5899:0:0:0:0:0	N	<INV>	34	SampleFT	END=514542;SVTYPE=INV;SVLEN=83;IMPRECISE;CIPOS=-351,352;CIEND=-355,356;INV5	GT:FT:GQ:PL:PR	1/1:MinGQ:9:86,12,0:0,4

That same problem: Exception: 514459:514542 missing REF or ALT sequence.

Funny enough, this is for sure a wrong call or misassembly because it's the same individual that was used to build the reference.

If I would understand better on what paragraph expect, I could ad appropriate adjustments in the following script to get them the manta output to paragraph compatible vcf file

Your error is exactly the same as above. For now I suggest not including such records before figuring out what Manta really means for them.

As discussed in the last post of #42, SVINSSEQ is part of variants that are breakpoints with insertions. I.e. nested variants; inversions with an insertion on aside.

My solution was to filter them out.