full-length or exon typing

Question

full-length or exon typing

bsb2014 opened this issue 10 months ago · 9 comments

bsb2014 commented 10 months ago

SpecHLA publication suggests that full-length typing outperforms exon typing. I am wondering if the reconstructed gene sequences/full-length (-u 0) are better than the reconstructed exon sequences (-u 1). Do the reads from noncoding regions (introns) improve phasing? Thanks

Answer 1 · 2023-11-09T14:19:08.000Z

Do I need to care about the message below that popped up during the full-length typing (-u 0)? Thanks

Use of uninitialized value $hash{"HLA_DRB1_1"} in split at /home/src/SpecHLA/script/whole/annoHLA.pl line 318.
Use of uninitialized value $hash{"HLA_DRB1_2"} in split at /home/src/SpecHLA/script/whole/annoHLA.pl line 318.

Answer 2 · 2023-11-10T02:40:37.000Z

Hi, the reads from noncoding regions (introns) can provide the linkage information between exons, thereby improving typing performance. And don't worry about the warning message, it has no impact.

Answer 3 · 2023-11-11T01:58:38.000Z

The warning message
"Use of uninitialized value $hash{"HLA_DRB1_1"} in split at /home/src/SpecHLA/script/whole/annoHLA.pl line 318.
Use of uninitialized value $hash{"HLA_DRB1_2"} in split at /home/src/SpecHLA/script/whole/annoHLA.pl line 318." often occurred with failure of DRB1 typing. Could you please let me know what the message means? Thanks

Answer 4 · 2023-11-11T02:13:28.000Z

Could you also explain what do ‘‘Bowtie,’’ ‘‘Exon,’’ ‘‘Whole.norealign,’’ ‘‘Whole,’’ and ‘‘Whole.SV’’ modes mean? Thanks

I found the answer, but it is not clear to me if Exon=Novoalign + exon? (It would be better if some aligner could replace Novoalign that is not free)

Answer 5 · 2023-11-11T04:08:37.000Z

If read binning with Bowtie2 + exon typing +15-20x read coverage + 150bp, how much accuracy for 2-field HLA typing? Thanks

Answer 6 · 2023-11-13T03:55:18.000Z

Hi,

The warning is caused by the strict requirement of Perl, we have removed the warning in the latest commit.
The default parameters are Novoalign + whole + realign + no SV. So, the mode name means its difference with the default parameters. E.g., exon means Novoalign + exon + realign + no SV. realign indicates using the database to link the unphased blocks.
We have not performed Bowtie2 + exon typing. But the accuracy of Bowtie2 + whole + 20x typing is roughly 0.8 in simulated data.

Answer 7 · 2023-11-13T15:55:06.000Z

I have not tested novoalign3, but i think it could work, maybe need some minor alterations in parameter settings.发自我的 iPhone在 2023年11月13日，22:43，bsb2014 ***@***.***> 写道： Do you happen to know if Novoalign 3 works with SpecHLA? Thanks —Reply to this email directly, view it on GitHub, or unsubscribe.You are receiving this because you commented.Message ID: ***@***.***>

Answer 8 · 2023-11-14T02:09:18.000Z

Many thanks for your helpful replies. I tested the SpecHLA with Novoalign 4. The Novoalign seems to treat Illumina reads as Sanger (see below). Is it normal? Thanks.

"# Interpreting input files as Sanger FASTQ."

Answer 9 · 2023-11-14T04:25:06.000Z

Don't worry. It's normal.

…

On Tue, Nov 14, 2023 at 10:09 AM bsb2014 ***@***.***> wrote: Many thanks for your helpful replies. I tested the SpecHLA with Novoalign 4. The Novoalign seems to treat Illumina reads as Sanger (see below). Is it normal? Thanks. "# Interpreting input files as Sanger FASTQ." — Reply to this email directly, view it on GitHub <#16 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/ALS7DWWTVW23JQRQWH73HITYELHFTAVCNFSM6AAAAAA7ETHWNKVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMYTQMBZGQZDQNBQHA> . You are receiving this because you commented.Message ID: ***@***.***>