jonathonthill/sangerseqR

Unable to understand difference between two files throwing a warning

Opened this issue · 4 comments

Hi Jonathon,

I've been using your fantastic package sangerseqR to import .ab1 files, but am quite clueless what causes the warning below that I have been observing only for one of the two provided files. I have used readsangerseq() for import.

"Invalid characters removed from primary basecalls. This may result in basecalls being shifted. Please check chromatogram."

It's quite surprising, given that they were obtained in the same experiment, with exactly the same settings etc. Do you know what's different between those two files and what might have caused it?It would be absolutely amazing if you could help me out here!

All the best,
David

no_error.zip

Thank you very much for your answer. So how might this odd inherent error in the .ab1 files influence my downstream analysis? Merely wrong representation in the graph function? Wrongly sangerseqR basecalling? Is the primary basecalling by the sequencing company affected? I am implementing a small app where users are presented your graphs to decide whether they can see a pattern, and manually resolve the ambiguity. So I was worried that peaks are shifted or similar.