wdecoster/methplotlib

Window parameter cannot deal with complicated fasta headers

bastian-wur opened this issue · 4 comments

Hi everyone,

I'm trying to use methplotlib, and I've run into some issues.
I'm trying to use it to analyze some reads I called earlier.
Issue number #1) There is no reference for this. To get around this, I converted one of the read fastq files to fasta, and used this one as reference
Issue #2) Obviously the reads don't have nice names. If I use -w "37b7bed0-436a-4676-a61a-023255f80bee runid=0fa505d79c6c1b4fe0085a62e508181b26c78340 sampleid=I22-1145-01 read=168 ch=445 start_time=2022-05-19T13:59:55Z model_version_id=dna_r10.4.1_e8.2_400bps_hac@v3.5.2:1-200"

I get ERROR: Window (-w/--window) inproperly formatted, examples of accepted formats are:
'chr5:150200605-150423790' or 'ENST00000647408'

I do understand the error, but in theory my input is proper, I guess. Just the fasta header is complicated.,

If you have any other advice how I could visualize my data, then I'd be very happy.

Thanks,
Bastian

Hi Bastian,

I assume you then aligned your data to that one fastq you used as pseudo-reference?
I think not the entire fasta identifier will be used in the sam/bam/cram file - and it might look like 37b7bed0-436a-4676-a61a-023255f80bee without the rest of the information.

Wouter

Actually, Guppy had directly outputted a BAM file, so I had converted it to CRAM with the fastq as reference. Turns out that the identifiers between both are different, and the SAM/BAM didn't have a header, so there's a bunch of issues more, I guess :/, and all not related to methplotlib. Sorry for bothering you.
(although it would be handy to directly load the Guppy output)

I assume guppy created a BAM file with unaliged reads, i.e. ubam?

Sorry, I'm getting back to this so late. Yes, I had run Guppy without a reference at that time. In the meantime I've run it again with a human reference, I've not tested methplotlib yet, but I do not expect any issues with that :).