Support for gzipped reads

Question

donovan-h-parks opened this issue 5 years ago · 12 comments

Hi.

Are there plans to support reads in gzip-compressed format (e.g. my_reads.fq.gz)? This would be a major help for incorporating MetaCache into workflows.

Cheers,
Donovan

Answer 1 · 2020-03-04T16:35:01.000Z

Yes, it's on my list for the next major version, but we are currently ironing out some bugs and also working on some improvements. So, it may take some time.

Answer 2 · 2020-03-04T16:51:31.000Z

Hi. Thank you for the quick response. In my testing, MetaCache is certainly among the best performing classifiers available. Are any of the upcoming bug fixes critical?

Answer 3 · 2020-03-04T16:57:35.000Z

There's currently a bug that was introduced in the last version. It leads to unnecessarily high memory consumption during database builds. The fix is already implemented and will be released shortly.
There are some other minor things, nothing that would affect the classification results.

Answer 4 · 2020-03-04T17:09:36.000Z

Thanks. I'm currently using v0.9.0, so perhaps I have avoided these issues.

Answer 5 · 2020-09-28T23:21:33.000Z

Yes, gzipped FASTQ compatibility would be very useful for me as well.

Answer 6 · 2020-10-09T13:16:13.000Z

Just to let you all know that the next version of MetaCache does support reading gzipped sequence files.
Since it also contains a large portion of new code for accelerating builds and querying, it might take a few weeks until we release it.

Answer 7 · 2020-10-09T13:20:10.000Z

Nice, thanks!

Answer 8 · 2021-03-22T20:11:47.000Z

Is there any ETA on when the new release with gzip support will be out?

It would be important for incorporation into my workflows as well.

Answer 9 · 2021-03-22T21:05:17.000Z

We currently have a paper under review. We will make the code of the latest version, which also supports reading gzipped files (and adds many more capabilities), available as soon as the paper is accepted (fingers crossed). Unfortunately, we don't have the time to back-port the reading of compressed files to an older version at the moment, so it will likely take a few weeks until we can make the newest version public.
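
For anyone who needs a stopgap in the meantime, a generic workaround is to decompress the reads to a plain file before handing them to a tool that only accepts uncompressed input. The following is a minimal sketch using only Python's standard library; the function name `decompress_reads` and the file names are hypothetical, and nothing here is part of MetaCache itself.

```python
import gzip
import shutil
from pathlib import Path


def decompress_reads(gz_path, out_dir):
    """Decompress a gzipped read file into out_dir and return the new path.

    Hypothetical helper for illustration; not a MetaCache function.
    """
    gz_path = Path(gz_path)
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)

    # Drop a trailing ".gz" so "my_reads.fq.gz" becomes "my_reads.fq".
    if gz_path.name.endswith(".gz"):
        name = gz_path.name[:-3]
    else:
        name = gz_path.name + ".out"
    out_path = out_dir / name

    # Stream the data in chunks so large read files never sit fully in memory.
    with gzip.open(gz_path, "rb") as src, open(out_path, "wb") as dst:
        shutil.copyfileobj(src, dst)
    return out_path
```

The returned path can then be passed to the classifier as an ordinary uncompressed read file, at the cost of temporary disk space for the decompressed copy.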

Answer 10 · 2021-03-23T08:29:08.000Z

No problem, good to know the paper is under review! Good luck, and looking forward to it!

Answer 11 · 2021-06-23T10:26:06.000Z

Reading gzipped files is now supported in the latest release!

Answer 12 · 2021-06-23T11:03:32.000Z

Woohoo thank you!