metagenome-atlas/atlas

atlas ,how to not bin?

Closed this issue · 15 comments

I need get a taxnomy and otu table which the input from contings , but the atlas create a taxnomy which contain binning ID.

atlas run genomes --skip-binning --resources mem=600 -j 130 is error!

Hmm, I'm happy to help ou but I don't understand what you need.
What microbiome do you have?

Atlas is designed to assemble and bin genomes from your reads and then use them for quantification.

Depending on the microbiome you want you could get reference genomes that are already assembled. In which case you don't need to assemble.

You can also taxonomically annotate your contigs but don't get an OTU.

What do you exactly mean by OTU, usually the term is used for 16S sequencing.

Thanks,I want to annotate my contigs by atlas.How?

atlas run genomes --skip-binning

image

Set "SemiBin" as final_binner in the config file. And run atlas

atlas run binning --omit-from semibin_train

Run this first with the --dryrun option to check what it does.

It should filter contigs <1500bp and annotate each sample with the taxonomy

Thanks! it works!
image
but I have a problem about raw count.I need a table which the row names are contings and the cloumns are SampleID.How?
image

For each sample you have the coverage in "{sample}/assembly/contig_stats/postfilter_coverage_stats.txt" in the same folder you even find it by base and by 1kb block if this is of interest to you.

if we want to analyze different samples for difference analysis for contings, then we can make a unified table. The columns are SampleID and the rownames are cotings. How can I get the table ?

when I use "atlas run binning --omit-from semibin_train", the output is the table which contain only one sample.

Can you explain what you want to do? I do not do a coassembly. each sample get's assembled, separately. also, the taxonomic annotation is per sample.

I create a unified genecatalog and a unified set of MAGs for quantification. However, I do not know how to combine all the contigsas there are many partially similar contigs in different samples.

Thanks, Yes,I want to combine all the contigs in different samples in one table. Maybe I need use other method

Because I changed the samples.tsv file, and save as samles_new.tsv . can I specify its absolute path?
atlas run qc --samples /home/zhiyu/atlas/samles_new.tsv ,right?

No you cannot specify the path to the samples.tsv. You should rename the old to samples_old.tsv and the new to samples.tsv. By the way if you have questions about the command line interface simply write --help .

Thanks, Yes,I want to combine all the contigs in different samples in one table. Maybe I need use other method

By the way there is a way to do what you want with atlas:

run:

atlas run None "Cobinning/vamb/coverage.tsv" <other params>

This uses minimap to map all reads to the combined filtered contigs, which you have taxonomically annotated before.
However I'm not sure how this approach handles the multi mapping.

It's uses in theory for vamb binning, but you are not interested in binding as I understood.

#549 (comment)

I often run a lot of batch data by atlas if rename is troublesome. if sample.tsv is specified, it is convenient for me to record.

I don't understand.

If I have 3 batches running in the same directory for sample.tsv, I will change my name three times. So I want to specify the name. Example: sample _ batch1.tsv sample _ batch2.tsv sample _ batch3.tsv.

Why would you run three batches in the same directory?

Maybe you don't know that atlas runs already everything in parallel and can be effiently be deployed on a cluster. See the docs.

Once the qc step passed the sample.tsv is no longer altered and you could run batches. If really necessary.