rcedgar/renomad

Understanding renomad output

Closed this issue · 3 comments

Thank you for your work on this implementation of nomad.

I would like to better understand the output. I analyzed 3 bioprojects of ~50 samples per project. There are hundreds of thousands to hundreds of millions of significant results when I run renomad. I don't see any P-values in the output files, and I'm also confused that all the anchor-targets are shown. Shouldn't each row be a single anchor along with a P-value of some kind, rather than the anchor and the target together?

I ran renomad with the following command:

renomad -joinp sample_paths.txt -maxp 0.05 -tsv3out renomad_output.tsv

And each time I ran it I got this warning:

WARNING: Option -maxp not used

Any advice?

The -maxp not used warning was a bug, I posted a fix (see changes to joinp.cpp). The output format is described in the README.md file, there is no P-value by design. The idea is that the expensive processing is done by renomad, downstream analysis such as P-values, effect size, contigs... can be done in other modules e.g. python.

Great, thank you. But it is calculating p-values when it applies the -maxp parameter? These results all meet some preliminary test of significance?

Yes, it calculates P-values, default is P<=0.05. You can safely use the version with the bug if you are happy with the default value.