pierrebarbera/epa-ng

FastTree

atoselan opened this issue · 4 comments

Hi, I'm just setting up a pipeline to place query sequences onto a phylogentic tree of ~5,000 reference sequences. I'm finding Raxml to be very slow for tree building and wondered whether epa-ng supports the use of FastTree. Will this be a problem when I have to supply the model parameters to epa-ng? I've looked online but haven't found any examples of this.

Cheers,
Andrew

Hey @atoselan,

as far as I am aware, as long as you get a newick tree and its model parameters in some form that EPA-ng understands, that should work. See here for the specifications of model params that EPA-ng expects.

I don't know in which format FastTree outputs its parameters. If they are not in that format, once you have the newick file from FastTree, you can use RAxML-ng to obtain the model parameters for it, which will not run the whole tree search, but only give you these params, as explained in the above link as well.

Hope that helps
Lucas

Many thanks, it turned out to be straight-forward to get a fasttree info file using raxml. I know this is a different issue but I can't get papara to work, I get a vague error message about inconsistency in the alignment which I can't resolve. I've started looking at using hmmer/hmmalign instead and wondered if you had any examples to follow for using this approach. Hmmer and hmmalign work fine but how to I ensure that I have alignments of the same length? I've created a hmm from the reference alignment, aligned the queries to the hmm but now the alignments are of different lengths.

Hm, if I recall correctly, hmmer/hmmalign uses a flag -m to keep the length. I'd check their manual :-)