Mycorrhiza algorithm

Published in -Mycorrhiza: Combining phylogenetic networks and Random Forests for prediction of ancestry from multilocus genotype data

Installing Mycorrhiza on Ubuntu 16.04

Install pip3, Java and the tkinter library

sudo apt-get install python3-pip python3-tk default-jre

If you don't already have the package manager HomeBrew, install it before proceeding.

ruby -e "$(curl -fsSL https://raw.githubusercontent.com/Homebrew/install/master/install)"

Install Mycorrhiza

sudo -H pip3 install --upgrade mycorrhiza

Install SplitsTree

The package can be found here. Follow the installer instructions, leaving all settings to default.

If the link above is not available - find the most recent version of the SplitsTree: http://ab.inf.uni-tuebingen.de/data/software/splitstree4/download

Import the necessary modules.

from mycorrhiza.dataset import Myco
from mycorrhiza.analysis import CrossValidate
from mycorrhiza.plotting.plotting import mixture_plot

(Optional) By default Mycorrhiza will look for SplitStree in your home folder. I you wish to specify a different path for the SplitsTree executable you can do so in the settings module.
```
from mycorrhiza.settings import const
const['__SPLITSTREE_PATH__'] = '~/splitstree4/SplitsTree'
```
Load some data. Here data is loaded in the Mycorrhiza format from the Gipsy moth sample data file. Example data can be found here.
```
myco = Myco(file_path='data/gipsy.myc')
myco.load()
```

Run an analysis. Here a simple 5-fold cross-validation analysis is executed on all available loci, without partitioning.

cv = CrossValidate(dataset=myco, out_path='data/')
cv.run(n_partitions=1, n_loci=0, n_splits=5, n_estimators=60, n_cores=1)

For microsatellite loci set the is_str flag to True.

```python
data = Myco(file_path='data/myco.myc', is_str=True)
data = Structure(file_path='data/myco.str', is_str=True)
```

Diploid genotypes occupy 2 rows (the sample identifier must be identical).

Diploid genotypes occupy 2 rows (the sample identifier must be identical).