It is recommended to install songbird as follows
conda create -n songbird_env python=3.6 numpy=1.15.4 scikit-bio=0.5.5 seaborn pandas=0.23.4 -c conda-forge
source activate songbird_env
conda install tensorflow=1.10 tqdm nomkl
conda install biom-format h5py -c conda-forge
conda install jupyter notebook
conda install songbird -c conda-forge
The scripts used to run songbird can be found in the scripts directory
All data files are stored under the data directory.
- oral_deblurred.biom : the deblurred microbe abundance table.
- oral_metadata.txt : sample metadata information for the saliva time series dataset
- oral-collapsed-table.qza : the qiime2 artifact of collapsed microbial abundances in the saliva time series dataset. This used taxonomy collapsing and closed ref picking.
- oral-collapsed-table.biom : the biom file of collapsed microbial abundances in the saliva time series dataset. This was extracted from oral-collapsed-table.qza
- byrd_skin_table.biom : The microbe abundances from the shotgun dataset in the atopic dermitatis dataset in Byrd et al 2017.
- byrd_final_microbes.txt: Microbes that were highlighted to be interesting Byrd et al
- byrd_metadata.txt : Sample metadata in Byrd et al
- read_counts_BacteriaMalassezia_62subjectsUPto3mismatchs_ADonly.txt : M. globosa and P. acnes counts in Leung et al
- soils/ : directory containing biom tables and sample metadata to reproduce the Central Parks analysis.
Note that the qiime2 artifacts contain all of the processing information in the providence, so it is possible to retrace all of the qiime2 commands that were performed in order to reproduce those artifacts.
The analysis notebooks can be found under the ipynb folder