/metabarcoding

Daniel's pipeline to analyze metabarcode data + iNEXT figures

metabarcoding

Analyzing metabarcoding data using the packages Metacoder and MicrobiotaProcess

This project uses COI metabarcoding data from two nights of light-trapping at Barro Colorado Island in two seasons (Wet/Dry).

The project aims at describing biodiversity classified through BOLD's mBrave platform based on BINs, and then compare it to biodiversity classified through the ForestGeo Arthrpod Initiative (traditional taxonomy and barcoding).

Raw metabarcoding data is processed through mBrave (with costum-made parameters) and downloaded as a .csv file which includes organism (in ranked classification) and the number of reads for that organism on each light-trap sample for each of the two sampling nights/seasons.

To follow this pipeline, go through the scripts in order 01 - 02 - 03 etc.

NOTE 1: SCRIPT 01 IS FOR INFORMATION PURPOSES ONLY. You should not re-run these scripts as you will overwrite the existing files (which have already been merged and cleaned).

NOTE 2: Most critical merging/cleaning steps were later done in excel where I 'arbitrarily' had to decide which species was that BIN associated to, for BINs with more than one species name. This happened much more frequently than I would hope, but its a consequence of BOLD. As a general rule, I used the first species name for that BIN, which normally corresponds to the name of first database which the barcodes were compared against (in this case, DS-BCIARTH).

NOTE 3: For the MicrobiotaProcess, some further cleaning will be required, as of now, script 03 still contains some preliminary figures (cladograms, specifically) because the data needs to be further refined to include regular expressions to define the taxonomic rank (e.g. p__;c__;o__) in order for us to be able to properly filter the data to focal groups and/or focus only on a specific rank to visualize the differences between seasons. To bypass this issue, I filteered the datasets using Metacoder and created new 'files' for each focal order. If anyone can think of a Tidy way to do this, go ahead.