/Primula

Scripts and parameter files used for Koontz et al. 2022, analysis of the Primula cusickiana species complex

Primary LanguageF*

Overview

This repository contains the scripts and parameter files used for the genomic analysis of the Primula cusickiana species complex. The article describing these analyses is published with Systematic Botany and can be found here.

ipyrad

The ipyrad directory contains 3 folders, each corresponding to a unique run of the ipyrad software pipeline. Directories contain the relevant params files used for each run, as well as relevant outfiles (although not all output files are included, as this is unecessary and would require too much space).

The PreliminaryRuns directory contains the ipyrad barcode and parameters files used for initial exploratory run (titled "UniqueReps"), along with output files. ComplexWide holds the same files for the final run including all complex member samples, and MaguireOnly contains the relevant ipyrad files for the run examining only maguirei samples.

In addition, ComplexWide and MaguireiOnly include folders which contain the files used for FST calculations (using VCFtools).

The ComplexWide folder also contains the JuneSubset.nex file, which was passed into the software SplitsTree to generate the split network diagram shown in the manuscript.

R

The R directory contains the R script used in data analysis. This includes the plotting code for STRUCTURE plots (in structure) and DAPC (in DAPC). Code used to generate Jaccard distribution of samples is included in the JaccardDistances.R script.

Additionally included are R scripts used in intial data exploration.

Mapping

The Mapping directory includes the Python script and .csv file used to generate the map of samples with the cluster membership at STRUCTURE output K=7. Credit goes to Dr. Carol Rowe for compiling this Python script!

STRUCTURE

The structure directory contains folders holding the mainparams and extraparams files for two STRUCTURE runs: the complex wide analysis, and the analysis looking at strictly maguirei samples. Additionally, output files (.str, .stru, .vcf, and .nex) are included, as well as the BASH script used to run STRUCTURE.

Notes

For any questions regarding these scripts, please contact Austin Koontz (either through GitHub or via email). Other scripts/parameter files used in the analysis of this dataset can be made available upon request. Raw data has been uploaded to the NCBI short-read archive (SRA), accession number PRJNA705310.