This repository contains code and data for the assessment of the availability of microbial community sequencing data by Stephanie D. Jurburg, Maximilian Konzack, Nico Eisenhauer, and Anna Heintz-Buschart. The following files are included:
This uses code from https://github.com/komax/teitocsv and a conda environment defined in pdf2seq.yaml.
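A minimal sketch of how the conda environment could be created from pdf2seq.yaml, assuming conda is installed and the command is run from the repository root. The environment name `pdf2seq` is an assumption made for illustration; the name actually defined inside the YAML file may differ.

```shell
# Assumed values: the file name comes from this README,
# the environment name is hypothetical.
ENV_FILE="pdf2seq.yaml"
ENV_NAME="pdf2seq"

# Create the environment only if conda and the YAML file are both available.
if command -v conda >/dev/null 2>&1 && [ -f "$ENV_FILE" ]; then
    # -f reads the environment specification, -n sets the environment name
    conda env create -f "$ENV_FILE" -n "$ENV_NAME"
else
    echo "conda or $ENV_FILE not found; skipping environment creation"
fi
```

Once created, the environment can be activated in an interactive shell with `conda activate pdf2seq` (again using the assumed name).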
Main figures:
with input data:
- mining_output.txt
- 191206_simplifiedAll.RDS
- 191206_simplifiedSub.RDS
- accessions_bacteria_cleaner_onlyExtra_AHB.txt
- 200315_simplifiedV4.RDS
- all_study_pie.txt
Analysis of raw and metadata:
with input data: