



This script iterates over an Excel file of GenBank accession numbers. For every valid accession number, it creates an individual FASTA file with a customized defline based on inputs from within the spreadsheet. It also creates a combined FASTA file and a .txt file with every valid accession number. The script is specific to the Excel file associated with the following bioinformatics paper (https://doi.org/10.1016/j.isci.2021.102816). To make it work with a different Excel spreadsheet, the user will need to edit the dataframe keys.
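
The workflow described above can be sketched roughly as follows. This is a minimal illustration, not the notebook's actual code: the column names (`Accession`, `Species`, `Gene`), the output file names, and the `fetch_sequence` callable are all hypothetical and stand in for whatever the real spreadsheet and GenBank lookup use.

```python
from pathlib import Path

# Hypothetical dataframe keys; edit these to match your own spreadsheet.
ACCESSION_KEY = "Accession"
LABEL_KEYS = ["Species", "Gene"]


def build_defline(row, accession_key=ACCESSION_KEY, label_keys=LABEL_KEYS):
    """Build a FASTA defline from one spreadsheet row (a dict-like object)."""
    parts = [str(row[accession_key]).strip()]
    # Spaces are replaced so the defline stays a single token.
    parts += [str(row[key]).strip().replace(" ", "_") for key in label_keys]
    return ">" + "_".join(parts)


def write_fasta_files(rows, fetch_sequence, out_dir):
    """Write one FASTA file per valid accession, plus combined outputs.

    fetch_sequence(accession) is an assumed helper that returns the
    sequence as a string, or None when the accession is not valid.
    """
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    records, valid = [], []
    for row in rows:
        accession = str(row[ACCESSION_KEY]).strip()
        sequence = fetch_sequence(accession)
        if not sequence:
            continue  # skip accessions that could not be resolved
        record = f"{build_defline(row)}\n{sequence}\n"
        (out_dir / f"{accession}.fasta").write_text(record)
        records.append(record)
        valid.append(accession)
    # Hypothetical names for the combined FASTA and accession list.
    (out_dir / "combined.fasta").write_text("".join(records))
    (out_dir / "valid_accessions.txt").write_text(
        "".join(a + "\n" for a in valid)
    )
    return valid
```

With pandas installed, the rows could come from something like `pd.read_excel("accessions.xlsx").to_dict("records")`, and `fetch_sequence` could wrap a GenBank query such as Biopython's `Entrez.efetch`. Both are assumptions about how one might wire this up, not a description of the notebook.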