Transform a GenBank file into a FASTA file and a feature table.

Primary language: Python. License: GNU General Public License v3.0 (GPL-3.0).

GBK transformer

Transformation of a GenBank file into a FASTA sequence file and a feature table.

Introduction

gbk_transformer.py is a Python script that transforms a GenBank file into a FASTA sequence file and a feature table.

To transform multiple GenBank files, use gbk_transformer.batch.py instead.

Motivation

The goal is to transform a GenBank file (.gbk, .gb or .gbff) into a FASTA sequence file (.fna, .fas or .fasta), and to write its annotations to a feature table that can be easily manipulated in programs like Excel.

Prerequisites

This program was written in Python 3.6.7.

No additional Python modules or external programs are needed to run it.

Usage

If Python 3 is installed on your system, place the program in the same directory as your GenBank files and run it with a command like:

python gbk_transformer.py -i input.gbff

The program then writes two files:

input.fna
input.xls
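
To give a rough idea of what the FASTA output involves, here is a minimal sketch of pulling the sequence out of a GenBank record using only the standard library. It is not the code from gbk_transformer.py: the function name `genbank_to_fasta`, the use of the LOCUS name as the FASTA header, and the 60-character line width are all assumptions made for illustration, and it handles only the first record in the text.

```python
import re


def genbank_to_fasta(gbk_text, width=60):
    """Return the first GenBank record in gbk_text as a FASTA string.

    Hypothetical helper for illustration only; the header choice (LOCUS
    name) and line width are assumptions, not the script's behaviour.
    """
    header = "unknown"
    seq_parts = []
    in_origin = False
    for line in gbk_text.splitlines():
        if line.startswith("LOCUS"):
            fields = line.split()
            if len(fields) > 1:
                header = fields[1]
        elif line.startswith("ORIGIN"):
            # The sequence block starts on the line after ORIGIN.
            in_origin = True
        elif line.startswith("//"):
            # "//" terminates a record; stop after the first one.
            break
        elif in_origin:
            # Sequence lines look like "        1 atgcatgcat gcatgcatgc";
            # keep only the letters.
            seq_parts.append(re.sub(r"[^A-Za-z]", "", line))
    sequence = "".join(seq_parts)
    # Wrap the sequence into fixed-width lines.
    lines = [sequence[i:i + width] for i in range(0, len(sequence), width)]
    return ">" + header + "\n" + "\n".join(lines) + "\n"
```

Called on the text of a GenBank file, the function returns a string that could be written to a .fna file. Extracting the annotations for the feature table requires parsing the FEATURES block and is not covered by this sketch.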

To process multiple GenBank files, first create a list of the input files with a command like:

ls *.gbff>list.gbff.txt

Then process them with gbk_transformer.batch.py:

python gbk_transformer.batch.py -l list.gbff.txt
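
On systems where `ls` is not available (for example, a plain Windows command prompt), the list file could also be built with Python itself. The following is a minimal sketch under the assumption that the batch script expects one file name per line, which is what the `ls` redirect above produces; the function name `write_input_list` and its parameters are hypothetical and not part of this repository.

```python
from pathlib import Path


def write_input_list(directory=".", pattern="*.gbff", list_name="list.gbff.txt"):
    """Write the names of files matching pattern to list_name, one per line.

    Hypothetical helper; assumes the batch script reads one file name
    per line, as produced by redirecting the output of ls.
    """
    folder = Path(directory)
    # Sort the names so the list is stable across runs.
    names = sorted(p.name for p in folder.glob(pattern) if p.is_file())
    (folder / list_name).write_text("".join(name + "\n" for name in names))
    return names
```

After calling `write_input_list()` in the directory that holds the GenBank files, the batch command shown above can be run on the resulting list.gbff.txt.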