/DynamicRepresentation

R library for graphical and numerical representation of the DNA/RNA sequences.

Primary LanguageR

drep

Graphical and Numerical Representation of DNA Sequences

R library for characterizing biological sequences graphically and numerically. If you would like to get the gist of implemented approach please read the following article

Bo Liao, Kequan Ding, A 3D graphical representation of DNA sequences and its application. Volume 358, Issue 1, 2006, Pages 56-64.

Install

In order to install and load the package run the following code in the R console

library(devtools)
install_github('officialprofile/DynamicRepresentation')
library(drep)

How to use it

In most cases a single line of code can yield satisfactory results. For example

plot2DGraph(c('KX369547', 'HQ234498', 'MH063265'), genbank = TRUE, main = 'Comparison of three sequences')

returns a ready-to-use graph of the ZIKV genomes

For three-dimensional representation one can use plot3DGraph. The function is based on rgl library.

plot3DGraph(c('DQ415340','AF509109'), genbank = TRUE, radius = 10)

plot2DGraph(c('DQ415340','AF509109'), genbank = TRUE, 
            legend.pos = 'bottomright', 
            colorset = c('lightblue', 'orange'),
            main = 'Two influenza A type NA genes (H1N1 and H5N1)')

Notice that plotting such graphs can be achieved solely by putting the GenBank accession numbers. plot2DGraph as well as plot3DGraph are intended to be quite versatile though.

In similar fashion one can obtain numerical characteristics by employing dRep function, e.g.

dRep(c('KX369547', 'HQ234498'), genbank = TRUE)

returns the following dataframe

len mi_x mi_y sqrt ...
KX369547 10769 84.660 -16.061 86.170 ...
HQ234498 10269 75.171 -17.691 77.224 ...

Naturally, instead of using data from GenBank, one can apply implemented method to one's own sequence or vector of sequences, e.g.

seq <- 'ACCCTCGCGCCGCGATTCTACGGACCCTGAAAATG'
dRep(seq)