/Corona-RNA-Extraction

Extract the RNA of Corona virus from the Biologist's reports on the sequence of the virus genome.

Primary LanguageR

Corona-RNA-Extraction

Extract the RNA of Corona virus from the Biologist's reports on the sequence of the virus genome.

Corona virus is a RNA virus , single strand RNA.

But in the data submitted by the biologist's to The NCBI, consists of the DNA sequence. This is because, they used the reverse transcriptase enzyme for the virus genome sequence classification, and the result is what is known as a cDNA.

For this reason we need to get the complimentary, to retrieve the genome of the virus.

DATA

About NC-045512
NC-045512 data file

NC-045512 is one of the first strand discovered in China.

About MN988668
MN988668 data file

The data is represented in FASTA, a text based format for representing sequences.

The data files contain the cDNA, with first line being the ID for the genome and rest is the cDNA sequence.

The scripts create a complimentary sequence for this sequence to retrieve the original sequence of the virus.

Sequence Complimenting Rule

DNA

Adenine (A), Thymine (T), Cytosine (C), Guanine (G)

DNA complementary base pairing rule

  • A -> T
  • C -> G
  • G -> C
  • T -> A

RNA

Adenine (A), Cytosine (C), Guanine (G), Uracil (U)

RNA complementary base pairing rule

  • A -> U
  • C -> G
  • G -> C
  • T -> A

MADE for the Love of Biology.