extract_longest_mRNA

An annotated gene often has more than one mRNA (transcript isoform), but downstream analyses need a single CDS sequence to represent each gene. We therefore extract the longest mRNA or CDS of each gene for the following analysis. The script here is suitable for extracting that sequence.
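The selection step can be illustrated with a minimal sketch. It is written in Python for illustration and is not the repository's Perl script. It assumes a GFF3 file in which each mRNA feature carries `ID` and `Parent` (gene) attributes and each CDS feature carries a `Parent` attribute naming its mRNA. The function names and the example records are hypothetical. "Longest" is taken here to mean the largest summed CDS length, and ties keep the first mRNA encountered.

```python
from collections import defaultdict


def parse_attributes(field):
    """Turn a GFF3 attribute column ('ID=x;Parent=y') into a dict."""
    attrs = {}
    for item in field.strip().split(";"):
        if "=" in item:
            key, value = item.split("=", 1)
            attrs[key.strip()] = value.strip()
    return attrs


def longest_mrna_per_gene(gff_lines):
    """Return {gene_id: mrna_id} for the mRNA with the largest total CDS length."""
    mrna_to_gene = {}              # mRNA ID -> parent gene ID
    cds_length = defaultdict(int)  # mRNA ID -> summed CDS length

    for line in gff_lines:
        if not line.strip() or line.startswith("#"):
            continue  # skip blank lines and comments
        cols = line.rstrip("\n").split("\t")
        if len(cols) != 9:
            continue  # not a well-formed GFF3 record
        feature, start, end = cols[2], int(cols[3]), int(cols[4])
        attrs = parse_attributes(cols[8])
        if feature == "mRNA" and "ID" in attrs and "Parent" in attrs:
            mrna_to_gene[attrs["ID"]] = attrs["Parent"]
        elif feature == "CDS" and "Parent" in attrs:
            # A CDS row may list several parents separated by commas.
            for parent in attrs["Parent"].split(","):
                cds_length[parent] += end - start + 1  # GFF coordinates are inclusive

    best = {}  # gene ID -> (mRNA ID, total CDS length)
    for mrna_id, gene_id in mrna_to_gene.items():
        length = cds_length.get(mrna_id, 0)
        if gene_id not in best or length > best[gene_id][1]:
            best[gene_id] = (mrna_id, length)
    return {gene_id: mrna_id for gene_id, (mrna_id, _) in best.items()}


# Hypothetical records: gene1 has two mRNAs, rna1 (300 bp CDS) and rna2 (600 bp CDS).
example_gff = [
    "chr1\tsrc\tgene\t1\t1000\t.\t+\t.\tID=gene1",
    "chr1\tsrc\tmRNA\t1\t1000\t.\t+\t.\tID=rna1;Parent=gene1",
    "chr1\tsrc\tCDS\t1\t300\t.\t+\t0\tID=cds1;Parent=rna1",
    "chr1\tsrc\tmRNA\t1\t1000\t.\t+\t.\tID=rna2;Parent=gene1",
    "chr1\tsrc\tCDS\t1\t300\t.\t+\t0\tID=cds2;Parent=rna2",
    "chr1\tsrc\tCDS\t501\t800\t.\t+\t0\tID=cds2;Parent=rna2",
]

print(longest_mrna_per_gene(example_gff))  # {'gene1': 'rna2'}
```

The resulting mRNA IDs could then be used to pull the matching records out of the CDS and pep FASTA files. How the FASTA headers map to GFF IDs depends on the annotation source and is not covered by this sketch.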

Primary Language: Perl

You just need to download the annotation files (GFF, pep, and CDS) for each species from NCBI, then put each species' files into its own directory, whose name begins with a three-character abbreviation of the species name. For example, use Mus for mouse (Latin name: Mus musculus).
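The naming convention can be checked with a small sketch, again in Python and not part of the repository's Perl code. It assumes the abbreviation is the first three letters of the genus name, which matches the Mus example but is not stated as a general rule. Both function names and the directory name in the example are hypothetical.

```python
def species_abbreviation(latin_name):
    """Assumed rule: first three letters of the genus, first letter capitalized."""
    genus = latin_name.split()[0]
    return genus[:3].capitalize()


def matches_species(dir_name, latin_name):
    """True if a directory name begins with the species abbreviation."""
    return dir_name.startswith(species_abbreviation(latin_name))


print(species_abbreviation("Mus musculus"))               # Mus
print(matches_species("Mus_annotation", "Mus musculus"))  # True
```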