WGLab/LIQA

Should not use gene name as key

Closed this issue · 0 comments

ENSEMBL has many different genes with the same gene name. E.g. U2:
http://useast.ensembl.org/Human/Search/Results?q=U2;site=ensembl;facet_species=Human;page=1

I think gene ID should be used as key all the time so the following lines:

my $info = $a[8];
if(/gene_name/) {
my @b = split("gene_name", $info);
my @c = split("\"", $b[1]);
$gene = $c[1];
} else {
my @b = split("gene_id", $info);
my @c = split("\"", $b[1]);
$gene = $c[1];
}
my @d = split("transcript_id", $info);

should be something like:

    my $info = $a[8];
    
    my @b = split("gene_id", $info);
    my @c = split("\"", $b[1]);
    $gene = $c[1];
    my @d = split("transcript_id", $info);