Should not use gene name as key
Closed this issue · 0 comments
baraaorabi commented
ENSEMBL has many different genes with the same gene name. E.g. U2:
http://useast.ensembl.org/Human/Search/Results?q=U2;site=ensembl;facet_species=Human;page=1
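To illustrate the collision, here is a minimal Perl sketch. It is not taken from LIQA: the attribute strings, the IDs `GENE_A`/`GENE_B` and `TX_A1`/`TX_B1`, and the hash names `%by_name` and `%by_id` are all made up for the example. It assumes GTF-style attributes of the form `key "value";`.

```perl
use strict;
use warnings;

# Hypothetical GTF attribute strings: two distinct genes that share the name "U2".
# The gene and transcript IDs are invented for illustration.
my @attributes = (
    'gene_id "GENE_A"; transcript_id "TX_A1"; gene_name "U2";',
    'gene_id "GENE_B"; transcript_id "TX_B1"; gene_name "U2";',
);

my (%by_name, %by_id);
for my $info (@attributes) {
    # Pull each quoted value out of the attribute column.
    my ($gene_id)   = $info =~ /gene_id "([^"]+)"/;
    my ($gene_name) = $info =~ /gene_name "([^"]+)"/;
    my ($tx_id)     = $info =~ /transcript_id "([^"]+)"/;
    next unless defined $gene_id && defined $gene_name && defined $tx_id;

    push @{ $by_name{$gene_name} }, $tx_id;  # both genes collapse into one entry
    push @{ $by_id{$gene_id} },     $tx_id;  # each gene keeps its own entry
}

printf "keys by gene name: %d\n", scalar keys %by_name;
printf "keys by gene ID:   %d\n", scalar keys %by_id;
```

Keyed by name, the transcripts of both genes end up under a single `U2` entry; keyed by gene ID, they stay separate.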
I think the gene ID should always be used as the key, so the following lines:
LIQA/liqa_src/PreProcess_gtf.pl
Lines 36 to 47 in 8e09856
should be something like:
my $info = $a[8];
my @b = split("gene_id", $info);
my @c = split("\"", $b[1]);
$gene = $c[1];
my @d = split("transcript_id", $info);