cancerit/VAGrENT

Issue in ensembl data import admin scripts - Some CCDS transcripts missed

Closed this issue · 1 comments

The Vagrent admin scripts that import data from Ensembl select for any transcript with a Known status as well as a small list of specific biotypes. However some CCDS transcripts have been issued with Novel or Putative statuses and are being missed

This is an issue with Ensembl versions 59 through to 80. Ensembl were repeating whatever status Havana were reporting on transcript without any of their own QC. Unfortunatly ever since Ensembl 58 Havana has been reclassifying some CCDS transcripts (ie defining members of a CCDS record) as either Novel or Putative.

Need to review the Ensembl data import scripts to asses how this can be coped for in the future.

Status field has been dropped from Ensembl, this is no longer an issue