Issue in ensembl data import admin scripts - Some CCDS transcripts missed
Closed this issue · 1 comments
The Vagrent admin scripts that import data from Ensembl select for any transcript with a Known status as well as a small list of specific biotypes. However some CCDS transcripts have been issued with Novel or Putative statuses and are being missed
This is an issue with Ensembl versions 59 through to 80. Ensembl were repeating whatever status Havana were reporting on transcript without any of their own QC. Unfortunatly ever since Ensembl 58 Havana has been reclassifying some CCDS transcripts (ie defining members of a CCDS record) as either Novel or Putative.
Need to review the Ensembl data import scripts to asses how this can be coped for in the future.
Status field has been dropped from Ensembl, this is no longer an issue