ncbi/datasets

`ncbi_dataset/data/data_report.jsonl` sometimes missing

bernt-matthias opened this issue · 2 comments

When executing:

datasets download gene accession 'WP_004675351.1' --include-flanks-bp 100 --include gene,protein --no-progressbar
dataformat tsv prok-gene --package ncbi_dataset.zip --fields accession,description,ec-number,gene-symbol,mapping-count,protein-length,protein-name > gene_data_report.tsv

the second command fails with: `Error: no matching files found for [ncbi_dataset/data/data_report.jsonl]`

Hi @bernt-matthias,

Thanks for opening this issue. This is a known bug: we don't return the data report for WP proteins that have been suppressed. WP_004675351.1 was suppressed for the following reason: "This protein record was suppressed because it is no longer annotated on any genome."

I'll bring this up with the team and I'll comment on this thread with any updates.

Best,
Eric

Hi @bernt-matthias,

We won't be able to tackle this bug in the near term. I'm closing this ticket for now, but I will reopen it if we are able to revisit the problem.

Thanks again for your report.

Best,
Eric
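
Since the bug remains open, a script that chains the two commands above may want to check the downloaded package before calling `dataformat`. The following is a minimal sketch, not an official workaround. It assumes the package is an ordinary zip archive and that the report path is exactly the one named in the error message; the helper names `has_data_report` and `run_dataformat` are hypothetical and chosen for illustration only.

```python
import subprocess
import zipfile

# Path copied from the error message in this thread.
DATA_REPORT = "ncbi_dataset/data/data_report.jsonl"


def has_data_report(package_path, member=DATA_REPORT):
    """Return True if the zip archive at package_path contains member."""
    with zipfile.ZipFile(package_path) as package:
        return member in package.namelist()


def run_dataformat(package_path, fields, out_path):
    """Run `dataformat tsv prok-gene` only when the data report is present.

    Returns True if dataformat was invoked, False if the report is missing
    (for example because the WP accession has been suppressed).
    """
    if not has_data_report(package_path):
        return False
    # Same invocation as in the original report, with stdout redirected
    # to out_path instead of using a shell redirect.
    with open(out_path, "w") as out:
        subprocess.run(
            ["dataformat", "tsv", "prok-gene",
             "--package", package_path,
             "--fields", ",".join(fields)],
            stdout=out,
            check=True,
        )
    return True
```

Calling `run_dataformat("ncbi_dataset.zip", ["accession", "description"], "gene_data_report.tsv")` would then return `False` for a package like the one in this report instead of failing with the "no matching files" error, which lets the caller skip or log the suppressed accession.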