Return of the FTP error- rsync_from_ncbi.pl: unexpected FTP path (new server?)
Madeleine-Meyer opened this issue · 6 comments
Hello everyone,
I am trying to download the standard Kraken2 database and I'm having issues with the rsync_from_ncbi_.pl file. Here is my terminal input and the resulting output and error:
Input:
kraken2-build --standard --db Kraken2_standard_database_NCBI
Output:
Downloading nucleotide gb accession to taxon map... done.
Downloading nucleotide wgs accession to taxon map... done.
Downloaded accession to taxon map(s)
Downloading taxonomy tree data... done.
Uncompressing taxonomy data... done.
Untarring taxonomy tree data... done.
rsync_from_ncbi.pl: unexpected FTP path (new server?) for https://ftp.ncbi.nlm.nih.gov/genomes/all/GCF/000/762/265/GCF_000762265.1_ASM76226v1
Obviously, I Googled the error and when I opened rsync_from_ncbi.pl I saw that the original if statement causing the problems has already accounted for NCBI's switch to https. I tired every variation I could think of swapping out ftp and https and I'm out of ideas. Is the NCBI database having issues or has something changed and does the code need updating?
For reference, I installed Kraken2 under 48 hours ago, so I am using the most up-to-date versions of the code.
Also, edited to add that I get the same error if I try the following as well:
kraken2-build --download-library bacteria --db Kraken2_standard_database_NCBI
Hello,I've encountered the same issue as well. Have you found a solution?
@Madeleine-Meyer , @BioJM How did you install kraken2? From source or using conda?
@Madeleine-Meyer , @BioJM How did you install kraken2? From source or using conda?
Hello @ChillarAnand, I used the source code. I haven't found a solution yet but I've been side stepping the issue for the time being by just building my own custom library with only the species I'm interested in.
Are you using any tools to download specific species data? @Madeleine-Meyer
Are you using any tools to download specific species data? @Madeleine-Meyer
No, I just downloaded what I wanted from GenBank (https://www.ncbi.nlm.nih.gov/datasets/genome/?taxon=780). My downloads weren't particularly large, so I didn't need to use NCBI's command line tools https://www.ncbi.nlm.nih.gov/datasets/docs/v2/download-and-install/. I have never used their command line tools, but will likely give them a try in the near future.
This is still a significant issue. I am not keen to rewriting a series of interlinked perl scripts. Hoping that the Kraken2 devs can come to a quick fix so we can have an Kraken2 db that is up-to-date with a broad set of refseq species.