creating a custom GTDB with user provided MAGs
Closed this issue · 1 comments
Hello I need to use kraken2 for mapping paired end reads a against a custom GTDB database, the issue is that I need to add some metagenome assembly genomes that I built so I think your tool may help me a lot!
The issue is that first I dont know how to do it.
I cloned the repo to my server using git clone https://github.com/leylabmpi/Struo2.git
Once I created the conda environment with conda env create --name struo2 --file conda_env.yaml
im stuck with the following procedure:
Create a directory to hold all of the necessary Struo2 data files
The directory location can be anywhere on your file system
OUTDIR=./data/
mkdir -p $OUTDIR
it is supposed that I create an empty directory where I can download the following custom GTDB taxdump files ?
wget --directory-prefix $OUTDIR http://ftp.tue.mpg.de/ebio/projects/struo2/GTDB_release95/taxdump/names.dmp
wget --directory-prefix $OUTDIR http://ftp.tue.mpg.de/ebio/projects/struo2/GTDB_release95/taxdump/nodes.dmp
Or should I incorporate those taxdump files on the Struo2/data/ directory ?
besides that those wget command gives me the following "error 404" message
--2021-12-07 17:37:38-- http://ftp.tue.mpg.de/ebio/projects/struo2/GTDB_release95/taxdump/names.dmp
Resolving ftp.tue.mpg.de (ftp.tue.mpg.de)... 192.124.27.90
Conecting with ftp.tue.mpg.de (ftp.tue.mpg.de)[192.124.27.90]:80... connected.
Request HTTP sent, waiting response... 404 Not Found
Is there a detailed guide on how to use this useful tool?
Thanks for your time :)
You can use the ./util_scripts/database_download.py
script for downloading database files