jrjhealey/PYlogeny

Running error

Hello everyone, I'm trying to convert NCBI accession numbers with PYlogeny, but errors occur during the run. I've tried several times to work out the cause, without success. I ran the tool on both my own data and the test file:
"./PYlogeny.py -i /home/biolab_0/C.curva_accession.txt -e nqdung1404@gmail.com --update
[12/28/2023 03:32:39 PM] WARNING -> Creating the NCBITaxa instance. If this is the first run, this could take some time.
[12/28/2023 03:32:39 PM] WARNING -> The database is stored at: ~/.etetoolkit/
[12/28/2023 03:32:39 PM] WARNING -> Updating the taxa database, this could take some time.
Downloading taxdump.tar.gz from NCBI FTP site (via HTTP)...
Done. Parsing...
Loading node names...
2550626 names loaded.
319531 synonyms loaded.
Loading nodes...
2550626 nodes loaded.
Linking nodes...
Tree is loaded.
Updating database: /home/biolab_0/.etetoolkit/taxa.sqlite ...
2550000 generating entries...
Uploading to /home/biolab_0/.etetoolkit/taxa.sqlite

Inserting synonyms: 315000
Inserting taxid merges: 75000
Inserting taxids: 2550000
[12/28/2023 03:33:50 PM] ERROR -> RefSeq accessions only at present. Skipping.
Traceback (most recent call last):
  File "/home/biolab_0/PYlogeny/./PYlogeny.py", line 9, in <module>
    main()
  File "/home/biolab_0/PYlogeny/PYlogeny/main.py", line 111, in main
    record = Query(acc)
  File "/home/biolab_0/PYlogeny/PYlogeny/query.py", line 25, in __init__
    self.taxid = self.query()
  File "/home/biolab_0/PYlogeny/PYlogeny/query.py", line 52, in query
    return str(Entrez.read(Entrez.esummary(db=self.database, id=self.accession))[0]['TaxId'])
  File "/home/biolab_0/miniconda3/envs/PYlogeny/lib/python3.10/site-packages/Bio/Entrez/__init__.py", line 508, in read
    record = handler.read(handle)
  File "/home/biolab_0/miniconda3/envs/PYlogeny/lib/python3.10/site-packages/Bio/Entrez/Parser.py", line 335, in read
    self.parser.ParseFile(handle)
  File "/croot/python-split_1694437906621/work/Modules/pyexpat.c", line 468, in EndElement
  File "/home/biolab_0/miniconda3/envs/PYlogeny/lib/python3.10/site-packages/Bio/Entrez/Parser.py", line 704, in endErrorElementHandler
    raise RuntimeError(value)
RuntimeError: db name not defined
(PYlogeny) biolab_0@BIOLAB:~/PYlogeny$ ./PYlogeny.py -i /home/biolab_0/PYlogeny/tests/10accs.txt -e nqdung1404@gmail.com --update
[12/28/2023 03:34:13 PM] WARNING -> Creating the NCBITaxa instance. If this is the first run, this could take some time.
[12/28/2023 03:34:13 PM] WARNING -> The database is stored at: ~/.etetoolkit/
[12/28/2023 03:34:13 PM] WARNING -> Updating the taxa database, this could take some time.
Local taxdump.tar.gz seems up-to-date
Loading node names...
2550626 names loaded.
319531 synonyms loaded.
Loading nodes...
2550626 nodes loaded.
Linking nodes...
Tree is loaded.
Updating database: /home/biolab_0/.etetoolkit/taxa.sqlite ...
2550000 generating entries...
Uploading to /home/biolab_0/.etetoolkit/taxa.sqlite

Inserting synonyms: 315000
Inserting taxid merges: 75000
Inserting taxids: 2550000
[12/28/2023 03:35:08 PM] WARNING -> No lineage returned for taxid IntegerElement(29487, attributes={}). You may need to update the database.
WP_041379885.1,IntegerElement(29487, attributes={}),Not_found
[12/28/2023 03:35:09 PM] WARNING -> No lineage returned for taxid IntegerElement(2218628, attributes={}). You may need to update the database.
WP_058588699.1,IntegerElement(2218628, attributes={}),Not_found
[12/28/2023 03:35:10 PM] WARNING -> No lineage returned for taxid IntegerElement(768034, attributes={}). You may need to update the database.
WP_105398703.1,IntegerElement(768034, attributes={}),Not_found
[12/28/2023 03:35:11 PM] WARNING -> No lineage returned for taxid IntegerElement(2620880, attributes={}). You may need to update the database.
WP_112878141.1,IntegerElement(2620880, attributes={}),Not_found
[12/28/2023 03:35:12 PM] WARNING -> No lineage returned for taxid IntegerElement(29488, attributes={}). You may need to update the database.
WP_110086472.1,IntegerElement(29488, attributes={}),Not_found
[12/28/2023 03:35:12 PM] WARNING -> No lineage returned for taxid IntegerElement(29487, attributes={}). You may need to update the database.
WP_036780220.1,IntegerElement(29487, attributes={}),Not_found
[12/28/2023 03:35:13 PM] WARNING -> No lineage returned for taxid IntegerElement(1851568, attributes={}). You may need to update the database.
WP_065389666.1,IntegerElement(1851568, attributes={}),Not_found
[12/28/2023 03:35:14 PM] WARNING -> No lineage returned for taxid IntegerElement(2029681, attributes={}). You may need to update the database.
WP_036813449.1,IntegerElement(2029681, attributes={}),Not_found
[12/28/2023 03:35:15 PM] WARNING -> No lineage returned for taxid IntegerElement(29488, attributes={}). You may need to update the database.
WP_110091204.1,IntegerElement(29488, attributes={}),Not_found
[12/28/2023 03:35:16 PM] WARNING -> No lineage returned for taxid IntegerElement(29487, attributes={}). You may need to update the database.
WP_113043119.1,IntegerElement(29487, attributes={}),Not_found"
Please point out my mistakes. Thank you!
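
Two details in the logs above may be worth checking. The first run stops right after `RefSeq accessions only at present`, so the input file may contain a line that is not a RefSeq accession. The second run prints each taxid as `IntegerElement(29487, attributes={})`, which suggests the value is being turned into a string before it is reduced to a plain integer. The sketch below illustrates both checks in plain Python. It is not PYlogeny's code: `is_refseq`, `normalise_taxid`, and `FakeIntegerElement` (a stand-in for Biopython's `IntegerElement`, assumed here to be an `int` subclass with a custom `repr`) are hypothetical names used only for illustration.

```python
import re

# RefSeq accessions carry a two-letter prefix followed by an underscore
# (for example WP_ or NZ_), optionally ending in a ".version" suffix.
REFSEQ_PATTERN = re.compile(r"^[A-Z]{2}_[A-Z0-9]+(\.\d+)?$")


def is_refseq(accession: str) -> bool:
    """Return True if the string looks like a RefSeq accession."""
    return bool(REFSEQ_PATTERN.match(accession.strip()))


class FakeIntegerElement(int):
    """Hypothetical stand-in for Biopython's IntegerElement.

    Assumption: the real class is an int subclass whose repr wraps the value.
    """

    def __repr__(self) -> str:
        return f"IntegerElement({int(self)}, attributes={{}})"


def normalise_taxid(value) -> str:
    """Reduce an int-like taxid to a plain decimal string."""
    return str(int(value))


if __name__ == "__main__":
    taxid = FakeIntegerElement(29487)
    # str() on an int subclass falls back to its repr, so the wrapper leaks out.
    print(str(taxid))              # IntegerElement(29487, attributes={})
    print(normalise_taxid(taxid))  # 29487
    # Lines that fail this check could be removed from the input file first.
    print(is_refseq("WP_041379885.1"))  # True
    print(is_refseq("CP012345.1"))      # False
```

If the assumption about `IntegerElement` holds, wrapping the `TaxId` value in `int(...)` before `str(...)` in `query.py` would give `29487` rather than the wrapped form, though whether that resolves the `Not_found` results is untested here.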