Hi, I can not run guidemarker using genome fasta +gff file. Appreciate any help.
hz-mswang opened this issue · 4 comments
2022-11-10 15:34:20,592 root ERROR GuideMaker terminated with errors. See the log file for details.
Traceback (most recent call last):
File "/home/shuang/miniconda3/envs/gmenv/lib/python3.9/site-packages/guidemaker/cli.py", line 193, in main
anno.get_annotation_features()
File "/home/shuang/miniconda3/envs/gmenv/lib/python3.9/site-packages/guidemaker/core.py", line 761, in get_annotation_features
if not feat_key in feature_dict:
UnboundLocalError: local variable 'feat_key' referenced before assignment
gff example:
JAIZPG010000102.1 AUGUSTUS gene 60 1208 0.39 - . ID=gene-alt2500001;Name=alt2500001;gbkey=Gene;gene_biotype=protein_coding;locus_tag=alt2500001
JAIZPG010000102.1 AUGUSTUS mRNA 60 1208 0.39 - . ID=rna-gnl|WGS:LPVP|mrna.alt2500001;Parent=gene-alt2500001;gbkey=mRNA;locus_tag=alt2500001;orig_protein_id=gnl|WGS:LPVP|alt2500001;orig_transcript_id=gnl|WGS:LPVP|mrna.alt2500001;product=alt2500001
JAIZPG010000102.1 AUGUSTUS CDS 60 234 0.96 - 1 ID=cds-alt2500001;Parent=rna-gnl|WGS:LPVP|mrna.alt2500001;Dbxref=NCBI_GP:alt2500001;Name=alt2500001;gbkey=CDS;locus_tag=alt2500001;orig_transcript_id=gnl|WGS:LPVP|mrna.alt2500001;product=alt2500001;protein_id=alt2500001
Could you give me the command you ran to produce this error? If you are able to post a link to the data files that would be helpful too. I think it has to do with the set of tags your annotation produced in the gff. Guidemaker guesses from a list of standard tags but if they are non-standard you have to provide them on the command line.
I'll take a look at it when I'm back at work next week.
I downloaded your data and ran it successfully on my computer. The only thing I changed was adding the line ##gff-version=3
to your gff file generated by AGUSTUS. I also noticed that the example gff has contig id's that look like this JAIZPG010000102.1
and the files you sent have contig ids that look like this NODE_20_length_608411_cov_17.089849
. Having mismatches in the contig names may have caused the issue.