lfaino/LoReAn

mask gff error: list index out of range

iwangtoknow opened this issue · 9 comments

Hi Luigi,
Recently I pulled the image tagged dev_cli,
a problem raised at the mask step,
here is the log:

###RUNNNG REPEATSCOUT AND REPEATMASK TO MASK THE GENOME STARTED AT:	14:07:53 24-12	###
	###RUNNING REPEATSCOUT TO FIND REPETITIVE ELEMENTS###
	###RUNNING REPEATMASKER TO MASK THE GENOME###
Traceback (most recent call last):
  File "/opt/LoReAn/code/lorean.py", line 552, in <module>
    main()
  File "/opt/LoReAn/code/lorean.py", line 144, in main
    masked_ref = mseq.repeatsfind(ref_link, wd_split, args.repeat_lenght, threads_use, args.verbose)
  File "/opt/LoReAn/code/manipulateSeq.py", line 252, in repeatsfind
    gff = [y for x in os.walk(working_dir) for y in glob(os.path.join(x[0], name_gff))][0]
IndexError: list index out of range

content in split folder

ls LoReAn_20181224/run/split/
BJCHM5.nuclear.fasta.freq           RM_144740.MonDec241413472018  tmp4_49nifo  tmphshwq69c  tmptx_v8m9j
BJCHM5.nuclear.fasta.repeats.fasta  tmp05pzu06l                   tmph0t2lha5  tmpk5dl_073

Thanks
WANG

Hi Wang,
can you please try to run the image with the tag latest? additionally, which are you using Singularity or Docker?

Cheers
Luigi

Hi Luigi,
I'm using Docker. I'll try latest.
Thanks!

Happy New Year!

WANG

Hi Luigi,
Do you think it is necessary to fill gene id with 0, such as ID=gene000001 instead ID=gene1?
WANG

Why do you want to do something like this?

luigi

Hi Luigi,
For example, if you sort the gene name, could have the result like below:
gene1
gene10
gene2
...
gene9

But if you filled into the same length with 0, then it could be this:
gene01
gene02
...
gene10

That's why I have that thought.

WANG

you can use {id:0{width}d}.format(id=id,width=width) in python, do this. just an example, detail you should read python doc.

Thanks

Hi.
the Verticillium example (/opt/singularity/bin/singularity exec ../../lorean_noIPRS.sif lorean -t 7 --keep_tmp -a -f -d -sr readsChr.subset.fastq -lr longReadsChr8.fastq -pr subset.prot.fasta -sp JR2 -mg chr8.fasta), and in fact any command line that has the -mg option, exits with this error:

RUNNING IN STRAND MODE AND USING ADAPTER PROVIDED

LOREAN IS CHECKING THAT GMAP IS CORRECTLY BUILD

###GMAP MAPPING STARTED AT: 00:50:41 08-06 ###
###BUILD INDEX###
###MAP###

GMAP IS CORRECTLY BUILD

###RUNNNG REPEATSCOUT AND REPEATMASK TO MASK THE GENOME STARTED AT: 00:50:41 08-06 ###
###RUNNING REPEATSCOUT TO FIND REPETITIVE ELEMENTS###
###RUNNING REPEATMASKER TO MASK THE GENOME###
Traceback (most recent call last):
File "/opt/LoReAn/code/lorean.py", line 560, in
main()
File "/opt/LoReAn/code/lorean.py", line 149, in main
masked_ref = mseq.repeatsfind(ref_link, wd_split, args.repeat_lenght, threads_use, args.verbose)
File "/opt/LoReAn/code/manipulateSeq.py", line 258, in repeatsfind
gff = [y for x in os.walk(working_dir) for y in glob(os.path.join(x[0], name_gff))][0]
IndexError: list index out of range

@rchavez-langebio ,
I think that you did not follow very well the instruction. You need to download the two files and link the folders to the image as written in the instructions. Repeatmasker and Augustus need to be able to write to a file and it is not possible in the image.

please folow point 2 and 3 in the installation steps.

Cheers
Luigi