Issue running figaro with 16S V4 data

Question

Issue running figaro with 16S V4 data

Opened this issue 4 years ago · 5 comments

Hi,

I am running Figaro via docker. The runs start ok but keep finishing with an error message:

Traceback (most recent call last):
File "/opt/figaro/figaro/figaro.py", line 218, in
main()
File "/opt/figaro/figaro/figaro.py", line 210, in main
resultTable, forwardCurve, reverseCurve = trimParameterPrediction.performAnalysisLite(parameters.inputDirectory.value, parameters.minimumCombinedReadLength.value, subsample = parameters.subsample.value, percentile = parameters.percentile.value, forwardPrimerLength=parameters.forwardPrimerLength.value, reversePrimerLength=parameters.reversePrimerLength.value, namingStandardAlias=fileNamingStandard)
File "/opt/figaro/figaro/trimParameterPrediction.py", line 457, in performAnalysisLite
resultTable = runTrimParameterTestLite(forwardExpectedErrorMatrix, reverseExpectedErrorMatrix, trimPositions, minimumTrimmingPositions, forwardCurve, reverseCurve, forwardPrimerLength, reversePrimerLength)
File "/opt/figaro/figaro/trimParameterPrediction.py", line 347, in runTrimParameterTestLite
reverseExpectedErrors = reverseExpectedErrorMatrix[reverseTrimPosition - reverseMinimumTrimPosition]
IndexError: index 327 is out of bounds for axis 0 with size 27

Any idea what the problem could be. Could it be a memory issue?

Answer 1 · 2021-05-10T05:11:46.000Z

I would have see what your run parameters are as well as your reads. My first guess would be that you might have some very short reads in your sample. Can you check that all your reads are of uniform length in the fastq (especially the paired-end 2 reads)?

Answer 2 · 2021-05-11T20:56:03.000Z

Hi,

Run parameters:

docker container run --rm -e AMPLICONLENGTH=254 -e FORWARDPRIMERLENGTH=0 -e REVERSEPRIMERLENGTH=0 -e MINIMUMOVERLAP=20

FastQC gives no indication that there are very short reads, just stating that R1 and R2 read files contain reads of 301 bp

Answer 3 · 2021-05-13T04:17:58.000Z

Are the lengths of the reads longer than your intended amplicon? If so, check out #36

Answer 4 · 2021-05-13T21:55:48.000Z

If this is the case, would this program still be appropriate for using with 16S V4 paired reads? I am just wondering whether this would introduce biases into the trimming optimisation results

Answer 5 · 2021-05-14T06:31:22.000Z

Should still be OK. The program is just optimizing the trimming sites. The only issue I can foresee here is that you will have much less overlap in the middle, since the program will trim to a pre-determined overlap length (default of 20). I can’t imagine how this would cause any major bias unless there’s already something seriously wrong with the reads. From: pig-raffles ***@***.***> Sent: Thursday, May 13, 2021 2:56 PM To: Zymo-Research/figaro ***@***.***> Cc: Michael Weinstein ***@***.***>; Comment ***@***.***> Subject: Re: [Zymo-Research/figaro] Issue running figaro with 16S V4 data (#35) If this is the case, would this program still be appropriate for using with 16S V4 paired reads? I am just wondering whether this would introduce biases into the trimming optimisation results — You are receiving this because you commented. Reply to this email directly, view it on GitHub <#35 (comment)> , or unsubscribe <https://github.com/notifications/unsubscribe-auth/ACEYNLOEYDUOBGFU3NKBIJ3TNRDHLANCNFSM44IVPUSQ> . <https://github.com/notifications/beacon/ACEYNLLIVCSI6UAFS3TGIQTTNRDHLA5CNFSM44IVPUS2YY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOGIPHSKI.gif>