shilad/wikibrain

IndexOutOfBoundsException when parsing Simple English

Closed · 1 comment

I cloned the repository and ran ./scripts/runpipeline.sh all -l simple, but I got warnings like the following during the run:

WARNING: exception while parsing unknown
java.lang.IndexOutOfBoundsException: Index: 1879, Size: 1879
    at java.util.ArrayList.rangeCheck(ArrayList.java:571)
    at java.util.ArrayList.get(ArrayList.java:349)
    at de.tudarmstadt.ukp.wikipedia.parser.mediawiki.SpanManager.getSrcPos(SpanManager.java:63)
    at de.tudarmstadt.ukp.wikipedia.parser.mediawiki.ModularParser.parseTemplates(ModularParser.java:985)
    at de.tudarmstadt.ukp.wikipedia.parser.mediawiki.ModularParser.parse(ModularParser.java:372)
    at org.wikapidia.parser.wiki.WikiTextParser.parse(WikiTextParser.java:62)
    at org.wikapidia.parser.wiki.WikiTextDumpParser$ParserProcedure.call(WikiTextDumpParser.java:95)
    at org.wikapidia.parser.wiki.WikiTextDumpParser$ParserProcedure.call(WikiTextDumpParser.java:74)
    at org.wikapidia.utils.ParallelForEach$4.run(ParallelForEach.java:166)
    at org.wikapidia.utils.ParallelForEach$BoundedExecutor$1.run(ParallelForEach.java:245)
    at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1146)
    at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
    at java.lang.Thread.run(Thread.java:679)
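
The message "Index: 1879, Size: 1879" means the code asked for the element one past the end of the list, since valid indices run from 0 to size - 1. The following is a minimal sketch of that boundary condition only; the class and method names are made up for illustration, and it does not reproduce what SpanManager.getSrcPos actually does internally.

```java
import java.util.ArrayList;
import java.util.List;

public class IndexEqualsSize {
    // Returns true if list.get(index) throws IndexOutOfBoundsException.
    static boolean throwsOutOfBounds(List<Integer> list, int index) {
        try {
            list.get(index);
            return false;
        } catch (IndexOutOfBoundsException e) {
            return true;
        }
    }

    public static void main(String[] args) {
        // Hypothetical list with the same size as in the warning above.
        List<Integer> positions = new ArrayList<Integer>();
        for (int i = 0; i < 1879; i++) {
            positions.add(i);
        }
        System.out.println(throwsOutOfBounds(positions, 1878)); // last valid index: false
        System.out.println(throwsOutOfBounds(positions, 1879)); // index == size: true
    }
}
```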

I made one change to the configuration (reference.conf): I use 20 threads instead of all available.

Thanks

I think those are just relatively harmless parse warnings.
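
As a rough illustration of why a warning like this need not stop the run: the log line "WARNING: exception while parsing" suggests the failure is caught for a single page and the remaining pages continue. Below is a minimal sketch of that catch-and-log pattern, assuming a per-page try/catch. The names TolerantParseLoop, PageParser, and parseAll are hypothetical and are not the WikiBrain or JWPL API.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.logging.Level;
import java.util.logging.Logger;

public class TolerantParseLoop {
    private static final Logger LOG = Logger.getLogger(TolerantParseLoop.class.getName());

    /** Hypothetical stand-in for a wikitext parser; not a real library interface. */
    interface PageParser {
        String parse(String wikiText);
    }

    /** Parses every page, logging and skipping the pages whose parse throws. */
    static List<String> parseAll(List<String> pages, PageParser parser) {
        List<String> parsed = new ArrayList<String>();
        for (String page : pages) {
            try {
                parsed.add(parser.parse(page));
            } catch (RuntimeException e) {
                // One bad page produces a warning; the loop moves on to the next page.
                LOG.log(Level.WARNING, "exception while parsing page", e);
            }
        }
        return parsed;
    }

    public static void main(String[] args) {
        // Simulated parser that fails on pages containing a template opener.
        PageParser flaky = new PageParser() {
            public String parse(String wikiText) {
                if (wikiText.contains("{{")) {
                    throw new IndexOutOfBoundsException("simulated template failure");
                }
                return wikiText.trim();
            }
        };
        List<String> out = parseAll(Arrays.asList(" a ", "{{broken", " b "), flaky);
        System.out.println(out); // [a, b]
    }
}
```

In this sketch the page that throws is simply missing from the result, which is the trade-off of treating such exceptions as warnings: the run completes, but the affected pages are not parsed.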


In reply to Thanapon Noraset's message of Nov 9, 2013, 2:55 PM.
