Issues
- 0
Need help with extracting nested HTML content.
#45 opened by Amaresh - 0
- 0
how to split a list of strings separated by a specific character?? is there any thing like a split function or some work around for it?
#43 opened by m2ai - 1
Limit the extraction of outlinks
#42 opened by wiradikusuma - 12
Nutch 2.x support
#41 opened by tomchiverton - 3
Plugin doesn't work
#30 opened by AndraIonescu - 4
Concatenate 2 fields into one
#29 opened by AndraDenis - 1
Css :not pseudo-class doesn't work
#32 opened by AndraIonescu - 1
Ignore load-external-dtd declaration in xml
#33 opened by nithingit - 1
Missing LICENSE
#37 opened by nicobrevin - 0
Not indexing data in solr
#36 opened by aakashkag - 0
Using fragment on xml documents
#35 opened by rohith004 - 0
Using fragment on xml documents
#34 opened by rohith004 - 0
Plugin doesn't work on Linux
#31 opened by rodrigomagnoss - 3
- 4
Conditional indexing or following
#12 opened by tahagh - 4
cannot testUrl
#26 opened by virivigio - 2
How do I get nutch to crawl outlinks only and not the urls for each fragment?
#27 opened by manjunathbharadwaj - 2
Unlike in all examples, I am asked to explicitly declare a "url" field in my extractors.xml where I want to use fragments
#25 opened by manjunathbharadwaj - 6
error when using fragment option
#11 opened by moees - 1
How to use this
#8 opened by jayasreemca - 0
- 8
extraction with xpath engin
#9 opened by moees - 5
What subset of Jsoup css selector is supported?
#22 opened by ChanderG - 3
Parsing Javascript script tags
#19 opened by raisindetre - 3
Bug in document inheritance?
#18 opened by raisindetre - 8
Return img alt and text
#14 opened by phranq - 3
Using for-each in an HTML Page.
#17 opened by jaychakra - 1
Extract from PDF
#16 opened by rugbymauri - 3
corrupt distribution zip
#13 opened by rugbymauri - 2
NPE trying to index
#15 opened by dmnt3rr0r - 3
Unable to index documents with Tika
#1 opened by wwhurley - 1
sitemap protocol
#10 opened by paulescom - 1
Compile plugin as .job file
#7 opened by arkka - 0
Implement conditional resource matching
#5 opened by tahagh - 5
Error: Unsupported major.minor version 51.0
#3 opened by pepeabel - 3