Issues
Use OpenGraph meta tags in indexing
#1941 opened by TorutheRedFox - 6
dpkg error : conffile name 'etc/opensearchserver' is not an absolute pathname
#1939 opened by judemont - 0
OpenSearchServer and OpenSearch difference
#1938 opened by diolegend - 1
Any Demo for testing?
#1937 opened by passionate2023 - 1
pdf viewer using deprecated class
#1925 opened by kcfez - 1
New Release?
#1909 opened by Dexus - 0
OSS v1.5.14 creates many temporary files
#1926 opened by zpaul91 - 2
CamelCase filter/tokenizer
#1878 opened by TudorCretu - 9
Horizontal scaling across multiple nodes
#1916 opened by lausycampari - 1
OSS web crawler pattern list needs to be split into multiple files instead of only one.
#1889 opened by ZeroCool940711 - 0
Minimise Sidebar Filters and sort order
#1914 opened by pbartett - 0
Problem with renderer and viewer for FTP files
#1913 opened by VincentDomo - 1
file:// URLs instead of SMB:// URLs ?
#1906 opened by jebofponderworthy - 0
Support SmartChineseAnalyzer?
#1907 opened by marklin0531 - 0
RESTful API access to Renderer engine?
#1905 opened by jebofponderworthy - 0
XML Parser hangs
#1904 opened by emmanuel-keller - 0
Patterns added using REST api are not being crawled. Patterns added manually however are crawled.
#1903 opened by JimHha - 0
Screenshot in results
#1902 opened by Marx1st - 0
Missing title for indexed PDFs
#1901 opened by Marx1st - 2
Add Cassandra connector
#1887 opened by emmanuel-keller - 0
New parser field in the HTMLParser providing the full htmlSource without XPATH exclusions
#1896 opened by emmanuel-keller - 1
Allow leading wildcard in Pattern query
#1900 opened by emmanuel-keller - 0
BigchainDB
#1899 opened - 0
content is extracted twice while using a regexp in the HTMLParser on HtmlSource field
#1897 opened by emmanuel-keller - 0
How to set up the scheduler for web crawling
#1894 opened by alistr2 - 0
Crawl Filter for a specific tld
#1893 opened by Zerokami - 0
Error while working on URL, The text would exceed the max allowed overall size
#1892 opened by emmanuel-keller - 1
URLs should be trimmed to avoid leading spaces
#1890 opened by emmanuel-keller - 0
Query or Term Filter with wildcard on URL
#1888 opened by etiwari - 0
Is it possible to crawl JavaScript web pages?
#1884 opened by bolotkalil - 0
Internet Explorer links are incorrect in Renderer
#1886 opened by marke72 - 0
How to develop and register custom parser?
#1885 opened by HiranChaudhuri - 0
Building from source
#1883 opened by HiranChaudhuri - 0
Support of array in database crawler
#1881 opened by emmanuel-keller - 0
Error reading 'patternList' !
#1880 opened by seokickup - 0
Parsing of BLOB in the Database crawler
#1879 opened by emmanuel-keller - 0
Crawler keeps getting stuck in the current directory loop with certain FTP servers
#1876 opened by emmanuel-keller - 0
ClosedChannelException on Terms extraction
#1877 opened by emmanuel-keller - 0
Upgrade PDFBox to 2.0.x
#1875 opened by emmanuel-keller - 0
Wildcard Queries fail if some letters are capital
#1874 opened by lagerfeuer - 1
Error when Wildcard first Character
#1872 opened by lagerfeuer - 0
Support of MongoDb as crawl cache
#1873 opened by emmanuel-keller