Crawling capabilities for Apache Tika. Crawl content out of e.g. file systems, http(s) sources (webcrawling) imap(s) servers or your own arbitrary data sources. LeechCrawler offers additional Tika parsers providing these crawling capabilities.
Primary LanguageJavaBSD 3-Clause "New" or "Revised" LicenseBSD-3-Clause
No one’s star this repository yet.