ybv/WikiSearch
This project aims to build a Wikipedia indexer with the following goals: • Parse fairly involved Wikipedia markup. • Index a decent sized subset of the Wikipedia corpus. • Create multiple indexes on the page data as well as metadata. • Provide an index introspection mechanism that can later be built upon to support queries.
Java