Inspired by and based on: Let's build a Full-Text Search engine
Loads documents and processes them to create a reverse index.
Todos:
- Take raw text
- Tokenize
- Normalize & filter
- Search in tokens
- Make the input JSON instead of XML
- Return the documents that match instead of their indexes
- Accept the file path as a command line argument
- Change the file path to be a flag
- Support wildcards
- Extend boolean queries to support
OR
andNOT
- Sort results by relevance