Build search engine from scratch. This is the project at My University
- Crawl many page from wiki
- Split document into words
- Index words using Reverse Index (DBMS)
- Store Index file and document file how to fast retrieve. I store some documents in a large file, i know exactly what a document at a specific row. Why do i store some documents in a larger file? Because Operating read a whole sector