Search system for the Reuters 21578 Corpus
It uses an inverted index to store the documents and each document represented as a bag of words model where each word is reduced to its stem before inferring the probability distribution.
The relevance score of a document is computed by summing over the query terms' tf-idf weights.
git clone --recursive https://github.com/deniskyashif/ranked-search.git
- .NET 8 LTS