/SearchEngine_WikiArticles

Java based search engine to search wiki articles. shows the top articles using cosine similarity

Primary LanguageJava

SearchEngine_WikiArticles

Java based search engine to search wiki articles. It shows the top articles using cosine similarity between the documents and query. For a given query it outputs the top documents from the index. Includes a crawler and indexer to crawl and index the wikipedia xml dump file. The project was created using netbeans. Please import the project in netbeans to view code. Contains the Wiki xml dump file. The program uses Cosine Similarity to measure the relevance of documents to query and provide result.