/biblio-duplicate-detection

This repo deals with the problem of detecting near-duplicates among Russian-language bibliographic references. For the purpose of obtaining references, an additional task is solved --- the allocation of bibliographic references from scientific documents. To build the base of unique links, a search engine indexing is implemented.

Primary LanguageJupyter Notebook

No issues in this repository yet.