This project aims to show basic concepts of information indexing and retrieval,document indexing and querying. Concepts like stemming, lemmantization, similarity and dissimilarity querying and latent semantic indexing, are applied on this project. A Jupyter Notebook and an Oracle Database are used to create a client-server architecture. The project was mounted with Oracle Cloud Free Tier, using an Oracle Autonomous Database and a AMD Compute VM.
victor-cali/DocumentBase
Educational Document Base prototype to perform queries based on similarity and dissimilarity measures of documents to which stemming, lemmantization and latent semantic indexing was applied.
Jupyter Notebook