document-similarity
There are 76 repositories under document-similarity topic.
piskvorky/gensim
Topic Modelling for Humans
oborchers/Fast_Sentence_Embeddings
Compute Sentence Embeddings Fast!
abhilampard/Simple-Plagiarism-Checker
Web Application for checking the similarity between query and document using the concept of Cosine Similarity.
IlyaGusev/tgcontest
Telegram Data Clustering contest solution by Mindful Squirrel
massanishi/document_similarity_algorithms_experiments
Document similarity algorithms experiment - Jaccard, TF-IDF, Doc2vec, USE, and BERT.
Sarthakjain1206/Intelligent_Document_Finder
Document Search Engine Tool
andrewmcloud/consimilo
A Clojure library for querying large data-sets on similarity
allenai/aspire
Repo for Aspire - A scientific document similarity model based on matching fine-grained aspects of scientific papers.
zayedrais/DocumentSearchEngine
Document Search Engine project with TF-IDF abd Google universal sentence encoder model
meenavyas/Misc
Contains interesting projects like Cat face detection, cat face recognition, code generation, Building chatbot, finding similar documents, image segmentation, UCI credit card, anomaly detection, MNIST etc.
parvez86/Smart-Recruitment-System
A simple Django-based resume ranker website where recruiters post their jobs and candidates applies for their desired vacancies. The system gets the document similarity between the job description and the candidate resumes, generates similarity scores using the KNN model, and rank or shortlist the candidate resumes.
shrebox/Natural-Language-Processing
Compilation of Natural Language Processing (NLP) codes. BONUS: Link to Information Retrieval (IR) codes compilation. (checkout the readme)
Sarthakjain1206/Intelligent-Document-Finder
A tool which can find your any document using semantic search
nicoDs96/Document-Similarity-using-Python-and-PySpark
Document Similarity with Apache Spark using Locality Sesitive Hashing and Python
TSunny007/Document-Similarity
Using Jaccard-Similarity and Minhashing to determine similarity between two text documents
malteos/wikipedia-article-recommendations
Survey data and Python code for the ICADL 2021 paper "A Qualitative Evaluation of User Preference for Link-based vs. Text-based Recommendations of Wikipedia Articles"
maxoodf/tgnews
Telegram Data Clustering Contest (Bossy Gnu's submission )
mdietrichstein/ir-search-engine-rust
Rust-based text search engine from scratch supporting multiple document similarity metrics (TF-IDF, BM25, BM25VA)
574567254/Job_Searching_Analysis
Aims to provide job searching strategy for new graduates who are interested in data-related positions.
izikeros/sentence-plagiarism
Compare sentences from input document with all sentences from reference documents - find very similar ones.
mohammaduzair9/Document-Searching
Document searching from queries using Inverted index
tejaspradhan/AI-based-Hiring-Platform
A Two-ended Hiring web application built using flask. The application uses document similarity techniques for recommendation.
Bit-Nation/notary
The Bitnation Jurisdiction Public Notary DApp
MSVCode/doc-similarity
Simple document similarity module implemented in NodeJS
nunososorio/docxmatch
DocxMatch is a Streamlit app that analyzes the similarity between Word files.
shreyansh26/MinHash-Implemenation
A simple MinHash implementation based on the explanation in the Mining of Massive Datasets course by Stanford
topcat/pubmed-docsim
Code to train a LSI model using Pubmed OA medical documents and to use pre-trained Pubmed models on your own corpus for document similarity.
DrKenReid/Generalized-Analysis-of-Text-Data
A comprehensive toolkit for analyzing text data using various AI and NLP techniques, including topic modeling, sentiment analysis, and text classification, demonstrated on the 20 Newsgroups dataset.
Forthoney/doc_sim
Approximate document similarity with Minhash + Locality Sensitive Hashing
johnifegwu/natural-language-processing
Natural language processing examples and automations
JPWJPW25/7-789-1.00_Advanced-Programming_Python
Individual group project in Python
Siddhantmest/Categorizing-amazon-products
Classifying products into categories using NLP techniques
zbmed-semtec/doc2vec-doc-relevance
An approach exploring and assessing literature-based doc-2-doc recommendations using a doc2vec and applying to the RELISH dataset.
zbmed-semtec/hybrid-pre-doc2vec-doc-relevance
Hybrid approach combining dictionary-based NER and doc2vec
zbmed-semtec/word2doc2vec-doc-relevance
An approach exploring and assessing literature-based doc-2-doc recommendations using word2vec combined with doc2vec, and applying it to TREC and RELISH datasets