lsh-algorithm
There are 56 repositories under lsh-algorithm topic.
Infini-AI-Lab/MagicPIG
[ICLR2025 Spotlight] MagicPIG: LSH Sampling for Efficient LLM Generation
ritchie46/lsh-rs
Locality Sensitive Hashing in Rust with Python bindings
guofei9987/pyLSHash
Locality Sensitive Hashing, fuzzy-hash, min-hash, simhash, aHash, pHash, dHash。基于 Hash值的图片相似度、文本相似度
xiaogp/recsys_faiss
一个基于 fasttext + faiss 的商品内容相关推荐实现,nginx+uwsgi+flask / gunicorn+uvicorn+fastapi 提供api查询接口,增加Spark实现 Ansj+Word2vec+LSH+Phoenix
oertl/probminhash
ProbMinHash – A Class of Locality-Sensitive Hash Algorithms for the (Probability) Jaccard Similarity
RishabhMaheshwary/query-attack
A Query Efficient Natural Language Attack in a Black Box Setting
oertl/treeminhash
TreeMinHash: Fast Sketching for Weighted Jaccard Similarity Estimation
munnafaisal/Deep-Object-Search-With-Hash
Search your object with hash
ZiadSheriif/IntelliQuery
A semantic search indexing system designed to efficiently retrieve top matching results from a database of 20 million documents. Given the embedding of a search query, it quickly identifies and returns the most relevant documents
muyuuuu/high-performance-LSH
使用线程池的高并发 LSH 算法, C++ 实现
shaltielshmid/MinHashSharp
A Robust Library in C# for Similarity Estimation
justinfargnoli/lshforest
An implementation of LSH Forrest based off of the following paper (http://infolab.stanford.edu/~bawa/Pub/similarity.pdf).
lehuutrung1412/ImageRetrieval
Build content-based image retrieval system using deep learning, applied some large scale similarity search technicals like Kdtree, LSH, Faiss.
theatina/CryptoRecommendation
Recommendation System on cryptocurrency, using data collected from users' tweets + 10-Fold Cross Validation ( Based on the cryptocoins from each user's tweets, the program runs algorithms on the data, resulting in the recommendation of other cryptocoins for each user) ( readme in greek but soon to be translated in English )
Alexdruso/ID2222-Data-Mining-Sanvito-Stuart
Lab assignments for the course ID2222-Data Mining at KTH
jianshu93/DartUniFrac
Approximate UniFrac via Weighted MinHash 🦀
kochlisGit/Big-Data-Algorithms
Implementation of algorithms for big data using python, numpy, pandas.
Vedant2311/Data-Mining-Algorithms
Repository for all assignments of the course COL761: Data Mining (Fall 2020), taught at IIT Delhi
aidaLabDEI/MOMENTI-motifs
Scalable mining of multidimensional time series motifs.
ashkanans/text-similarity-and-clustering
A Python project implementing shingling, minwise hashing, and locality-sensitive hashing (LSH) for text similarity detection, along with feature engineering and clustering analysis on real-world datasets. Includes code, visualizations, and key insights for efficient data processing and analysis.
eduardosantoshf/most-frequent-itemsets
MDLE First Assignment - The objective of this project was to implement the A-Priori algorithm to obtain the most frequent itemsets for a list of conditions for a large set of patients, obtaining then associations between conditions by extracting some rules, and also to implement and apply LSH to identify similar news articles from a dataset.
ludwigfriborg/SwiftNilsimsa
Nilsimsa implementation as a swift package
MenesesGHZ/locality-sensitive-hashing
LSH algorithm made with C++
Mrugank97/KNNavigate
Scaling Up Nearest Neighbor Search : How Dataset Size and Dimensionality Affect KNN Variants
Muvels/LSHEngine
This repo aims to implement an modular engine for Locality-Sensitive Hashing (LSH).
Sitaras/Software-Development-for-Algorithmic-Problems_Project-1
Vectors - Nearest neighbor search and Clustering using LSH, Hypercube (and Lloyd's only at the clustering) algorithms with L2 metric.
SwamiKannan/Natural-Language-Processing-Specialization
Coursera's Natural Language Processing specialization
jianshu93/dartminhash-rs
Fast Sketching for Weighted Sets
MajaJuri/Analiza-velikih-skupova-podataka
Implementacija algoritama predstavljenih na predmetu Analiza velikih skupova podataka (AVSP)
Meet-Modi/minibaseDB-vector-LSH
A 100D vector database using minibase storage engine
SofiaKyriakiMachaira/SoftwareDevforAlgorithmicProblems
Software Development for Algorithmic Problems (UoA) Assignments
alexrr04/document-similarity-detection-with-LSH
This project implements Locality-Sensitive Hashing (LSH) for efficient document similarity detection. Instead of performing exhaustive pairwise comparisons between documents, LSH uses probabilistic techniques to quickly identify similar document pairs, making it particularly effective for large document collections.
FrancescoMonaco/span
Euclidean Minimum Spanning Tree approximation with a parameterless LSH index
Lefteris-Souflas/Movie-Rating-User-Similarity
Explored Jaccard distance, Min-Hashing, and LSH for user similarity in a movie rating dataset. Tasks involve dataset preprocessing, exact Jaccard Similarity computation, Min-Hash signatures, and LSH implementation. Results and observations are documented in code, output files, and a report