CaucherWang
Ph.D. candidate in Fudan University @DSM-fudan and Université Paris Cité, with interest in high-d vector database.
Fudan UniversityShanghai
CaucherWang's Stars
Muennighoff/sgpt
SGPT: GPT Sentence Embeddings for Semantic Search
xlang-ai/instructor-embedding
[ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddings
m-bain/webvid
Large-scale text-video dataset. 10 million captioned short videos.
pinecone-io/research-bigann-linscan
zilliztech/feder
Visualize hnsw, faiss and other anns index
Patrick-H-Chen/FINGER
vmware/splinterdb
High Performance Embedded Key-Value Store
georgia-tech-db/evadb
Database system for AI-powered apps
paperswithcode/releasing-research-code
Tips for releasing research code in Machine Learning (with official NeurIPS 2020 recommendations)
stat-ml/GeoMLE
This repo contains code for GeoMLE intrinsic dimension estimation algorithm
DBAIWangGroup/nns_benchmark
Benchmark of Nearest Neighbor Search on High Dimensional Data
Lsyhprum/WEAVESS
A Comprehensive Survey and Experimental Comparison of Graph-based Approximate Nearest Neighbor Search
amzn/pecos
PECOS - Prediction for Enormous and Correlated Spaces
TheDatumOrg/VAQ
Fast Adaptive Similarity Search through Variance‑Aware Quantization
yahoo/lopq
Training of Locally Optimized Product Quantization (LOPQ) models for approximate nearest neighbor search of high dimensional data in Python and Spark.
dmllr/fast-lopq
Fast C++ implementation of https://github.com/yahoo/lopq: Locally Optimized Product Quantization (LOPQ) model and searcher for approximate nearest neighbor search of high dimensional data.
dblalock/bolt
10x faster matrix and vector operations
cmuparlay/pbbsbench
New version of pbbs benchmarks
csypeng/LAN
Learning-based Approximate k-NN Search in Graph Databases
csypeng/taumng_sigmod
csypeng/tauMG
Efficient Approximate Nearest Neighbor Search in Multi-dimensional Databases (SIGMOD 2023)
Jacyhust/LSH-APG
This is a source code for LSH-APG (PVLDB 2023)
RSIA-LIESMARS-WHU/LSHBOX
A c++ toolbox of locality-sensitive hashing (LSH), provides several popular LSH algorithms, also support python and matlab.
qtwang/SEAnet
KDD21 Deep Learning Embeddings for Data Series Similarity Search
vdaas/vald
Vald. A Highly Scalable Distributed Vector Search Engine
lyst/rpforest
It is a forest of random projection trees
cmuparlay/pbbsbench-vldb2024
Version of PBBS Benchmarks for VLDB 2024 Reviewers
harsha-simhadri/big-ann-benchmarks
Framework for evaluating ANNS algorithms on billion scale datasets.
gaoj0017/ADSampling
[SIGMOD 2023] High-Dimensional Approximate Nearest Neighbor Search: with Reliable and Efficient Distance Comparison Operations
DSM-fudan/Dumpy
Dumpy: A Compact and Adaptive Index for Large Data Series Collections (SIGMOD'23)