- DEEP10M and GloVe200: https://ann-benchmarks.com/index.html#datasets
- SIFT10M, GIST, Tiny5M and GloVe300: https://www.cse.cuhk.edu.hk/systems/hash/gqr/datasets.html
- DEEP100M: https://research.yandex.com/blog/benchmarks-for-billion-scale-similarity-search
- Prerequisite : openmp, cmake, boost
- Prepare and Compile:
- Go to the root directory of PEOs.
- Put the base set(.fvecs), query set(.fvecs) and groundtruth set(.ivecs) into XXX folder, where XXX is the name of dataset.
- Check the parameter setting in the script run_XXX.sh.
- Execute the following commands:
$ cd /path/to/project
$ mkdir -p build && cd build
$ cmake .. && make -j$ bash run_XXX.sh$ bash run_XXX.shThe parmeter M in HNSW is fixed to 32.