/text_search

Some fast-ish algorithms for batch text search in moderate-sized collections, intended for data cleanup

Primary LanguagePython

Installation

With pip

pip install fasttextsearch

For developers

pip install numpy

git clone https://github.com/danpovey/text_search
cd text_search

mkdir build
cd build
cmake ..
make -j
make test

# set PYTHONPATH so that you can use "import textsearch"

export PYTHONPATH=$PWD/../textsearch/python:$PWD/lib:$PYTHONPATH

Now you can use

python3 -c "import textsearch; print(textsearch.__file__)"

Caution: We did not use either python3 setup.py install or pip install. We only set the environment variable PYTHONPATH.