Third programming assignment for the WIER (IEPS) faculty course. The main goal was to build a simple index and implement querying against it.
pip install -r requirements.txt
python -m nltk.downloader punkt, stopwords
- Navigate to the
implementation-indexing
directory - Create an inverse index with
python create-index.py
- Run the algorithms:
- run basic search with
python run-basic-search.py
or - run SQLite search with
python run-sqlite-search.py
If you get an SSL exception when downloading NLTK data, you can try to download the data manually from by running the following in the command line. A window will pop up, where you can select the data you want to download, in this case punkt
and stopwords
.
The solution is from gunthercox/ChatterBot#930 (comment).
import nltk
import ssl
try:
_create_unverified_https_context = ssl._create_unverified_context
except AttributeError:
pass
else:
ssl._create_default_https_context = _create_unverified_https_context
nltk.download()