clustering on document text using transformer embeddings
Clone the repository:
git clone https://github.com/prolego-team/doc-text-clustering.git
Use pyenv to install python v. 3.9.2:
pyenv install 3.9.2
Use poetry to create the environment and install dependencies. Note: If you would like the virtual environment to be created inside the project's root directory, first configure poetry settings (poetry config virtualenvs.in-project true
)
cd doc-text-clustering
poetry install