shrubbery

Some utilities to find things out in an Afrikaans text corpus.

Primary language: Python. License: MIT.

Installation

Get the code

Download the latest version from GitHub, or clone the repository using git:

git clone https://github.com/avisagie/shrubbery.git

Get the punkt model

Run nltk_download.py. A window will pop up that lets you choose what to download, including models.

Download the punkt model.
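
If the download window is not available (for example on a machine without a display), the model can probably be fetched from a script instead. The sketch below is an assumption about how this could be done, not a copy of nltk_download.py. It relies on NLTK's nltk.data.find and nltk.download calls; the ensure_punkt name and its nltk_module parameter are hypothetical and exist only for illustration. Recent NLTK releases may look for a resource named punkt_tab rather than punkt, so the resource name may need adjusting.

```python
import importlib


def ensure_punkt(nltk_module=None):
    """Return True if the punkt model is available, downloading it if needed.

    nltk_module is a hypothetical hook that lets a caller pass in a
    stand-in for the nltk package; by default the real package is imported.
    """
    nltk = nltk_module or importlib.import_module("nltk")
    try:
        # Raises LookupError when the resource is not installed locally.
        nltk.data.find("tokenizers/punkt")
        return True
    except LookupError:
        # Needs network access; returns True on success, False otherwise.
        return bool(nltk.download("punkt", quiet=True))


if __name__ == "__main__":
    print("punkt available:", ensure_punkt())
```

Running the file directly prints whether the model ended up installed.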

Environment

You need Python. On Linux this is easy, since Python is usually preinstalled or available from the package manager. On Windows, install the Anaconda Python distribution.
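
A quick way to confirm that the environment is ready is to check that the required packages can be imported. This is a minimal sketch that assumes nltk is the only third-party package needed, which the README does not state explicitly; the missing_packages helper is a hypothetical name.

```python
import importlib.util
import sys


def missing_packages(names=("nltk",)):
    """Return the subset of top-level package names that cannot be found."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    print("Python version:", sys.version.split()[0])
    missing = missing_packages()
    if missing:
        print("Missing packages:", ", ".join(missing))
    else:
        print("All required packages were found.")
```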

Using

  1. Copy your .txt files into the data directory.
  2. Double-click run.py, or run it with Python from a terminal.
  3. Reports appear.
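
Before step 2, it can help to confirm that the corpus files are where they are expected. The sketch below lists the .txt files in a directory; it assumes the data directory sits next to run.py and that only files directly inside it are read, neither of which is confirmed here. The list_corpus_files helper is a hypothetical name and is not part of shrubbery.

```python
from pathlib import Path


def list_corpus_files(data_dir="data"):
    """Return the .txt files directly inside data_dir, sorted by path."""
    root = Path(data_dir)
    if not root.is_dir():
        # A missing directory is treated as an empty corpus.
        return []
    return sorted(p for p in root.glob("*.txt") if p.is_file())


if __name__ == "__main__":
    files = list_corpus_files()
    print(f"Found {len(files)} .txt file(s) in the data directory")
    for path in files:
        print(" ", path.name)
```

An empty result suggests the files were copied to the wrong place or do not have a .txt extension.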