My fork of the original GENIA Tagger
The main purpose of this repository is to serve as a reference for my Java port. The original source code (v 3.0.1) is found with the tag original.
For info about the program, see the original README.
- The original generated binary
geniatagger
can only be called from its same path as it makes static local references to some files. You can use the the simple bash scriptapply-geniatagger
to overcome this.
- The program has to load with every run some big dictionary files which can take quite a lot of time (~15s on a modern machine). The program should be rewritten to be more library-oriented and be able to keep the loaded dictionaries in memory. If you run on the JVM, you can use my Java port.