A web-based distributed multi-task annotation framework following sequential annotation method.
- Clone repository
pip install -r requirements.txt
- Copy
settings.sample.py
tosettings.py
and make appropriate changes. - Run application server
python server.py
Note: Sample corpus and table data is included in data/corpus/sample
and data/tables/sample
respectively.
- Task 1: Sentence Boundary
- Task 2: Canonical Word Order (a.k.a. Anvaya)
- Task 3: Token Classification (e.g. Named Entity Recognition)
- Task 4: Token Graph (e.g. Dependency Graph, Action Graph)
- Task 5: Token Connection (e.g. Co-reference Resolution)
- Task 6: Sentence Classification
- Task 7: Sentence Graph (e.g. Discourse Graph)
- Task 8: Token Text Annotation (e.g. Lemmatization)