/code

Multi-task NLP Annotation Framework

Primary LanguageJavaScript

Antarlekhaka

A web-based distributed multi-task annotation framework following sequential annotation method.

Installation Instructions

  • Clone repository
  • pip install -r requirements.txt
  • Copy settings.sample.py to settings.py and make appropriate changes.
  • Run application server python server.py

Note: Sample corpus and table data is included in data/corpus/sample and data/tables/sample respectively.

Supported Tasks

  • Task 1: Sentence Boundary
  • Task 2: Canonical Word Order (a.k.a. Anvaya)
  • Task 3: Token Classification (e.g. Named Entity Recognition)
  • Task 4: Token Graph (e.g. Dependency Graph, Action Graph)
  • Task 5: Token Connection (e.g. Co-reference Resolution)
  • Task 6: Sentence Classification
  • Task 7: Sentence Graph (e.g. Discourse Graph)
  • Task 8: Token Text Annotation (e.g. Lemmatization)