HebrewTools/ParseTrainer

Load database with ETCBC data

Closed this issue · 2 comments

The ETCBC database contains all verb forms in the Hebrew Bible with parsing information. It should be relatively straightforward to extract these and put them in our database, provided of course that the copyright allows this.

Still needs to be done:

  • Checking the kind of weak root
  • What if there are duplicate forms that are not attested? It may be better to include only frequent verbs, or to use rules (such as the common gender of 1s forms, 2fp = 3fp in the imperfect, etc.)
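
A minimal sketch of the rule-based option, in case it helps the discussion. It expands one attested parsing into the set of parsings that share the same form, using only the two rules named above. The `Parsing` type, its field values and `equivalent_parsings` are hypothetical names made up for illustration; they are not the ETCBC encoding or anything in ParseTrainer.

```python
from typing import NamedTuple, Set


class Parsing(NamedTuple):
    # Hypothetical representation, not the ETCBC feature encoding.
    stem: str    # e.g. "qal"
    tense: str   # e.g. "perfect", "imperfect"
    person: str  # "1", "2" or "3"
    gender: str  # "m" or "f"
    number: str  # "s" or "p"


def equivalent_parsings(p: Parsing) -> Set[Parsing]:
    """Return p together with every parsing that the rules treat as the same form."""
    result = {p}

    # Rule 1: 1s forms are common gender, so both genders are acceptable.
    if p.person == "1" and p.number == "s":
        result.add(p._replace(gender="m"))
        result.add(p._replace(gender="f"))

    # Rule 2: in the imperfect, 2fp and 3fp are identical.
    if (p.tense == "imperfect" and p.gender == "f"
            and p.number == "p" and p.person in ("2", "3")):
        result.add(p._replace(person="2"))
        result.add(p._replace(person="3"))

    return result


# Usage: an attested 3fp imperfect would also be accepted as 2fp.
attested = Parsing("qal", "imperfect", "3", "f", "p")
accepted = equivalent_parsings(attested)
```

This only covers the two rules mentioned; the "etc." would need further rules, and whether that is less work than restricting to frequent verbs is an open question.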

Not worth the effort due to the second issue listed in the previous comment.