ilo-token/ilo-token.github.io

Scrape wiktionary

Closed this issue · 1 comments

Currently, we manually encode conjugations and other properties in the dictionary. Perhaps we could instead make this semi automatic by scraping wiktionary to fetch conjugations and other properties.

This solution is becoming attractive because of complicated conjugations the verbs has (#39).

We still need to encode other properties such as what kind of adjective it is for adjective ordering (#17).

Potential additional use: scraping pronunciation to find out whether it starts with vowel or consonant sound to find out whether to use article "a" or "an".

thanks to jan Kita, we don't need to scrape wiktionary. instead we'll simply use https://github.com/spencermountain/compromise