LanguageMachines/frog

[question] Which datasets are used for training each specific module

olix20 opened this issue · 2 comments

Hi and thanks for the great work!

Can i check if there's a documentation on what datasets were used for training each specific module of Frog (eg. pos tagger/dependency parser)?

Specifically I'd like to know if Frog is suitable for spoken language and if CGN annotations were used for training.

Assuming this is not relevant anymore. (sorry for ignoring it)

Let's at least point to the documentation for completion's sake, this section should answer some of those questions: https://frognlp.readthedocs.io/en/latest/advanced.html