/ULMFiT-Filipino-Tagalog

Primary LanguageJupyter NotebookApache License 2.0Apache-2.0

Language Model for Filipino (Tagalog) Language

This project contains relevant files for creating a language model for the Filipino language from the Filipino (Tagalog) Wikipedia corpus, Wiktionary, and Wikibooks.

Language model weights and the itos (index -> string) mapping pickle file is available for download here

The RNN was trained in 4 iterations with the following learning rates:

  1. 8
  2. 4
  3. 1.5
  4. 0.2

The learning rates were determined through the plot_lr() methods of fastai.

Performance:

Perplexity: 26.1997

Accuracy: 0.4403

TODO: