Using language modelling with word level recognition
dutta1kartik3 opened this issue · 1 comments
Hi @jpuigcerver,
Could you please point me to how I could modify the existing Laia framework to build a character-level language model for decoding the output of a word-level recognition system trained on IAM? This would be analogous to how a word-level language model is currently used with the line-level recognizer, for example on IAM.
Thanks
Hi,
In https://github.com/jpuigcerver/Laia/tree/master/egs/iam/utils there are scripts to create a character-level language model and to decode images using it.
In particular:
https://github.com/jpuigcerver/Laia/blob/master/egs/iam/utils/build_char_lm.sh
https://github.com/jpuigcerver/Laia/blob/master/egs/iam/utils/build_char_fsts.sh
However, these scripts are not very well tested, so I cannot guarantee that they are bug-free or that they work well.
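To make the idea of a character-level language model over word transcripts concrete, here is a minimal Python sketch. It is not what the linked scripts do and it does not use Laia at all: the function names, the `<s>`/`</s>` boundary symbols, the toy word list, and the add-one smoothed bigram model are all assumptions chosen for illustration. A real setup would more likely use a higher-order n-gram model compiled into FSTs for decoding.

```python
import math
from collections import Counter

# Hypothetical boundary symbols; not taken from Laia or its scripts.
BOS, EOS = "<s>", "</s>"


def to_char_tokens(word):
    """Split a word transcript into character tokens with boundary symbols."""
    return [BOS] + list(word) + [EOS]


def train_char_bigram(words):
    """Count character histories and bigrams over a list of word transcripts."""
    history_counts, bigram_counts = Counter(), Counter()
    vocab = {EOS}
    for word in words:
        tokens = to_char_tokens(word)
        vocab.update(tokens[1:])  # every symbol that can be predicted
        for prev, cur in zip(tokens, tokens[1:]):
            history_counts[prev] += 1
            bigram_counts[(prev, cur)] += 1
    return history_counts, bigram_counts, vocab


def log_prob(word, model):
    """Add-one smoothed log-score of a word under the bigram model.

    Characters never seen in training still get a small, nonzero score;
    the sketch does not reserve probability mass for them explicitly.
    """
    history_counts, bigram_counts, vocab = model
    tokens = to_char_tokens(word)
    total = 0.0
    for prev, cur in zip(tokens, tokens[1:]):
        numerator = bigram_counts[(prev, cur)] + 1
        denominator = history_counts[prev] + len(vocab)
        total += math.log(numerator / denominator)
    return total


# Toy training data (an assumption; IAM word transcripts would go here).
model = train_char_bigram(["the", "then", "there", "that"])
print(log_prob("the", model) > log_prob("hte", model))
```

In a word-level recognizer, a score like this would be combined with the optical model's character posteriors during decoding, so that hypotheses with implausible character sequences are penalized.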
If you have a more concrete question, please re-open the issue and I'll try to address it.