caiselvas/language-identification
An NLP project leveraging character trigrams and smoothing techniques (Lidstone, Linear Discounting, Absolute Discounting) for language identification. Trained on for Spanish, Italian, English, French, Dutch, and German, achieving 99.8932% accuracy. Includes datasets, model parameters, and comprehensive documentation.
Jupyter Notebook
Issues
- 1
Documentation and report
#3 opened by pauhidalgoo - 0
Validation to-do
#2 opened by pauhidalgoo - 2
Testing (and validation) time
#1 opened by pauhidalgoo