/language-detector

Language detector implementation in Python

Primary LanguagePythonGNU General Public License v3.0GPL-3.0

About

This is a basic language detector. It is comparing language profiles distance. The implementation is based on here.

Dataset

Datasets were taken from here.

Usage

from src.detection import get_closest_language as DetectLanguage

text_trk = "Bu Türkçe bir kelimedir"
print(DetectLanguage(text_trk))

text_eng = "This is an English word"
print(DetectLanguage(text_eng))

Language

The default languages are Turkish, English, Russian, German, French and Japanese If you want to analyse more language, you just need to add random article to dataset directory.