book vocabulary ordinarity analyser

Primary language: Jupyter Notebook. License: MIT.

Book vocabulary analyser

This notebook analyses the vocabulary of a PDF or EPUB book in order to evaluate how ordinary the book's most frequent words are.

If you learn vocabulary by reading books, especially fiction, you may have noticed that from time to time the reading becomes much easier.

This means that you have to be pickier when choosing what to read: the most popular texts tend to have ordinary vocabulary.

This study aims to solve that problem, at least at the stage of a Python prototype.

The most frequent words in the book are detected and then highlighted on the word distribution of an English corpus for comparison. For the comparison I used an English word-frequency dataset of 300,000 words.
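The comparison step could be sketched roughly as follows. This is only a minimal illustration, not the notebook's actual code: the helper names `top_words` and `corpus_ranks` are hypothetical, the tiny `corpus_counts` dictionary stands in for the real 300,000-word frequency dataset, and the book text is assumed to have already been extracted from the PDF or EPUB into a plain string.

```python
import re
from collections import Counter


def top_words(text, n=10):
    """Return the n most frequent lowercase words in text as (word, count) pairs."""
    # Simple tokenizer: runs of letters, optionally with one inner apostrophe.
    words = re.findall(r"[a-z]+(?:'[a-z]+)?", text.lower())
    return Counter(words).most_common(n)


def corpus_ranks(words, corpus_counts):
    """Map each word to its 1-based frequency rank in the corpus (None if absent)."""
    ranked = sorted(corpus_counts, key=corpus_counts.get, reverse=True)
    rank_of = {word: i for i, word in enumerate(ranked, start=1)}
    return {word: rank_of.get(word) for word in words}


# Toy stand-ins (assumptions): real input would be the extracted book text
# and the full English word-frequency dataset.
book_text = "The whale swam. The whale dived. The sea was calm."
corpus_counts = {"the": 1000, "was": 400, "sea": 50, "whale": 5}

book_top = top_words(book_text, n=2)
ranks = corpus_ranks([word for word, _ in book_top], corpus_counts)

for word, count in book_top:
    print(word, count, ranks[word])
```

A low corpus rank for a frequent book word (like "the" here) suggests ordinary vocabulary, while a high rank or a missing entry (like "whale" in this toy corpus) points to a word that is unusually prominent in the book.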

Well, unfortunately, the result is not as clear as I imagined it would be :(