/gibberlite

r-package to detect gibber in text data

Primary LanguageROtherNOASSERTION

gibberlite

R build status

⏬ Installation

You can install the development version of gibberlite from GitHub with:

# install.packages("devtools")
devtools::install_github("Glender/gibberlite")

Overview

With the gibberlite package you can sssess whether a sentence contains gibberish words. A Markov chain inspects the sequence of vowels and consonents to estimate whether a sentence consists of natural words. Therefore, words like ‘asdfg’ and ‘dfrgfh’ are considered unnatural and are classified as gibberish. The accuracy of the classification improves when a sentence contains more characters.

📖 Usage

library(gibberlite)
# create vector with character data
text <- c(
 "Personally I'm always ready to learn, although I do not always like being taught.",
 "asdfg",
 "Computer",
 "dfhdfghd",
 "I love to walk."
)

# assess if text is gibber
is_gibberish(text)
#> [1] FALSE  TRUE FALSE  TRUE FALSE

💬 Looking for something more fancy?

The gibberlite packages implements a very simple model. If you want a state-of-the-art solution, see the gibber package: https://github.com/Glender/gibber