/ngrams

Simple abstraction of n-grams in Scala

Primary LanguageScala

ngrams

Simple abstraction of n-grams in Scala

GDELT -- the global database of events, language, and tone -- is a public data set. See Leetaru, Kalev and Schrodt, Philip. (2013). GDELT: Global Data on Events, Language, and Tone, 1979-2012. International Studies Association Annual Conference, April 2013. San Diego, CA. See more at: http://gdelt.utdallas.edu/about.html#howtocite GDELT is used in this project simply for testing; no endorsement of any kind is implied. Only the first few thousand records from GDELT are used, concatenated together into a single test file.