Natural-Language-Processing-Prediction

Coursera Data Science Project

Johns Hopkins University

Description

Often when entering text into a mobile telephone or other handheld device, your are prompted with a likely next word. This is the jist of this application. The user will supply a partial phrase or sentence and the application will provide its best guess for the next word.

Data Set

The SwiftKey data set used to make this application consists of samples taken from three sources, the news, blogs and twitter messages. The data set is huge and must be rendered into a usable form that is both timely and compact.

Application

The application uses this data to predict the next word in a phrase. The method used for this prediction relies on a history of previous phrase fragments, known as Ngrams, to accomplish this. The application searches through the data and determines the next word based on statistical likelihood.

http://www.rpubs.com/tbalgrover/236683