On Github billxiao0 / courseracapstone
Created by Bill Xiao
Final project of Data Science CapStone
Main task is to develop a data prediction model
Source data is from Cousera Dataset
The prediction model will predict the next word after user has input a few words
There are 8 tasks of the project:
Based on Natural Language Processing
All 3 sets of data were used to train the model:
N-Gram Back Off model was used for development
Data was cleaned and processed into data frames with corresponding sorted cumulative frequencies
Katz Back-off algorithm was used for the word prediction
Access the shiny app here
The Shiny App takes words input and predict the next word
Enter the words in the input box and press the "Predict" button
The Next Word will be shown in the output box
The Note will explain which N-gram algorithm was used to predict the next word
Thank you for taking time to evaluate my app and have a good day :)