On Github kawenks / devdatprod
for the Johns Hopkins Bloomberg School of Public Health
Data Science Specialization -- Developing Data Products
via Coursera.org
May 2015
This Shiny App uses the Auto dataset in the ISLR package. It sets up a Random Forest prediction model to determine a car's acceleration based on several attributes.
Feature Selection and Cross Validation Fine-Tuning
Random Forest has lower RMSE and explains more of the data variability.
Model RMSE R^2 RMSE sd R^2 sd Random Forest 1.4171 0.7508 0.2087 0.0730 Bayesian Generalized Linear Model 1.5208 0.7210 0.2411 0.0970 Generalized Additive Model 1.4838 0.7121 0.1437 0.0850