The Iris Dataset – Developping Data Products Course Project



The Iris Dataset – Developping Data Products Course Project

0 0


developping-data-products-slidify


On Github jlanga-coursera / developping-data-products-slidify

The Iris Dataset

Developping Data Products Course Project

Created by Jorge Langa

The Iris dataset

  • Introduced by statistician Ronald Fisher

  • 3 species of Iris:

    • I. setosa
    • I. virginica
    • I. versicolor
  • 4 measurements:

    • Sepal: lenght and width (in cm.)
    • Petal: length and width (in cm.)

Working in R

data(iris) # Load the dataset in case it's not
str(iris)
## 'data.frame':    150 obs. of  5 variables:
##  $ Sepal.Length: num  5.1 4.9 4.7 4.6 5 5.4 4.6 5 4.4 4.9 ...
##  $ Sepal.Width : num  3.5 3 3.2 3.1 3.6 3.9 3.4 3.4 2.9 3.1 ...
##  $ Petal.Length: num  1.4 1.4 1.3 1.5 1.4 1.7 1.4 1.5 1.4 1.5 ...
##  $ Petal.Width : num  0.2 0.2 0.2 0.2 0.2 0.4 0.3 0.2 0.2 0.1 ...
##  $ Species     : Factor w/ 3 levels "setosa","versicolor",..: 1 1 1 1 1 1 1 1 1 1 ...

Taking a look into the dataset

pairs(iris[,1:4],col=iris$Species)

Developping a Shiny app

  • To do data exploration without knowing R programming!

  • Here is the app!

  • Choose a species

  • Choose what to visualize (petal or sepal)

  • Et violà!