The iris data set is most widely used beginner's dataset for machine learning purposes.
Data set Attribute:
- sepal length in cm
- sepal width in cm
- petal length in cm
- petal width in cm
- Species (class):
-- Iris Setosa
-- Iris Versicolour
-- Iris Virginica
The dataset contains 150 samples also having four features:length and width of sepals and petals and 50 samples of these three species.
The dataset is often used in data mining, classification and clustering examples and to test algorithms.
The dataset contain two clusters with complete sepration:1.Iris setosa
2.Iris virginica ,Iris versicolor
Machile leraning algorithm :
1.Classifiction -supervised learning
2.K means clustering -unsupervised learning
ca be used to predict the type of species based on other features.