Classification and clustering are two learning methods that organize objects into groups
based on one or more features. The two processes look similar, but in the context of data
mining they differ in several ways.
No. | Classification | Clustering
1 | Classification assigns data to classes with the help of class labels. | Clustering is similar to classification, but there are no predefined class labels.
2 | We have a set of predefined classes and want to know which class a new object belongs to. | Clustering tries to group a set of objects and find whether there is some relationship between the objects.
3 | In the context of machine learning, classification is supervised learning. | In the context of machine learning, clustering is unsupervised learning.
4 | A training sample is provided in the classification method. | A training sample is not provided in the clustering method.
5 | Prior knowledge of the classes is available. | Prior knowledge of the classes is not available.
6 | The tools used in classification analysis are decision trees, Bayesian classifiers, etc. | The tools mainly used in cluster analysis are k-means, Expectation-Maximization (EM), etc.
Example of classification: In a banking application, a customer who applies for a loan may be
classified as safe or risky according to his or her age and salary. This is supervised
learning: in the learning step, a model is built from an already defined training set, in
which each record carries an attribute called the class label that indicates which class the
record belongs to. The constructed model, which may take the form of a decision tree or a set
of rules, can then be used to classify new data.
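The learning step and the classification step can be sketched in a few lines of Python. This is only a minimal illustration, assuming a one-rule decision tree (a "decision stump") as the model; the training records, the label names "safe" and "risky", and the helper names `learn_stump` and `classify` are all hypothetical and not taken from any real banking system.

```python
# Hypothetical training set: (age, salary) -> class label. All values are invented.
TRAINING = [
    ((22, 18000), "risky"),
    ((25, 24000), "risky"),
    ((31, 30000), "risky"),
    ((35, 52000), "safe"),
    ((42, 61000), "safe"),
    ((50, 75000), "safe"),
]
FEATURES = ("age", "salary")


def learn_stump(rows):
    """Learning step: find the rule 'feature <= threshold' with the fewest training errors."""
    best = None
    for f in range(len(FEATURES)):
        values = sorted({x[f] for x, _ in rows})
        # Candidate thresholds are midpoints between consecutive observed values.
        for lo, hi in zip(values, values[1:]):
            t = (lo + hi) / 2
            # Try both ways of assigning the two labels to the two sides of the split.
            for below, above in (("risky", "safe"), ("safe", "risky")):
                errors = sum(
                    (below if x[f] <= t else above) != label for x, label in rows
                )
                if best is None or errors < best[0]:
                    best = (errors, f, t, below, above)
    return best[1:]  # (feature index, threshold, label below, label above)


def classify(rule, x):
    """Classification step: apply the learned rule to a new, unlabeled record."""
    f, t, below, above = rule
    return below if x[f] <= t else above


rule = learn_stump(TRAINING)
print("rule:", FEATURES[rule[0]], "<=", rule[1], "->", rule[2], "else", rule[3])
print("new applicant (28, 27000):", classify(rule, (28, 27000)))
print("new applicant (45, 58000):", classify(rule, (45, 58000)))
```

The point of the sketch is that the class labels in the training set drive the learning: without them, `learn_stump` would have no errors to count.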
Example of clustering: Given a set of animals described only by their features, a clustering
algorithm might form two clusters that can afterwards be interpreted as mammals and reptiles.
The mammal cluster includes humans, leopards, elephants, etc., while the reptile cluster
includes snakes, lizards, Komodo dragons, etc. The tools mainly used in cluster analysis are
k-means, k-medoids, density-based methods, hierarchical methods, and several others.
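The animal example can be sketched with a small k-means implementation in plain Python. This is an illustrative sketch, not a reference implementation: the three binary traits (has hair, lays eggs, warm-blooded) are simplified, hand-written assumptions, and the deterministic farthest-first initialisation is chosen here only so that the result is reproducible. Note that no class labels are passed to the algorithm.

```python
# Simplified, hand-written traits: (has_hair, lays_eggs, warm_blooded).
# These values are assumptions for illustration; no class labels are supplied.
ANIMALS = {
    "human": (1, 0, 1),
    "leopard": (1, 0, 1),
    "elephant": (1, 0, 1),
    "snake": (0, 1, 0),
    "lizard": (0, 1, 0),
    "komodo dragon": (0, 1, 0),
}


def sq_dist(a, b):
    """Squared Euclidean distance between two points."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def mean(points):
    """Component-wise mean of a non-empty list of points."""
    return tuple(sum(col) / len(points) for col in zip(*points))


def k_means(points, k, max_iter=100):
    # Farthest-first initialisation (an assumption of this sketch; random or
    # k-means++ initialisation is more common in practice).
    centroids = [points[0]]
    while len(centroids) < k:
        centroids.append(
            max(points, key=lambda p: min(sq_dist(p, c) for c in centroids))
        )
    assignment = None
    for _ in range(max_iter):
        # Assignment step: each point joins its nearest centroid.
        new_assignment = [
            min(range(k), key=lambda i: sq_dist(p, centroids[i])) for p in points
        ]
        if new_assignment == assignment:
            break  # converged: no point changed cluster
        assignment = new_assignment
        # Update step: move each centroid to the mean of its members.
        for i in range(k):
            members = [p for p, a in zip(points, assignment) if a == i]
            if members:  # keep the old centroid if a cluster is empty
                centroids[i] = mean(members)
    return assignment


names = list(ANIMALS)
labels = k_means([ANIMALS[n] for n in names], k=2)
clusters = {}
for name, label in zip(names, labels):
    clusters.setdefault(label, []).append(name)
print(clusters)
```

The algorithm only returns cluster indices such as 0 and 1. Names like "mammal" and "reptile" are an interpretation added by a person after inspecting the groups.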

Classification vs clustering
