Types of clustering:
Clustering can be divided into different categories based on different criteria:
• 1. Hard clustering: A given data point in n-dimensional space belongs to exactly one cluster. This is also known as exclusive
clustering. The K-Means algorithm is an example of hard clustering.
• 2. Soft clustering: A given data point can belong to more than one cluster. This is also known as overlapping
clustering. The Fuzzy K-Means algorithm is a good example of soft clustering.
• 3. Hierarchical clustering: A hierarchy of clusters is built using either the top-down (divisive) or the bottom-up
(agglomerative) approach.
• 4. Flat clustering: A simple technique in which no hierarchy is present.
• 5. Model-based clustering: The data is modeled using a standard statistical model, which allows it to work with different
distributions. The idea is to find the model that best fits the data.
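The bottom-up (agglomerative) approach from item 3 can be sketched in a few lines. This is a minimal illustration only: it assumes one-dimensional points and single-linkage distance (the gap between the two closest members), and the function name and sample values are hypothetical, not taken from the slides.

```python
def agglomerative(points, target_clusters):
    """Bottom-up clustering of 1-D points using single-linkage distance."""
    # Start with every point in its own cluster.
    clusters = [[p] for p in points]
    merges = []  # record of (left cluster, right cluster, distance) per merge
    while len(clusters) > target_clusters:
        best = None
        # Find the pair of clusters whose closest members are nearest.
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                d = min(abs(a - b) for a in clusters[i] for b in clusters[j])
                if best is None or d < best[0]:
                    best = (d, i, j)
        d, i, j = best
        merges.append((clusters[i], clusters[j], d))
        # Merge the pair; the merge history forms the hierarchy.
        clusters[i] = clusters[i] + clusters[j]
        del clusters[j]
    return clusters, merges


# Hypothetical sample values, chosen only for illustration.
clusters, merges = agglomerative([1, 2, 9, 10, 25], target_clusters=3)
print(clusters)
```

Stopping at a chosen number of clusters gives a flat result; letting the loop run down to a single cluster yields the full hierarchy in `merges`.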
Different clustering algorithms
• Fuzzy K-Means:
• The K-Means algorithm performs hard clustering, in which each data point belongs to exactly one cluster. However,
there are situations where one point belongs to more than one cluster. For example, a news article may belong to
both the Technology and Current Affairs categories. In that case, we need a soft clustering mechanism.
• The Fuzzy K-Means algorithm implements soft clustering and generates overlapping clusters. Each point is assigned a
degree of membership (which can be read as a probability) in each cluster, based on its distance from each centroid.
• In this example, we apply the Fuzzy K-Means algorithm to the dataset
(22, 80), (25, 75), (28, 85), (55, 150), (50, 145), (53, 153), (38, 115).
The outcome of the example is given in the accompanying figure. Note that the newly added data point (someone
of medium weight and height) belongs to cluster 3 with probability 0.52 and to cluster 1 with probability
0.47, whereas the other data points (people who are either large or small) belong to a particular cluster
with a probability of nearly 0.9.
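The membership computation can be sketched as follows. This is a minimal NumPy version of the standard fuzzy c-means update, not the tool used to produce the figure. The fuzziness exponent `m = 2`, the random initialisation, and the function name are assumptions, so the memberships it prints need not match the 0.52 and 0.47 values quoted above.

```python
import numpy as np


def fuzzy_k_means(points, k, m=2.0, n_iter=100, tol=1e-6, seed=0):
    """Return (centroids, memberships); each membership row sums to 1."""
    rng = np.random.default_rng(seed)
    # Assumed initialisation: random memberships normalised per point.
    u = rng.random((points.shape[0], k))
    u /= u.sum(axis=1, keepdims=True)
    for _ in range(n_iter):
        # Centroids are means of the points weighted by membership ** m.
        w = u ** m
        centroids = (w.T @ points) / w.sum(axis=0)[:, None]
        # Distance from every point to every centroid, shape (n, k).
        dist = np.linalg.norm(points[:, None, :] - centroids[None, :, :], axis=2)
        dist = np.maximum(dist, 1e-12)  # avoid division by zero
        # Membership falls off with distance: closer centroid, higher degree.
        inv = dist ** (-2.0 / (m - 1.0))
        new_u = inv / inv.sum(axis=1, keepdims=True)
        converged = np.abs(new_u - u).max() < tol
        u = new_u
        if converged:
            break
    return centroids, u


# The dataset from the example above.
data = np.array(
    [[22, 80], [25, 75], [28, 85], [55, 150], [50, 145], [53, 153], [38, 115]],
    dtype=float,
)
centroids, memberships = fuzzy_k_means(data, k=3)
print(np.round(memberships, 2))
```

Taking the largest membership in each row turns the soft result back into a hard assignment, which is one way to compare it with plain K-Means.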
Streaming K-Means
• If the volume of data is too large to be stored in the available main memory, the K-Means algorithm is not suitable, as its
batch processing mechanism iterates over all the data points. Also, the K-Means algorithm is sensitive to noise and outliers
in the data.
• The Streaming K-Means algorithm addresses these problems by operating in two steps:
• The streaming step
• The ball K-Means step
• In the streaming step, data points are read sequentially and very few of them are kept in memory.
• This first step produces a smaller, more representative set of weighted data points for further processing.
• The final K clusters are produced in the ball K-Means step, during which potential outliers are eliminated.
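The two steps can be sketched roughly as below. This is a simplified illustration, not a production implementation: it assumes a fixed distance cutoff and deterministic merging in the streaming step, and a trimming radius of one third of the distance to the nearest other center in the ball step. The function names and the `cutoff` and `trim` parameters are hypothetical.

```python
import numpy as np


def streaming_step(stream, cutoff):
    """One pass over the stream, keeping only a small weighted sketch."""
    centroids, weights = [], []
    for x in stream:
        x = np.asarray(x, dtype=float)
        if centroids:
            dists = [np.linalg.norm(x - c) for c in centroids]
            j = int(np.argmin(dists))
            if dists[j] <= cutoff:
                # Fold the point into the nearest centroid (running mean).
                weights[j] += 1.0
                centroids[j] += (x - centroids[j]) / weights[j]
                continue
        # Too far from every centroid (or first point): start a new one.
        centroids.append(x.copy())
        weights.append(1.0)
    return np.array(centroids), np.array(weights)


def ball_k_means(points, weights, k, n_iter=10, trim=1 / 3):
    """Weighted K-Means on the sketch; points outside each ball are ignored."""
    # Assumed seeding: the k heaviest sketch points (needs at least k of them).
    centers = points[np.argsort(weights)[::-1][:k]].copy()
    for _ in range(n_iter):
        dists = np.linalg.norm(points[:, None, :] - centers[None, :, :], axis=2)
        nearest = dists.argmin(axis=1)
        updated = centers.copy()
        for j in range(k):
            others = np.delete(centers, j, axis=0)
            radius = trim * np.linalg.norm(others - centers[j], axis=1).min() if k > 1 else np.inf
            # Only points inside the ball contribute, which trims outliers.
            in_ball = (nearest == j) & (dists[:, j] <= radius)
            if in_ball.any():
                updated[j] = np.average(points[in_ball], axis=0, weights=weights[in_ball])
        centers = updated
    return centers


stream = [(22, 80), (25, 75), (28, 85), (55, 150), (50, 145), (53, 153), (38, 115)]
sketch_points, sketch_weights = streaming_step(stream, cutoff=10.0)
centers = ball_k_means(sketch_points, sketch_weights, k=2)
print(sketch_points, sketch_weights, centers, sep="\n")
```

Only the sketch is held in memory, so the second step runs on far fewer points than the original stream contained.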
Spectral clustering
• The spectral clustering algorithm is helpful in difficult, nonconvex clustering problems. It clusters points using
the eigenvectors of matrices derived from the data, such as a matrix of pairwise similarities between points.
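A small sketch of the idea for the two-cluster case follows. It assumes a Gaussian similarity matrix and the unnormalized graph Laplacian, and splits the points by the sign of the eigenvector belonging to the second-smallest eigenvalue. The test data (two concentric rings), the `sigma` value, and the function names are illustrative assumptions.

```python
import numpy as np


def spectral_bipartition(points, sigma=1.0):
    """Split points into two clusters using a graph Laplacian eigenvector."""
    # Pairwise squared distances turned into Gaussian similarities.
    sq = ((points[:, None, :] - points[None, :, :]) ** 2).sum(axis=2)
    affinity = np.exp(-sq / (2.0 * sigma ** 2))
    np.fill_diagonal(affinity, 0.0)
    # Unnormalized graph Laplacian: L = D - W.
    laplacian = np.diag(affinity.sum(axis=1)) - affinity
    # eigh returns eigenvalues in ascending order for symmetric matrices.
    _, eigvecs = np.linalg.eigh(laplacian)
    # The second eigenvector is nearly constant on each well-separated group.
    return (eigvecs[:, 1] > 0).astype(int)


def ring(radius, n):
    """n points evenly spaced on a circle of the given radius."""
    angles = np.linspace(0.0, 2.0 * np.pi, n, endpoint=False)
    return np.column_stack([radius * np.cos(angles), radius * np.sin(angles)])


# Two concentric rings: a nonconvex layout that centroid-based methods split badly.
points = np.vstack([ring(1.0, 20), ring(4.0, 40)])
labels = spectral_bipartition(points, sigma=0.5)
print(labels)
```

For more than two clusters, a common extension is to take the first k eigenvectors and run K-Means on their rows.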
Dirichlet clustering
• The Fuzzy K-Means and K-Means algorithms model clusters as spheres (circles in two dimensions, hyperspheres in n-dimensional
space). K-Means assumes a common fixed variance. Further, K-Means does not model the data point distribution.
• The K-Means and Fuzzy K-Means algorithms work effectively only when the data is normally distributed. If the data
distribution is different, for example, an asymmetrical normal distribution (different standard deviations), the K-Means
algorithm will not give good results.
• Dirichlet clustering can effectively model different data distributions (data points that are not normally distributed).
It fits a model over a dataset and tunes the model's parameters so that the model correctly
fits the data. This approach is suitable to address the hierarchical-clustering problem.
Types of clustering and different types of clustering algorithms
