CLUSTER ANALYSIS

PREPARED BY SABA KHAN
PRESENTED TO IMTIAZ ARIF
ID 4640
What is Cluster Analysis?
     It is a descriptive analysis technique which groups
     objects (respondents, products, firms, variables,
     etc.) so that each object is similar to the other
     objects in the cluster and different from objects in
     all the other clusters.




What is Cluster Analysis?
 Cluster: a collection of data objects
   Similar to one another within the same cluster
   Dissimilar to the objects in other clusters


 Cluster analysis
   Finding similarities between data according to the
   characteristics found in the data and grouping
   similar data objects into clusters
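
As a rough illustration of the definition above, the sketch below compares the average distance within each cluster to the average distance between two clusters. The points and their cluster assignments are made-up values, not data from these slides, and Euclidean distance is only one assumed way of measuring similarity.

```python
from itertools import combinations, product
from math import dist

# Hypothetical 2-D objects already assigned to two clusters (illustrative only).
cluster_a = [(1.0, 1.0), (1.5, 2.0), (2.0, 1.0)]
cluster_b = [(8.0, 8.0), (9.0, 8.5), (8.5, 9.5)]


def mean_within(cluster):
    """Average Euclidean distance over all pairs inside one cluster."""
    pairs = list(combinations(cluster, 2))
    return sum(dist(p, q) for p, q in pairs) / len(pairs)


def mean_between(c1, c2):
    """Average Euclidean distance over all cross-cluster pairs."""
    pairs = list(product(c1, c2))
    return sum(dist(p, q) for p, q in pairs) / len(pairs)


within_a = mean_within(cluster_a)
within_b = mean_within(cluster_b)
between_ab = mean_between(cluster_a, cluster_b)

print(f"within A: {within_a:.2f}")
print(f"within B: {within_b:.2f}")
print(f"between A and B: {between_ab:.2f}")
```

A grouping that fits the definition should show small within-cluster distances relative to the between-cluster distance.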
When to use cluster analysis?
     The essence of all clustering approaches is the classification of
        data as suggested by “natural” groupings of the data themselves.
     Simply put, use cluster analysis when you want any of the
        following:
          Taxonomy development (segmentation)
          Data simplification
          Relationship identification
Applications
     It is used in marketing to segment the market, on social
        networking sites to form new groups based on users' data,
        and on Flickr's photo map and other map sites to reduce
        the number of markers on a map.
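
One simple way to reduce the number of markers on a map is to group nearby points into grid cells and draw one marker per cell. The sketch below assumes this grid approach purely for illustration; it is not a description of how Flickr or any particular map site does it, and the coordinates and `cell_size` value are hypothetical.

```python
from collections import defaultdict

# Hypothetical photo locations as (latitude, longitude); illustrative only.
photos = [
    (24.861, 67.001), (24.862, 67.003), (24.865, 67.002),
    (31.520, 74.358), (31.522, 74.360),
    (33.684, 73.048),
]


def cluster_markers(points, cell_size=0.05):
    """Group points by grid cell; return one (centroid, count) marker per cell."""
    cells = defaultdict(list)
    for lat, lon in points:
        # Points falling in the same cell_size x cell_size square share a key.
        key = (int(lat // cell_size), int(lon // cell_size))
        cells[key].append((lat, lon))

    markers = []
    for members in cells.values():
        n = len(members)
        centroid = (
            sum(lat for lat, _ in members) / n,
            sum(lon for _, lon in members) / n,
        )
        markers.append((centroid, n))
    return markers


markers = cluster_markers(photos)
for (lat, lon), count in markers:
    print(f"marker at ({lat:.3f}, {lon:.3f}) stands for {count} photo(s)")
```

A larger `cell_size` merges more points into each marker, which is the usual trade-off between map clutter and detail.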
    
Examples of Clustering Applications

 • Marketing: Help marketers discover distinct groups in their
customer bases, and then use this knowledge to develop
targeted marketing programs.
 • Land use: Identification of areas of similar land use in an
earth observation database.
 • Insurance: Identifying groups of motor insurance policy
holders with a high average claim cost.
 • City-planning: Identifying groups of houses according to
their house type, value, and geographical location.
 • Earthquake studies: Observed earthquake epicenters
  should be clustered along continental faults.
Assumptions for Cluster Analysis
     Sufficient sample size is needed to ensure representativeness
        of the population and its underlying structure, particularly
        small groups within the population.
       Outliers can severely distort the representativeness of the
        results if they appear as structure (clusters) that is
        inconsistent with the research objectives.
       Representativeness of the sample. The sample must
        represent the population the research question concerns.
       Impact of multicollinearity. Input variables should be
        examined for substantial multicollinearity and, if it is
        present:
       Reduce the variables to equal numbers in each set of
        correlated measures.
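
The multicollinearity check above can be sketched as a scan of pairwise Pearson correlations between input variables. The variable names, scores, and the 0.9 cut-off for "substantial" are all assumptions made for this example, not values from the slides.

```python
from itertools import combinations
from math import sqrt

# Hypothetical input variables (one list of scores per variable).
variables = {
    "price_importance": [1, 2, 3, 4, 5, 6],
    "discount_seeking": [2, 4, 5, 8, 10, 12],  # moves almost in step with the first
    "brand_loyalty": [5, 1, 4, 2, 6, 3],
}

THRESHOLD = 0.9  # assumed cut-off for "substantial" correlation


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    spread_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    spread_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (spread_x * spread_y)


# Collect every pair of variables whose correlation reaches the cut-off.
flagged = []
for name_a, name_b in combinations(variables, 2):
    r = pearson(variables[name_a], variables[name_b])
    if abs(r) >= THRESHOLD:
        flagged.append((name_a, name_b, r))

for name_a, name_b, r in flagged:
    print(f"{name_a} and {name_b} are highly correlated (r = {r:.2f})")
```

Flagged pairs point to sets of correlated measures; how many variables to keep from each set remains a judgment for the analyst.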


HOW TO DEFINE CLUSTERS
     [Figure: Cluster A and Cluster B, with labels 1, 2 and 3]
We will now go to SPSS for
     analysis.

      Open judges.sav
      Analyze > Classify > Hierarchical Cluster
      Select all variables.
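
For readers without SPSS, the idea behind hierarchical (agglomerative) clustering can be sketched in a few lines of Python. This is a minimal sketch under stated assumptions: the case names and scores are invented (judges.sav is not reproduced here), and average linkage with Euclidean distance is assumed, so the results need not match what SPSS reports for its own settings.

```python
from itertools import combinations
from math import dist

# Hypothetical scores (rows = cases, columns = variables); not the judges.sav data.
cases = {
    "case_1": (9.0, 8.5, 9.2),
    "case_2": (8.8, 8.7, 9.0),
    "case_3": (6.1, 5.9, 6.3),
    "case_4": (6.0, 6.2, 6.1),
    "case_5": (7.5, 7.4, 7.6),
}


def average_linkage(c1, c2):
    """Mean Euclidean distance between all members of two clusters."""
    total = sum(dist(cases[a], cases[b]) for a in c1 for b in c2)
    return total / (len(c1) * len(c2))


def agglomerate(names):
    """Start with one cluster per case, repeatedly merge the closest pair,
    and return the merge history as (left, right, distance) tuples."""
    clusters = [(name,) for name in names]
    history = []
    while len(clusters) > 1:
        left, right = min(
            combinations(clusters, 2),
            key=lambda pair: average_linkage(*pair),
        )
        history.append((left, right, average_linkage(left, right)))
        clusters = [c for c in clusters if c not in (left, right)]
        clusters.append(left + right)
    return history


history = agglomerate(cases)
for left, right, height in history:
    print(f"merge {left} + {right} at distance {height:.2f}")
```

The merge history carries the same information a dendrogram displays: which clusters join, and at what distance.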




