SlideShare a Scribd company logo
6
Most read
7
Most read
8
Most read
Data Cube Computation and Data Generalization
What is Data generalization?Data generalization is a process that abstracts a large set of task-relevant data in a database from a relatively low conceptual level to higher conceptual levels.
What are efficient methods for Data Cube Computation?Different Data cube materialization include Full CubeIceberg CubeClosed CubeShell Cube
General Strategies for Cube Computation    1: Sorting, hashing, and grouping.2: Simultaneous aggregation and caching intermediate results.3: Aggregation from the smallest child, when there exist multiple child cuboids.4: The Apriori pruning method can be explored to compute iceberg cubes efficiently
What is Apriori Property?The Apriori property, in the context of data cubes, states as follows: If a given cell does not satisfy minimum support, then no descendant (i.e., more specialized or detailed version) of the cell will satisfy minimum support either. This property can be used to substantially reduce the computation of iceberg cubes.
The Full Cube   The Multi way Array Aggregation (or simply Multi Way) method computes a full data cube by using a multidimensional array as its basic data structurePartition the array into chunksCompute aggregates by visiting (i.e., accessing the values at) cube cells
BUC: Computing Iceberg Cubes from the Apex Cuboid’s DownwardBUC stands for “Bottom-Up Construction" , BUC is an algorithm for the computation of sparse and iceberg cubes. Unlike Multi Way, BUC constructs the cube from the apex cuboids' toward the base cuboids'. This allows BUC to share data partitioning costs. This order of processing also allows BUC to prune during construction, using the Apriori property. (for algorithm refer wiki)
Development of Data Cube and OLAP TechnologyDiscovery-Driven Exploration of Data Cubes Tools need to be developed to assist users in intelligently exploring the huge aggregated space of a data cube. Discovery-driven exploration is such a cube exploration approach.Complex Aggregation at Multiple Granularity: Multi feature Cubes Data cubes facilitate the answering of data mining queries as they allow the computation of aggregate data at multiple levels of granularity
Constrained Gradient Analysis in Data CubesConstrained multidimensional gradient analysis reduces the search space and derives interesting results. It incorporates the following types of constraints:Significance constraintProbe constraintGradient constraint
Alternative Method for Data GeneralizationAttribute-Oriented Induction for Data CharacterizationThe attribute-oriented induction approach is basically a query-oriented, generalization-based, on-line data analysis technique The general idea of attribute-oriented induction is to first collect the task-relevant data using a database query and then perform generalization based on the examination of the number of distinct values of each attribute in the relevant set of data
Cont..Attribute generalization is based on the following rule: If there is a large set of distinct values for an attribute in the initial working relation, and there exists a set of generalization operators on the attribute, then a generalization operator should be selected and applied to the attribute.
Different ways to control a generalization process   The control of how high an attribute should be generalized is typically quite subjective. The control of this process is called attribute generalization control.Attribute generalization threshold controlGeneralized relation threshold control
Mining ClassesData collectionDimension relevance analysisSynchronous generalizationPresentation of the derived comparison
Visit more self help tutorialsPick a tutorial of your choice and browse through it at your own pace.The tutorials section is free, self-guiding and will not involve any additional support.Visit us at www.dataminingtools.net
Ad

Recommended

Database security
Database security
Software Engineering
 
Clustering in Data Mining
Clustering in Data Mining
Archana Swaminathan
 
Data cube computation
Data cube computation
Rashmi Sheikh
 
Multidimensional schema of data warehouse
Multidimensional schema of data warehouse
kunjan shah
 
Data reduction
Data reduction
kalavathisugan
 
Dbms
Dbms
Rupali Salunkhe
 
Data Reduction
Data Reduction
Rajan Shah
 
Knowledge Discovery and Data Mining
Knowledge Discovery and Data Mining
Amritanshu Mehra
 
1.2 steps and functionalities
1.2 steps and functionalities
Krish_ver2
 
Data cubes
Data cubes
Mohammed
 
Kdd process
Kdd process
Rajesh Chandra
 
Distributed database
Distributed database
ReachLocal Services India
 
Data mining primitives
Data mining primitives
lavanya marichamy
 
weak slot and filler
weak slot and filler
BMS Institute of Technology and Management
 
14. Query Optimization in DBMS
14. Query Optimization in DBMS
koolkampus
 
Mining Frequent Patterns, Association and Correlations
Mining Frequent Patterns, Association and Correlations
Justin Cletus
 
introduction to NOSQL Database
introduction to NOSQL Database
nehabsairam
 
Relational model
Relational model
Dabbal Singh Mahara
 
3. mining frequent patterns
3. mining frequent patterns
Azad public school
 
Data Integration and Transformation in Data mining
Data Integration and Transformation in Data mining
kavitha muneeshwaran
 
Density based clustering
Density based clustering
YaswanthHariKumarVud
 
Data Mining: Application and trends in data mining
Data Mining: Application and trends in data mining
DataminingTools Inc
 
Relational algebra ppt
Relational algebra ppt
GirdharRatne
 
The Object Model
The Object Model
yndaravind
 
Naming in Distributed System
Naming in Distributed System
MNM Jain Engineering College
 
2.3 bayesian classification
2.3 bayesian classification
Krish_ver2
 
OLAP in Data Warehouse
OLAP in Data Warehouse
SOMASUNDARAM T
 
01 Data Mining: Concepts and Techniques, 2nd ed.
01 Data Mining: Concepts and Techniques, 2nd ed.
Institute of Technology Telkom
 
Olap Cube Design
Olap Cube Design
h1m
 
MS SQL SERVER: Olap cubes and data mining
MS SQL SERVER: Olap cubes and data mining
DataminingTools Inc
 

More Related Content

What's hot (20)

1.2 steps and functionalities
1.2 steps and functionalities
Krish_ver2
 
Data cubes
Data cubes
Mohammed
 
Kdd process
Kdd process
Rajesh Chandra
 
Distributed database
Distributed database
ReachLocal Services India
 
Data mining primitives
Data mining primitives
lavanya marichamy
 
weak slot and filler
weak slot and filler
BMS Institute of Technology and Management
 
14. Query Optimization in DBMS
14. Query Optimization in DBMS
koolkampus
 
Mining Frequent Patterns, Association and Correlations
Mining Frequent Patterns, Association and Correlations
Justin Cletus
 
introduction to NOSQL Database
introduction to NOSQL Database
nehabsairam
 
Relational model
Relational model
Dabbal Singh Mahara
 
3. mining frequent patterns
3. mining frequent patterns
Azad public school
 
Data Integration and Transformation in Data mining
Data Integration and Transformation in Data mining
kavitha muneeshwaran
 
Density based clustering
Density based clustering
YaswanthHariKumarVud
 
Data Mining: Application and trends in data mining
Data Mining: Application and trends in data mining
DataminingTools Inc
 
Relational algebra ppt
Relational algebra ppt
GirdharRatne
 
The Object Model
The Object Model
yndaravind
 
Naming in Distributed System
Naming in Distributed System
MNM Jain Engineering College
 
2.3 bayesian classification
2.3 bayesian classification
Krish_ver2
 
OLAP in Data Warehouse
OLAP in Data Warehouse
SOMASUNDARAM T
 
01 Data Mining: Concepts and Techniques, 2nd ed.
01 Data Mining: Concepts and Techniques, 2nd ed.
Institute of Technology Telkom
 
1.2 steps and functionalities
1.2 steps and functionalities
Krish_ver2
 
14. Query Optimization in DBMS
14. Query Optimization in DBMS
koolkampus
 
Mining Frequent Patterns, Association and Correlations
Mining Frequent Patterns, Association and Correlations
Justin Cletus
 
introduction to NOSQL Database
introduction to NOSQL Database
nehabsairam
 
Data Integration and Transformation in Data mining
Data Integration and Transformation in Data mining
kavitha muneeshwaran
 
Data Mining: Application and trends in data mining
Data Mining: Application and trends in data mining
DataminingTools Inc
 
Relational algebra ppt
Relational algebra ppt
GirdharRatne
 
The Object Model
The Object Model
yndaravind
 
2.3 bayesian classification
2.3 bayesian classification
Krish_ver2
 
OLAP in Data Warehouse
OLAP in Data Warehouse
SOMASUNDARAM T
 

Viewers also liked (17)

Olap Cube Design
Olap Cube Design
h1m
 
MS SQL SERVER: Olap cubes and data mining
MS SQL SERVER: Olap cubes and data mining
DataminingTools Inc
 
Data mining notes
Data mining notes
AVC College of Engineering
 
Data Mining: Data cube computation and data generalization
Data Mining: Data cube computation and data generalization
Datamining Tools
 
Dimensionality Reduction
Dimensionality Reduction
mrizwan969
 
Datacube
Datacube
man2sandsce17
 
Dimensionality reduction
Dimensionality reduction
Shatakirti Er
 
Different type of databases
Different type of databases
Shwe Yee
 
Concept description characterization and comparison
Concept description characterization and comparison
ric_biet
 
1.7 data reduction
1.7 data reduction
Krish_ver2
 
1.8 discretization
1.8 discretization
Krish_ver2
 
Apriori Algorithm
Apriori Algorithm
International School of Engineering
 
Substitution Cipher
Substitution Cipher
Agung Julisman
 
Data Mining: Association Rules Basics
Data Mining: Association Rules Basics
Benazir Income Support Program (BISP)
 
Data mining (lecture 1 & 2) conecpts and techniques
Data mining (lecture 1 & 2) conecpts and techniques
Saif Ullah
 
Data mining slides
Data mining slides
smj
 
Data mining
Data mining
Akannsha Totewar
 
Olap Cube Design
Olap Cube Design
h1m
 
MS SQL SERVER: Olap cubes and data mining
MS SQL SERVER: Olap cubes and data mining
DataminingTools Inc
 
Data Mining: Data cube computation and data generalization
Data Mining: Data cube computation and data generalization
Datamining Tools
 
Dimensionality Reduction
Dimensionality Reduction
mrizwan969
 
Dimensionality reduction
Dimensionality reduction
Shatakirti Er
 
Different type of databases
Different type of databases
Shwe Yee
 
Concept description characterization and comparison
Concept description characterization and comparison
ric_biet
 
1.7 data reduction
1.7 data reduction
Krish_ver2
 
1.8 discretization
1.8 discretization
Krish_ver2
 
Data mining (lecture 1 & 2) conecpts and techniques
Data mining (lecture 1 & 2) conecpts and techniques
Saif Ullah
 
Data mining slides
Data mining slides
smj
 
Ad

Similar to Data Mining: Data cube computation and data generalization (20)

05 cubetech
05 cubetech
JoonyoungJayGwak
 
Data Mining and Warehousing Concept and Techniques
Data Mining and Warehousing Concept and Techniques
AnilkumarBrahmane2
 
Data Mining: Concepts and Techniques (3rd ed.) Chapter 5
Data Mining: Concepts and Techniques (3rd ed.) Chapter 5
FriendsofGADGETS
 
Chapter 5. Data Cube Technology.ppt
Chapter 5. Data Cube Technology.ppt
Subrata Kumer Paul
 
Data Mining: Concepts and Techniques (3rd ed.) — Chapter 5
Data Mining: Concepts and Techniques (3rd ed.) — Chapter 5
Salah Amean
 
19CS3052R-CO1-7-S7 ECE
19CS3052R-CO1-7-S7 ECE
Bharath123Maddipati
 
04 data mining : data generelization
04 data mining : data generelization
Institute of Technology Telkom
 
Data Warehouse Implementation
Data Warehouse Implementation
omayva
 
Lecture 8 is for best and you should read
Lecture 8 is for best and you should read
centralcollegepkr
 
Efficient_Cube_computation.ppt
Efficient_Cube_computation.ppt
Kulwinder Padda
 
5desc
5desc
Vishwajeet Gudadhe
 
data generalization and summarization
data generalization and summarization
janani thirupathi
 
OLAP Basics and Fundamentals by Bharat Kalia
OLAP Basics and Fundamentals by Bharat Kalia
Bharat Kalia
 
Dw-dm-part-01
Dw-dm-part-01
nash512
 
data mining and data warehousing PPT module 2
data mining and data warehousing PPT module 2
premajain3
 
mod 2.pdf
mod 2.pdf
ShivaprasadGouda3
 
2-Concept Hierarchy to Classification of DMS.pptx
2-Concept Hierarchy to Classification of DMS.pptx
shobyscms
 
Big Data with Rough Set Using Map- Reduce
Big Data with Rough Set Using Map- Reduce
ijircee
 
DECISION TREE CLUSTERING: A COLUMNSTORES TUPLE RECONSTRUCTION
DECISION TREE CLUSTERING: A COLUMNSTORES TUPLE RECONSTRUCTION
cscpconf
 
Multi dimensional model vs (1)
Multi dimensional model vs (1)
JamesDempsey1
 
Data Mining and Warehousing Concept and Techniques
Data Mining and Warehousing Concept and Techniques
AnilkumarBrahmane2
 
Data Mining: Concepts and Techniques (3rd ed.) Chapter 5
Data Mining: Concepts and Techniques (3rd ed.) Chapter 5
FriendsofGADGETS
 
Chapter 5. Data Cube Technology.ppt
Chapter 5. Data Cube Technology.ppt
Subrata Kumer Paul
 
Data Mining: Concepts and Techniques (3rd ed.) — Chapter 5
Data Mining: Concepts and Techniques (3rd ed.) — Chapter 5
Salah Amean
 
Data Warehouse Implementation
Data Warehouse Implementation
omayva
 
Lecture 8 is for best and you should read
Lecture 8 is for best and you should read
centralcollegepkr
 
Efficient_Cube_computation.ppt
Efficient_Cube_computation.ppt
Kulwinder Padda
 
data generalization and summarization
data generalization and summarization
janani thirupathi
 
OLAP Basics and Fundamentals by Bharat Kalia
OLAP Basics and Fundamentals by Bharat Kalia
Bharat Kalia
 
Dw-dm-part-01
Dw-dm-part-01
nash512
 
data mining and data warehousing PPT module 2
data mining and data warehousing PPT module 2
premajain3
 
2-Concept Hierarchy to Classification of DMS.pptx
2-Concept Hierarchy to Classification of DMS.pptx
shobyscms
 
Big Data with Rough Set Using Map- Reduce
Big Data with Rough Set Using Map- Reduce
ijircee
 
DECISION TREE CLUSTERING: A COLUMNSTORES TUPLE RECONSTRUCTION
DECISION TREE CLUSTERING: A COLUMNSTORES TUPLE RECONSTRUCTION
cscpconf
 
Multi dimensional model vs (1)
Multi dimensional model vs (1)
JamesDempsey1
 
Ad

More from DataminingTools Inc (20)

Terminology Machine Learning
Terminology Machine Learning
DataminingTools Inc
 
Techniques Machine Learning
Techniques Machine Learning
DataminingTools Inc
 
Machine learning Introduction
Machine learning Introduction
DataminingTools Inc
 
Areas of machine leanring
Areas of machine leanring
DataminingTools Inc
 
AI: Planning and AI
AI: Planning and AI
DataminingTools Inc
 
AI: Logic in AI 2
AI: Logic in AI 2
DataminingTools Inc
 
AI: Logic in AI
AI: Logic in AI
DataminingTools Inc
 
AI: Learning in AI 2
AI: Learning in AI 2
DataminingTools Inc
 
AI: Learning in AI
AI: Learning in AI
DataminingTools Inc
 
AI: Introduction to artificial intelligence
AI: Introduction to artificial intelligence
DataminingTools Inc
 
AI: Belief Networks
AI: Belief Networks
DataminingTools Inc
 
AI: AI & Searching
AI: AI & Searching
DataminingTools Inc
 
AI: AI & Problem Solving
AI: AI & Problem Solving
DataminingTools Inc
 
Data Mining: Text and web mining
Data Mining: Text and web mining
DataminingTools Inc
 
Data Mining: Outlier analysis
Data Mining: Outlier analysis
DataminingTools Inc
 
Data Mining: Mining stream time series and sequence data
Data Mining: Mining stream time series and sequence data
DataminingTools Inc
 
Data Mining: Mining ,associations, and correlations
Data Mining: Mining ,associations, and correlations
DataminingTools Inc
 
Data Mining: Graph mining and social network analysis
Data Mining: Graph mining and social network analysis
DataminingTools Inc
 
Data warehouse and olap technology
Data warehouse and olap technology
DataminingTools Inc
 
Data Mining: Data processing
Data Mining: Data processing
DataminingTools Inc
 
AI: Introduction to artificial intelligence
AI: Introduction to artificial intelligence
DataminingTools Inc
 
Data Mining: Text and web mining
Data Mining: Text and web mining
DataminingTools Inc
 
Data Mining: Mining stream time series and sequence data
Data Mining: Mining stream time series and sequence data
DataminingTools Inc
 
Data Mining: Mining ,associations, and correlations
Data Mining: Mining ,associations, and correlations
DataminingTools Inc
 
Data Mining: Graph mining and social network analysis
Data Mining: Graph mining and social network analysis
DataminingTools Inc
 
Data warehouse and olap technology
Data warehouse and olap technology
DataminingTools Inc
 

Recently uploaded (20)

CapCut Pro Crack For PC Latest Version {Fully Unlocked} 2025
CapCut Pro Crack For PC Latest Version {Fully Unlocked} 2025
pcprocore
 
Connecting Data and Intelligence: The Role of FME in Machine Learning
Connecting Data and Intelligence: The Role of FME in Machine Learning
Safe Software
 
Coordinated Disclosure for ML - What's Different and What's the Same.pdf
Coordinated Disclosure for ML - What's Different and What's the Same.pdf
Priyanka Aash
 
Quantum AI: Where Impossible Becomes Probable
Quantum AI: Where Impossible Becomes Probable
Saikat Basu
 
"Scaling in space and time with Temporal", Andriy Lupa.pdf
"Scaling in space and time with Temporal", Andriy Lupa.pdf
Fwdays
 
Smarter Aviation Data Management: Lessons from Swedavia Airports and Sweco
Smarter Aviation Data Management: Lessons from Swedavia Airports and Sweco
Safe Software
 
OpenACC and Open Hackathons Monthly Highlights June 2025
OpenACC and Open Hackathons Monthly Highlights June 2025
OpenACC
 
Using the SQLExecutor for Data Quality Management: aka One man's love for the...
Using the SQLExecutor for Data Quality Management: aka One man's love for the...
Safe Software
 
Techniques for Automatic Device Identification and Network Assignment.pdf
Techniques for Automatic Device Identification and Network Assignment.pdf
Priyanka Aash
 
Hyderabad MuleSoft In-Person Meetup (June 21, 2025) Slides
Hyderabad MuleSoft In-Person Meetup (June 21, 2025) Slides
Ravi Tamada
 
Security Tips for Enterprise Azure Solutions
Security Tips for Enterprise Azure Solutions
Michele Leroux Bustamante
 
Cyber Defense Matrix Workshop - RSA Conference
Cyber Defense Matrix Workshop - RSA Conference
Priyanka Aash
 
"How to survive Black Friday: preparing e-commerce for a peak season", Yurii ...
"How to survive Black Friday: preparing e-commerce for a peak season", Yurii ...
Fwdays
 
Python Conference Singapore - 19 Jun 2025
Python Conference Singapore - 19 Jun 2025
ninefyi
 
ReSTIR [DI]: Spatiotemporal reservoir resampling for real-time ray tracing ...
ReSTIR [DI]: Spatiotemporal reservoir resampling for real-time ray tracing ...
revolcs10
 
UserCon Belgium: Honey, VMware increased my bill
UserCon Belgium: Honey, VMware increased my bill
stijn40
 
" How to survive with 1 billion vectors and not sell a kidney: our low-cost c...
" How to survive with 1 billion vectors and not sell a kidney: our low-cost c...
Fwdays
 
“MPU+: A Transformative Solution for Next-Gen AI at the Edge,” a Presentation...
“MPU+: A Transformative Solution for Next-Gen AI at the Edge,” a Presentation...
Edge AI and Vision Alliance
 
From Manual to Auto Searching- FME in the Driver's Seat
From Manual to Auto Searching- FME in the Driver's Seat
Safe Software
 
9-1-1 Addressing: End-to-End Automation Using FME
9-1-1 Addressing: End-to-End Automation Using FME
Safe Software
 
CapCut Pro Crack For PC Latest Version {Fully Unlocked} 2025
CapCut Pro Crack For PC Latest Version {Fully Unlocked} 2025
pcprocore
 
Connecting Data and Intelligence: The Role of FME in Machine Learning
Connecting Data and Intelligence: The Role of FME in Machine Learning
Safe Software
 
Coordinated Disclosure for ML - What's Different and What's the Same.pdf
Coordinated Disclosure for ML - What's Different and What's the Same.pdf
Priyanka Aash
 
Quantum AI: Where Impossible Becomes Probable
Quantum AI: Where Impossible Becomes Probable
Saikat Basu
 
"Scaling in space and time with Temporal", Andriy Lupa.pdf
"Scaling in space and time with Temporal", Andriy Lupa.pdf
Fwdays
 
Smarter Aviation Data Management: Lessons from Swedavia Airports and Sweco
Smarter Aviation Data Management: Lessons from Swedavia Airports and Sweco
Safe Software
 
OpenACC and Open Hackathons Monthly Highlights June 2025
OpenACC and Open Hackathons Monthly Highlights June 2025
OpenACC
 
Using the SQLExecutor for Data Quality Management: aka One man's love for the...
Using the SQLExecutor for Data Quality Management: aka One man's love for the...
Safe Software
 
Techniques for Automatic Device Identification and Network Assignment.pdf
Techniques for Automatic Device Identification and Network Assignment.pdf
Priyanka Aash
 
Hyderabad MuleSoft In-Person Meetup (June 21, 2025) Slides
Hyderabad MuleSoft In-Person Meetup (June 21, 2025) Slides
Ravi Tamada
 
Security Tips for Enterprise Azure Solutions
Security Tips for Enterprise Azure Solutions
Michele Leroux Bustamante
 
Cyber Defense Matrix Workshop - RSA Conference
Cyber Defense Matrix Workshop - RSA Conference
Priyanka Aash
 
"How to survive Black Friday: preparing e-commerce for a peak season", Yurii ...
"How to survive Black Friday: preparing e-commerce for a peak season", Yurii ...
Fwdays
 
Python Conference Singapore - 19 Jun 2025
Python Conference Singapore - 19 Jun 2025
ninefyi
 
ReSTIR [DI]: Spatiotemporal reservoir resampling for real-time ray tracing ...
ReSTIR [DI]: Spatiotemporal reservoir resampling for real-time ray tracing ...
revolcs10
 
UserCon Belgium: Honey, VMware increased my bill
UserCon Belgium: Honey, VMware increased my bill
stijn40
 
" How to survive with 1 billion vectors and not sell a kidney: our low-cost c...
" How to survive with 1 billion vectors and not sell a kidney: our low-cost c...
Fwdays
 
“MPU+: A Transformative Solution for Next-Gen AI at the Edge,” a Presentation...
“MPU+: A Transformative Solution for Next-Gen AI at the Edge,” a Presentation...
Edge AI and Vision Alliance
 
From Manual to Auto Searching- FME in the Driver's Seat
From Manual to Auto Searching- FME in the Driver's Seat
Safe Software
 
9-1-1 Addressing: End-to-End Automation Using FME
9-1-1 Addressing: End-to-End Automation Using FME
Safe Software
 

Data Mining: Data cube computation and data generalization

  • 1. Data Cube Computation and Data Generalization
  • 2. What is Data generalization?Data generalization is a process that abstracts a large set of task-relevant data in a database from a relatively low conceptual level to higher conceptual levels.
  • 3. What are efficient methods for Data Cube Computation?Different Data cube materialization include Full CubeIceberg CubeClosed CubeShell Cube
  • 4. General Strategies for Cube Computation 1: Sorting, hashing, and grouping.2: Simultaneous aggregation and caching intermediate results.3: Aggregation from the smallest child, when there exist multiple child cuboids.4: The Apriori pruning method can be explored to compute iceberg cubes efficiently
  • 5. What is Apriori Property?The Apriori property, in the context of data cubes, states as follows: If a given cell does not satisfy minimum support, then no descendant (i.e., more specialized or detailed version) of the cell will satisfy minimum support either. This property can be used to substantially reduce the computation of iceberg cubes.
  • 6. The Full Cube The Multi way Array Aggregation (or simply Multi Way) method computes a full data cube by using a multidimensional array as its basic data structurePartition the array into chunksCompute aggregates by visiting (i.e., accessing the values at) cube cells
  • 7. BUC: Computing Iceberg Cubes from the Apex Cuboid’s DownwardBUC stands for “Bottom-Up Construction" , BUC is an algorithm for the computation of sparse and iceberg cubes. Unlike Multi Way, BUC constructs the cube from the apex cuboids' toward the base cuboids'. This allows BUC to share data partitioning costs. This order of processing also allows BUC to prune during construction, using the Apriori property. (for algorithm refer wiki)
  • 8. Development of Data Cube and OLAP TechnologyDiscovery-Driven Exploration of Data Cubes Tools need to be developed to assist users in intelligently exploring the huge aggregated space of a data cube. Discovery-driven exploration is such a cube exploration approach.Complex Aggregation at Multiple Granularity: Multi feature Cubes Data cubes facilitate the answering of data mining queries as they allow the computation of aggregate data at multiple levels of granularity
  • 9. Constrained Gradient Analysis in Data CubesConstrained multidimensional gradient analysis reduces the search space and derives interesting results. It incorporates the following types of constraints:Significance constraintProbe constraintGradient constraint
  • 10. Alternative Method for Data GeneralizationAttribute-Oriented Induction for Data CharacterizationThe attribute-oriented induction approach is basically a query-oriented, generalization-based, on-line data analysis technique The general idea of attribute-oriented induction is to first collect the task-relevant data using a database query and then perform generalization based on the examination of the number of distinct values of each attribute in the relevant set of data
  • 11. Cont..Attribute generalization is based on the following rule: If there is a large set of distinct values for an attribute in the initial working relation, and there exists a set of generalization operators on the attribute, then a generalization operator should be selected and applied to the attribute.
  • 12. Different ways to control a generalization process The control of how high an attribute should be generalized is typically quite subjective. The control of this process is called attribute generalization control.Attribute generalization threshold controlGeneralized relation threshold control
  • 13. Mining ClassesData collectionDimension relevance analysisSynchronous generalizationPresentation of the derived comparison
  • 14. Visit more self help tutorialsPick a tutorial of your choice and browse through it at your own pace.The tutorials section is free, self-guiding and will not involve any additional support.Visit us at www.dataminingtools.net