SlideShare a Scribd company logo
Data Mining Concepts
overviewHistory of DMXDMX IntroductionDMX objectsQuery SyntaxPrediction
History of DMXDMX was first introduced in the OLE DB for Data Mining specification authored by Microsoft in conjunction with other vendors in 1999.The goal of DMX is to define common concepts and common query expressions for the data mining world.It is similar to what SQL has done for databases.
overview of DMXData Mining Extensions (DMX) is a query language for Data Mining Models. It consists of: DDL (Data definition Language)The DDL of the DMX is used to create new data mining models and structures, export and import mining structures, copy or transfer data from one mining model to another and delete existing data mining models and mining structures. DML (Data Manipulation Language)The DML of the DMX is used to search and browse data in the data mining models, update the data mining models by insertion and updating of the data and derive predictions using the prediction query.
DMX objectsData Mining Extensions (DMX) is a language that you can use to create and work with data mining models in Microsoft SQL Server Analysis Services.DMX is used to create the structure of new data mining models, to train these models, and to browse, manage, and predict against them.There are two major objects that are used to manifest this transformation:The mining structure
The mining modelThe mining structureA mining structure is defined as a list of columns, with their data types and information describing how they should be handled.When a mining structure is processed, it contains a compressed cache  or copy  of the source data. This cache is used to train any models that are subsequently added to the structure. which can be queried to return its data or the distinct states that exist in any structure column. The cache is only maintained temporarily, and can be dropped at any time.
The mining modelA mining model is the object that transforms rows of data into cases and performs the machine learning using a specified data mining algorithm. A mining model is described as a subset of columns from the structure, how those columns are to be used as attributes along with the algorithm and parameters  to perform  machine learning on the structure data.Statistics about predictions are available as well, Additionally the learned patterns themselves can be queried to discover what the algorithm found. These patterns are generally referred to as the model content.
 Query SyntaxDMX statements  are used to create, process, delete, copy, browse, and predict against data mining models.The three basic steps for data mining process are:Creation
Prediction
TrainingCREATING MINING STRUCTURES:Creating mining structures is similar to creating tables in SQL.Syntax:CREATE [SESSION] MINING STRUCTURE <structure> ( [(<column definition list>)] )StructureA unique name for the structure.column definitionlistA comma-separated list of column definitions
CREATING MINING STRUCTURES:The following example creates a new mining structure called New Mailing.CREATE MINING STRUCTURE [New Mailing]    ( CustomerKey LONG KEY,    Gender TEXT DISCRETE,    [Number Cars Owned] LONG DISCRETE,    [Bike Buyer] LONG DISCRETE )
ALTERING MINING STRUCTURES:Creates a new mining model that is based on an existing mining structure. When you use the alter structure statement to create a new mining model, the structure must already exist. Syntax:ALTER MINING STRUCTURE <structure>      ADD MINING MODEL <model>      ( <column definition list> [(<nested column definition list>) [WITH FILTER (<nested filter criteria>)]] )      USING <algorithm> [(<parameter list>)] FILTER keyword is used to  filter condition.
ALTERING MINING STRUCTURESThe following example adds a Naive Bayes mining model to the New Mailing mining structure and limits the maximum number of attribute states to 50.ALTER MINING STRUCTURE [New Mailing]      ADD MINING MODEL [Naive Bayes]       ( CustomerKey,         Gender,        [Number Cars Owned],        [Bike Buyer] PREDICT )       USING Microsoft_Naive_Bayes (MAXIMUM_STATES = 50)
Data Types and Content typesThe following table shows the list of data types and content types for mining structure columns:Time Series models.Sequence Clustering models in nested tables.
DROP MINING MODEL Deletes a mining model from the database.Syntax:DROP MINING MODEL <model >ModelA model identifier.Ex:  The following sample code drops the mining model NBSample.DROP MINING MODEL [NBSample]
NESTED TABLESEx: Consider the following  case derived from two tables, one table that contains customer information and another table that contains customer purchases. A single customer in the customer table may have multiple purchases in the purchases table, which makes it difficult to describe the data using a single row. Analysis Services provides a unique method for handling these cases, by using nested tables. The concept of a nested table is demonstrated in the following illustration.
The first table is the parent table has information about customers, and associates a unique identifier for each customer. The second table, the child table, contains purchases for each customer. The purchases in the child table are related back to the parent table by the unique identifier, the CustomerKey column. The third table in the diagram shows the two tables combined.
PredictionPredictionmeansapplying the patterns that were found in the data to estimate unknown information. Examples:  of prediction might be predicting if a customer will or will not be good for a loan, estimating a credit score, determining to what cluster a case belongs, or predicting future values of a time series.
Prediction JoinUsing prediction join in this example we can come to conclusion that:‘‘if the kid is male and class is 5, then the highest scored subject is science.’’
Prediction Join syntaxSELECT [TOP <count>] <column references> FROM <mining model>    [[NATURAL] PREDICTION JOIN    <source-data> [ ON <mapping clause> ]   [ WHERE <condition clause> ]   [ ORDER BY <order clause> [DESC | ASC] ]]Count  Optional, An integer that specifies how many rows to return.column referencesA comma-separated list of column identifiers an expressions that are derived from the mining model.mining  modelA model identifier.source -dataThe source query.mapping clauseOptional, A logical expression that compares columns from the model to columns from the source query.condition clause Optional, A condition to restrict the values that are returned from the column list.order clause Optional, An expression that returns a scalar value.
summaryHistory of DMXDMX IntroductionDMX objectsQuery SyntaxPrediction join syntax
Ad

Recommended

MS SQL SERVER: Using the data mining tools
MS SQL SERVER: Using the data mining tools
DataminingTools Inc
 
Online SAP BO 4.2 Training
Online SAP BO 4.2 Training
ashok training
 
Ms access
Ms access
Amrita University
 
Msaccess
Msaccess
Manish Goyal
 
Quickstart Microsoft access 2013
Quickstart Microsoft access 2013
comatsg
 
Ms access
Ms access
Neetu Yadav
 
MS Access – Part 1 (One Day Workshop)
MS Access – Part 1 (One Day Workshop)
Khader Shaik
 
OBIEE publisher with Report creation - Tutorial
OBIEE publisher with Report creation - Tutorial
onlinetrainingplacements
 
Access presentation
Access presentation
DUSPviz
 
Ms access ppt 2017 by Gopal saha
Ms access ppt 2017 by Gopal saha
253253
 
Ms access
Ms access
Shubhanjali -
 
Microsoft access 2007
Microsoft access 2007
JaiRane007
 
Access Basics 01
Access Basics 01
Lets try
 
Intro to Microsoft Access
Intro to Microsoft Access
DUSPviz
 
Class viii ch-2 log on to access
Class viii ch-2 log on to access
jessandy
 
Ms access
Ms access
Nagarajan
 
Access 2010 Unit A PPT
Access 2010 Unit A PPT
okmomwalking
 
Dynamics AX 2009 Data Dictionary - Güven Şahin - 04.05.2013
Dynamics AX 2009 Data Dictionary - Güven Şahin - 04.05.2013
guvensahin
 
Ms Access
Ms Access
Swati Sinha
 
B.sc i agri u 4 introduction to ms access
B.sc i agri u 4 introduction to ms access
Rai University
 
Ms access 2007
Ms access 2007
Ramesh Pant
 
Introduction to microsoft access
Introduction to microsoft access
Hardik Patel
 
SAP BO Web Intelligence Basics
SAP BO Web Intelligence Basics
Kiran Joy
 
MS Access Training
MS Access Training
Michael Sheyahshe
 
Database notes ICTL
Database notes ICTL
Zarina Suleiman
 
Ms access 2010
Ms access 2010
Alsufaacademy
 
Microsoft Dynamics AX 2012 - Development Introduction Training - Part 3/3
Microsoft Dynamics AX 2012 - Development Introduction Training - Part 3/3
Fabio Filardi
 
Module 1 Database for ICTL Form 2
Module 1 Database for ICTL Form 2
fazihafid
 
Data Mining: Outlier analysis
Data Mining: Outlier analysis
DataminingTools Inc
 
Apt 502 distance tech handouts study guides and visuals
Apt 502 distance tech handouts study guides and visuals
Betty Howard
 

More Related Content

What's hot (20)

Access presentation
Access presentation
DUSPviz
 
Ms access ppt 2017 by Gopal saha
Ms access ppt 2017 by Gopal saha
253253
 
Ms access
Ms access
Shubhanjali -
 
Microsoft access 2007
Microsoft access 2007
JaiRane007
 
Access Basics 01
Access Basics 01
Lets try
 
Intro to Microsoft Access
Intro to Microsoft Access
DUSPviz
 
Class viii ch-2 log on to access
Class viii ch-2 log on to access
jessandy
 
Ms access
Ms access
Nagarajan
 
Access 2010 Unit A PPT
Access 2010 Unit A PPT
okmomwalking
 
Dynamics AX 2009 Data Dictionary - Güven Şahin - 04.05.2013
Dynamics AX 2009 Data Dictionary - Güven Şahin - 04.05.2013
guvensahin
 
Ms Access
Ms Access
Swati Sinha
 
B.sc i agri u 4 introduction to ms access
B.sc i agri u 4 introduction to ms access
Rai University
 
Ms access 2007
Ms access 2007
Ramesh Pant
 
Introduction to microsoft access
Introduction to microsoft access
Hardik Patel
 
SAP BO Web Intelligence Basics
SAP BO Web Intelligence Basics
Kiran Joy
 
MS Access Training
MS Access Training
Michael Sheyahshe
 
Database notes ICTL
Database notes ICTL
Zarina Suleiman
 
Ms access 2010
Ms access 2010
Alsufaacademy
 
Microsoft Dynamics AX 2012 - Development Introduction Training - Part 3/3
Microsoft Dynamics AX 2012 - Development Introduction Training - Part 3/3
Fabio Filardi
 
Module 1 Database for ICTL Form 2
Module 1 Database for ICTL Form 2
fazihafid
 
Access presentation
Access presentation
DUSPviz
 
Ms access ppt 2017 by Gopal saha
Ms access ppt 2017 by Gopal saha
253253
 
Microsoft access 2007
Microsoft access 2007
JaiRane007
 
Access Basics 01
Access Basics 01
Lets try
 
Intro to Microsoft Access
Intro to Microsoft Access
DUSPviz
 
Class viii ch-2 log on to access
Class viii ch-2 log on to access
jessandy
 
Access 2010 Unit A PPT
Access 2010 Unit A PPT
okmomwalking
 
Dynamics AX 2009 Data Dictionary - Güven Şahin - 04.05.2013
Dynamics AX 2009 Data Dictionary - Güven Şahin - 04.05.2013
guvensahin
 
B.sc i agri u 4 introduction to ms access
B.sc i agri u 4 introduction to ms access
Rai University
 
Introduction to microsoft access
Introduction to microsoft access
Hardik Patel
 
SAP BO Web Intelligence Basics
SAP BO Web Intelligence Basics
Kiran Joy
 
Microsoft Dynamics AX 2012 - Development Introduction Training - Part 3/3
Microsoft Dynamics AX 2012 - Development Introduction Training - Part 3/3
Fabio Filardi
 
Module 1 Database for ICTL Form 2
Module 1 Database for ICTL Form 2
fazihafid
 

Viewers also liked (20)

Data Mining: Outlier analysis
Data Mining: Outlier analysis
DataminingTools Inc
 
Apt 502 distance tech handouts study guides and visuals
Apt 502 distance tech handouts study guides and visuals
Betty Howard
 
Edit distance problem
Edit distance problem
Maher Alshammari
 
Chapter 12 outlier
Chapter 12 outlier
Houw Liong The
 
Outliers
Outliers
Chidirala Anil Shankar
 
Outlier and fraud detection using Hadoop
Outlier and fraud detection using Hadoop
Pranab Ghosh
 
Data mining: Concepts and Techniques, Chapter12 outlier Analysis
Data mining: Concepts and Techniques, Chapter12 outlier Analysis
Salah Amean
 
建築師法修正草案總說明
建築師法修正草案總說明
Filip Yang
 
Info Chimps: What Makes Infochimps.org Unique
Info Chimps: What Makes Infochimps.org Unique
DataminingTools Inc
 
RapidMiner: Setting Up A Process
RapidMiner: Setting Up A Process
DataminingTools Inc
 
LISP:Object System Lisp
LISP:Object System Lisp
DataminingTools Inc
 
XL-Miner: Timeseries
XL-Miner: Timeseries
DataminingTools Inc
 
SPSS: Data Editor
SPSS: Data Editor
DataminingTools Inc
 
Data Applied:Forecast
Data Applied:Forecast
DataminingTools Inc
 
Kidical Mass Presentation
Kidical Mass Presentation
Eugene SRTS
 
LISP: Declarations In Lisp
LISP: Declarations In Lisp
DataminingTools Inc
 
Retrieving Data From A Database
Retrieving Data From A Database
DataminingTools Inc
 
Association Rules
Association Rules
DataminingTools Inc
 
Bernoullis Random Variables And Binomial Distribution
Bernoullis Random Variables And Binomial Distribution
DataminingTools Inc
 
Ontwikkeling In Eigen Handen Nl Web
Ontwikkeling In Eigen Handen Nl Web
Infirmiers de rue ASBL
 
Apt 502 distance tech handouts study guides and visuals
Apt 502 distance tech handouts study guides and visuals
Betty Howard
 
Outlier and fraud detection using Hadoop
Outlier and fraud detection using Hadoop
Pranab Ghosh
 
Data mining: Concepts and Techniques, Chapter12 outlier Analysis
Data mining: Concepts and Techniques, Chapter12 outlier Analysis
Salah Amean
 
建築師法修正草案總說明
建築師法修正草案總說明
Filip Yang
 
Info Chimps: What Makes Infochimps.org Unique
Info Chimps: What Makes Infochimps.org Unique
DataminingTools Inc
 
RapidMiner: Setting Up A Process
RapidMiner: Setting Up A Process
DataminingTools Inc
 
Kidical Mass Presentation
Kidical Mass Presentation
Eugene SRTS
 
Bernoullis Random Variables And Binomial Distribution
Bernoullis Random Variables And Binomial Distribution
DataminingTools Inc
 
Ad

Similar to MS SQL SERVER: Data mining concepts and dmx (20)

Data Mining With SQL Server
Data Mining With SQL Server
Hoan Phuc
 
Chapter 1. Introduction
Chapter 1. Introduction
butest
 
Unit 3 part i Data mining
Unit 3 part i Data mining
Dhilsath Fathima
 
MS SQL SERVER: Using the data mining tools
MS SQL SERVER: Using the data mining tools
sqlserver content
 
Minería de Datos en Sql Server 2008
Minería de Datos en Sql Server 2008
Eduardo Castro
 
BAS 250 Lecture 1
BAS 250 Lecture 1
Wake Tech BAS
 
Knowledge discovery thru data mining
Knowledge discovery thru data mining
Devakumar Jain
 
Dwdmunit1 a
Dwdmunit1 a
bhagathk
 
20IT501_DWDM_PPT_Unit_II.ppt
20IT501_DWDM_PPT_Unit_II.ppt
Premkumar R
 
20IT501_DWDM_PPT_Unit_II.ppt
20IT501_DWDM_PPT_Unit_II.ppt
PalaniKumarR2
 
hanjia chapter_1.ppt data mining chapter 1
hanjia chapter_1.ppt data mining chapter 1
Ping261512
 
Overview of Data Mining
Overview of Data Mining
ijtsrd
 
Introduction of Data Mining - Concept and techniques
Introduction of Data Mining - Concept and techniques
SUMANTWACHASUNDAR1
 
01Intro.ppt data analytics r language slide 1
01Intro.ppt data analytics r language slide 1
MuhammadjunaidgulMuh1
 
Introduction to data warehouse
Introduction to data warehouse
Cognizant Technology Solutions
 
2 introductory slides
2 introductory slides
tafosepsdfasg
 
Introduction to Data Mining
Introduction to Data Mining
Sushil Kulkarni
 
Data Mining Presentation.pptx
Data Mining Presentation.pptx
ChingChingErm
 
Unit 1 (Chapter-1) on data mining concepts.ppt
Unit 1 (Chapter-1) on data mining concepts.ppt
PadmajaLaksh
 
Data Mining- Unit-I PPT (1).ppt
Data Mining- Unit-I PPT (1).ppt
AravindReddy565690
 
Data Mining With SQL Server
Data Mining With SQL Server
Hoan Phuc
 
Chapter 1. Introduction
Chapter 1. Introduction
butest
 
MS SQL SERVER: Using the data mining tools
MS SQL SERVER: Using the data mining tools
sqlserver content
 
Minería de Datos en Sql Server 2008
Minería de Datos en Sql Server 2008
Eduardo Castro
 
Knowledge discovery thru data mining
Knowledge discovery thru data mining
Devakumar Jain
 
Dwdmunit1 a
Dwdmunit1 a
bhagathk
 
20IT501_DWDM_PPT_Unit_II.ppt
20IT501_DWDM_PPT_Unit_II.ppt
Premkumar R
 
20IT501_DWDM_PPT_Unit_II.ppt
20IT501_DWDM_PPT_Unit_II.ppt
PalaniKumarR2
 
hanjia chapter_1.ppt data mining chapter 1
hanjia chapter_1.ppt data mining chapter 1
Ping261512
 
Overview of Data Mining
Overview of Data Mining
ijtsrd
 
Introduction of Data Mining - Concept and techniques
Introduction of Data Mining - Concept and techniques
SUMANTWACHASUNDAR1
 
01Intro.ppt data analytics r language slide 1
01Intro.ppt data analytics r language slide 1
MuhammadjunaidgulMuh1
 
2 introductory slides
2 introductory slides
tafosepsdfasg
 
Introduction to Data Mining
Introduction to Data Mining
Sushil Kulkarni
 
Data Mining Presentation.pptx
Data Mining Presentation.pptx
ChingChingErm
 
Unit 1 (Chapter-1) on data mining concepts.ppt
Unit 1 (Chapter-1) on data mining concepts.ppt
PadmajaLaksh
 
Data Mining- Unit-I PPT (1).ppt
Data Mining- Unit-I PPT (1).ppt
AravindReddy565690
 
Ad

More from DataminingTools Inc (20)

Terminology Machine Learning
Terminology Machine Learning
DataminingTools Inc
 
Techniques Machine Learning
Techniques Machine Learning
DataminingTools Inc
 
Machine learning Introduction
Machine learning Introduction
DataminingTools Inc
 
Areas of machine leanring
Areas of machine leanring
DataminingTools Inc
 
AI: Planning and AI
AI: Planning and AI
DataminingTools Inc
 
AI: Logic in AI 2
AI: Logic in AI 2
DataminingTools Inc
 
AI: Logic in AI
AI: Logic in AI
DataminingTools Inc
 
AI: Learning in AI 2
AI: Learning in AI 2
DataminingTools Inc
 
AI: Learning in AI
AI: Learning in AI
DataminingTools Inc
 
AI: Introduction to artificial intelligence
AI: Introduction to artificial intelligence
DataminingTools Inc
 
AI: Belief Networks
AI: Belief Networks
DataminingTools Inc
 
AI: AI & Searching
AI: AI & Searching
DataminingTools Inc
 
AI: AI & Problem Solving
AI: AI & Problem Solving
DataminingTools Inc
 
Data Mining: Text and web mining
Data Mining: Text and web mining
DataminingTools Inc
 
Data Mining: Mining stream time series and sequence data
Data Mining: Mining stream time series and sequence data
DataminingTools Inc
 
Data Mining: Mining ,associations, and correlations
Data Mining: Mining ,associations, and correlations
DataminingTools Inc
 
Data Mining: Graph mining and social network analysis
Data Mining: Graph mining and social network analysis
DataminingTools Inc
 
Data warehouse and olap technology
Data warehouse and olap technology
DataminingTools Inc
 
Data Mining: Data processing
Data Mining: Data processing
DataminingTools Inc
 
Data Mining: clustering and analysis
Data Mining: clustering and analysis
DataminingTools Inc
 
AI: Introduction to artificial intelligence
AI: Introduction to artificial intelligence
DataminingTools Inc
 
Data Mining: Text and web mining
Data Mining: Text and web mining
DataminingTools Inc
 
Data Mining: Mining stream time series and sequence data
Data Mining: Mining stream time series and sequence data
DataminingTools Inc
 
Data Mining: Mining ,associations, and correlations
Data Mining: Mining ,associations, and correlations
DataminingTools Inc
 
Data Mining: Graph mining and social network analysis
Data Mining: Graph mining and social network analysis
DataminingTools Inc
 
Data warehouse and olap technology
Data warehouse and olap technology
DataminingTools Inc
 
Data Mining: clustering and analysis
Data Mining: clustering and analysis
DataminingTools Inc
 

Recently uploaded (20)

GenAI Opportunities and Challenges - Where 370 Enterprises Are Focusing Now.pdf
GenAI Opportunities and Challenges - Where 370 Enterprises Are Focusing Now.pdf
Priyanka Aash
 
OpenPOWER Foundation & Open-Source Core Innovations
OpenPOWER Foundation & Open-Source Core Innovations
IBM
 
MuleSoft for AgentForce : Topic Center and API Catalog
MuleSoft for AgentForce : Topic Center and API Catalog
shyamraj55
 
FIDO Seminar: Authentication for a Billion Consumers - Amazon.pptx
FIDO Seminar: Authentication for a Billion Consumers - Amazon.pptx
FIDO Alliance
 
Securing AI - There Is No Try, Only Do!.pdf
Securing AI - There Is No Try, Only Do!.pdf
Priyanka Aash
 
FIDO Alliance Seminar State of Passkeys.pptx
FIDO Alliance Seminar State of Passkeys.pptx
FIDO Alliance
 
Security Tips for Enterprise Azure Solutions
Security Tips for Enterprise Azure Solutions
Michele Leroux Bustamante
 
cnc-processing-centers-centateq-p-110-en.pdf
cnc-processing-centers-centateq-p-110-en.pdf
AmirStern2
 
The Future of AI Agent Development Trends to Watch.pptx
The Future of AI Agent Development Trends to Watch.pptx
Lisa ward
 
AI VIDEO MAGAZINE - June 2025 - r/aivideo
AI VIDEO MAGAZINE - June 2025 - r/aivideo
1pcity Studios, Inc
 
Coordinated Disclosure for ML - What's Different and What's the Same.pdf
Coordinated Disclosure for ML - What's Different and What's the Same.pdf
Priyanka Aash
 
You are not excused! How to avoid security blind spots on the way to production
You are not excused! How to avoid security blind spots on the way to production
Michele Leroux Bustamante
 
Creating Inclusive Digital Learning with AI: A Smarter, Fairer Future
Creating Inclusive Digital Learning with AI: A Smarter, Fairer Future
Impelsys Inc.
 
OWASP Barcelona 2025 Threat Model Library
OWASP Barcelona 2025 Threat Model Library
PetraVukmirovic
 
Lessons Learned from Developing Secure AI Workflows.pdf
Lessons Learned from Developing Secure AI Workflows.pdf
Priyanka Aash
 
Securing Account Lifecycles in the Age of Deepfakes.pptx
Securing Account Lifecycles in the Age of Deepfakes.pptx
FIDO Alliance
 
Cyber Defense Matrix Workshop - RSA Conference
Cyber Defense Matrix Workshop - RSA Conference
Priyanka Aash
 
Tech-ASan: Two-stage check for Address Sanitizer - Yixuan Cao.pdf
Tech-ASan: Two-stage check for Address Sanitizer - Yixuan Cao.pdf
caoyixuan2019
 
OpenACC and Open Hackathons Monthly Highlights June 2025
OpenACC and Open Hackathons Monthly Highlights June 2025
OpenACC
 
FIDO Seminar: New Data: Passkey Adoption in the Workforce.pptx
FIDO Seminar: New Data: Passkey Adoption in the Workforce.pptx
FIDO Alliance
 
GenAI Opportunities and Challenges - Where 370 Enterprises Are Focusing Now.pdf
GenAI Opportunities and Challenges - Where 370 Enterprises Are Focusing Now.pdf
Priyanka Aash
 
OpenPOWER Foundation & Open-Source Core Innovations
OpenPOWER Foundation & Open-Source Core Innovations
IBM
 
MuleSoft for AgentForce : Topic Center and API Catalog
MuleSoft for AgentForce : Topic Center and API Catalog
shyamraj55
 
FIDO Seminar: Authentication for a Billion Consumers - Amazon.pptx
FIDO Seminar: Authentication for a Billion Consumers - Amazon.pptx
FIDO Alliance
 
Securing AI - There Is No Try, Only Do!.pdf
Securing AI - There Is No Try, Only Do!.pdf
Priyanka Aash
 
FIDO Alliance Seminar State of Passkeys.pptx
FIDO Alliance Seminar State of Passkeys.pptx
FIDO Alliance
 
Security Tips for Enterprise Azure Solutions
Security Tips for Enterprise Azure Solutions
Michele Leroux Bustamante
 
cnc-processing-centers-centateq-p-110-en.pdf
cnc-processing-centers-centateq-p-110-en.pdf
AmirStern2
 
The Future of AI Agent Development Trends to Watch.pptx
The Future of AI Agent Development Trends to Watch.pptx
Lisa ward
 
AI VIDEO MAGAZINE - June 2025 - r/aivideo
AI VIDEO MAGAZINE - June 2025 - r/aivideo
1pcity Studios, Inc
 
Coordinated Disclosure for ML - What's Different and What's the Same.pdf
Coordinated Disclosure for ML - What's Different and What's the Same.pdf
Priyanka Aash
 
You are not excused! How to avoid security blind spots on the way to production
You are not excused! How to avoid security blind spots on the way to production
Michele Leroux Bustamante
 
Creating Inclusive Digital Learning with AI: A Smarter, Fairer Future
Creating Inclusive Digital Learning with AI: A Smarter, Fairer Future
Impelsys Inc.
 
OWASP Barcelona 2025 Threat Model Library
OWASP Barcelona 2025 Threat Model Library
PetraVukmirovic
 
Lessons Learned from Developing Secure AI Workflows.pdf
Lessons Learned from Developing Secure AI Workflows.pdf
Priyanka Aash
 
Securing Account Lifecycles in the Age of Deepfakes.pptx
Securing Account Lifecycles in the Age of Deepfakes.pptx
FIDO Alliance
 
Cyber Defense Matrix Workshop - RSA Conference
Cyber Defense Matrix Workshop - RSA Conference
Priyanka Aash
 
Tech-ASan: Two-stage check for Address Sanitizer - Yixuan Cao.pdf
Tech-ASan: Two-stage check for Address Sanitizer - Yixuan Cao.pdf
caoyixuan2019
 
OpenACC and Open Hackathons Monthly Highlights June 2025
OpenACC and Open Hackathons Monthly Highlights June 2025
OpenACC
 
FIDO Seminar: New Data: Passkey Adoption in the Workforce.pptx
FIDO Seminar: New Data: Passkey Adoption in the Workforce.pptx
FIDO Alliance
 

MS SQL SERVER: Data mining concepts and dmx

  • 2. overviewHistory of DMXDMX IntroductionDMX objectsQuery SyntaxPrediction
  • 3. History of DMXDMX was first introduced in the OLE DB for Data Mining specification authored by Microsoft in conjunction with other vendors in 1999.The goal of DMX is to define common concepts and common query expressions for the data mining world.It is similar to what SQL has done for databases.
  • 4. overview of DMXData Mining Extensions (DMX) is a query language for Data Mining Models. It consists of: DDL (Data definition Language)The DDL of the DMX is used to create new data mining models and structures, export and import mining structures, copy or transfer data from one mining model to another and delete existing data mining models and mining structures. DML (Data Manipulation Language)The DML of the DMX is used to search and browse data in the data mining models, update the data mining models by insertion and updating of the data and derive predictions using the prediction query.
  • 5. DMX objectsData Mining Extensions (DMX) is a language that you can use to create and work with data mining models in Microsoft SQL Server Analysis Services.DMX is used to create the structure of new data mining models, to train these models, and to browse, manage, and predict against them.There are two major objects that are used to manifest this transformation:The mining structure
  • 6. The mining modelThe mining structureA mining structure is defined as a list of columns, with their data types and information describing how they should be handled.When a mining structure is processed, it contains a compressed cache or copy of the source data. This cache is used to train any models that are subsequently added to the structure. which can be queried to return its data or the distinct states that exist in any structure column. The cache is only maintained temporarily, and can be dropped at any time.
  • 7. The mining modelA mining model is the object that transforms rows of data into cases and performs the machine learning using a specified data mining algorithm. A mining model is described as a subset of columns from the structure, how those columns are to be used as attributes along with the algorithm and parameters to perform machine learning on the structure data.Statistics about predictions are available as well, Additionally the learned patterns themselves can be queried to discover what the algorithm found. These patterns are generally referred to as the model content.
  • 8. Query SyntaxDMX statements are used to create, process, delete, copy, browse, and predict against data mining models.The three basic steps for data mining process are:Creation
  • 10. TrainingCREATING MINING STRUCTURES:Creating mining structures is similar to creating tables in SQL.Syntax:CREATE [SESSION] MINING STRUCTURE <structure> ( [(<column definition list>)] )StructureA unique name for the structure.column definitionlistA comma-separated list of column definitions
  • 11. CREATING MINING STRUCTURES:The following example creates a new mining structure called New Mailing.CREATE MINING STRUCTURE [New Mailing] ( CustomerKey LONG KEY, Gender TEXT DISCRETE, [Number Cars Owned] LONG DISCRETE, [Bike Buyer] LONG DISCRETE )
  • 12. ALTERING MINING STRUCTURES:Creates a new mining model that is based on an existing mining structure. When you use the alter structure statement to create a new mining model, the structure must already exist. Syntax:ALTER MINING STRUCTURE <structure> ADD MINING MODEL <model> ( <column definition list> [(<nested column definition list>) [WITH FILTER (<nested filter criteria>)]] ) USING <algorithm> [(<parameter list>)] FILTER keyword is used to filter condition.
  • 13. ALTERING MINING STRUCTURESThe following example adds a Naive Bayes mining model to the New Mailing mining structure and limits the maximum number of attribute states to 50.ALTER MINING STRUCTURE [New Mailing] ADD MINING MODEL [Naive Bayes] ( CustomerKey, Gender, [Number Cars Owned], [Bike Buyer] PREDICT ) USING Microsoft_Naive_Bayes (MAXIMUM_STATES = 50)
  • 14. Data Types and Content typesThe following table shows the list of data types and content types for mining structure columns:Time Series models.Sequence Clustering models in nested tables.
  • 15. DROP MINING MODEL Deletes a mining model from the database.Syntax:DROP MINING MODEL <model >ModelA model identifier.Ex: The following sample code drops the mining model NBSample.DROP MINING MODEL [NBSample]
  • 16. NESTED TABLESEx: Consider the following case derived from two tables, one table that contains customer information and another table that contains customer purchases. A single customer in the customer table may have multiple purchases in the purchases table, which makes it difficult to describe the data using a single row. Analysis Services provides a unique method for handling these cases, by using nested tables. The concept of a nested table is demonstrated in the following illustration.
  • 17. The first table is the parent table has information about customers, and associates a unique identifier for each customer. The second table, the child table, contains purchases for each customer. The purchases in the child table are related back to the parent table by the unique identifier, the CustomerKey column. The third table in the diagram shows the two tables combined.
  • 18. PredictionPredictionmeansapplying the patterns that were found in the data to estimate unknown information. Examples: of prediction might be predicting if a customer will or will not be good for a loan, estimating a credit score, determining to what cluster a case belongs, or predicting future values of a time series.
  • 19. Prediction JoinUsing prediction join in this example we can come to conclusion that:‘‘if the kid is male and class is 5, then the highest scored subject is science.’’
  • 20. Prediction Join syntaxSELECT [TOP <count>] <column references> FROM <mining model> [[NATURAL] PREDICTION JOIN <source-data> [ ON <mapping clause> ] [ WHERE <condition clause> ] [ ORDER BY <order clause> [DESC | ASC] ]]Count  Optional, An integer that specifies how many rows to return.column referencesA comma-separated list of column identifiers an expressions that are derived from the mining model.mining modelA model identifier.source -dataThe source query.mapping clauseOptional, A logical expression that compares columns from the model to columns from the source query.condition clause Optional, A condition to restrict the values that are returned from the column list.order clause Optional, An expression that returns a scalar value.
  • 21. summaryHistory of DMXDMX IntroductionDMX objectsQuery SyntaxPrediction join syntax
  • 22. Visit more self help tutorialsPick a tutorial of your choice and browse through it at your own pace.The tutorials section is free, self-guiding and will not involve any additional support.Visit us at www.dataminingtools.net