SlideShare a Scribd company logo
3
Most read
4
Most read
5
Most read
OMega TechEd
9
Part-I
BUSINESS INTELLIGENCE
CLASSIFICATION ALGORITHMS
Mrs. Megha Sharma
M.Sc. Computer Science. B.Ed.
K-Nearest Neighbours
Decision trees
Naive Baye’s Classifier
Logistic Regression
Artificial Neural Network
Support Vector Machines
1. K-Nearest Neighbour
 K-Nearest algorithm classifies data sets based on their similarity with neighbours. It
works in a very simple way by taking into account the distance from known data point.
After we gather the ‘k’ neighbours , we simply take the majority and classify the
unknown data into that category.
Class A:
Class B:
Unknown data :
 When k=3
 Data point belongs to class B
2. Decision Trees
A decision tree represents classification. Decision trees predict the future based on the
previous learning and input data sets. It taken multiple input values and returns back the
probable output with the single value which is considered as a decision.
 A decision tree starts with a single node, which branches into possible outcomes. Each of
those outcomes lead to additional nodes, which branch off into other possibilities.
Algorithm:
1. Begin the tree with the root node
2. Split into subsets.
3. If subset is pure (all yes or all no) stop the process. Else go to step 1 and repeat the process
until we get pure subset.
E.g. We want to predict whether Amit will play cricket or not?
Weather : Rainy, Humidity : high, Wind : Strong.
Day Weather Humidity wind Play
1 Sunny high weak yes
2 Sunny high strong yes
3 Cloudy high weak yes
4 Cloudy high weak Yes
5 Rainy high weak Yes
6 Rainy normal weak Yes
7 Cloudy normal strong No
8 Cloudy normal weak No
9 Sunny normal normal Yes
10 rainy normal strong No
11 Cloudy high strong yes
Weather
Rainy
Cloudy Sunny
Day humidity wind play
3 high weak yes
4 high weak Yes
7 normal strong No
8 normal weak No
11 high strong yes
Day humidity wind play
1 high weak yes
2 high strong yes
9 normal normal Yes
Day humidity wind play
5 high weak Yes
6 normal weak Yes
10 normal strong No
All yes no need to divide further
Weather
Rainy
Cloudy Sunny
High
Humidity
Normal Strong
Weak
Wind
YES NO YES NO
Weather : Rainy,
Humidity : High,
Wind : Strong.
NO
Concept of Probability
Probability is a measure of the likelihood of an event to occur.
P(A) = Number of favorable outcomes to A
Total number of outcomes
Number of red balls= 6
Number of Blue balls= 4
Total no of balls= 10
P(R): The probability of selecting a red ball is = 6/10
P(B): The probability of selecting a blue ball is = 4/10
Box-A
Box-B
Conditional Probability
Box A 1 Red 4 Blue Total 5
Box B 3 Red 2 Blue Total 5
Total 4 Red 6 Blue 10
P(R) = 4/10 P (B)= 6/10
Probability of getting red ball if ball is drawn from Box A.
P(R|A) = 1/5
Similarly Probability of getting red ball if ball is drawn
compulsorily from box B.
P(R|B)= 3/5
Bayesian Methods
Thomas Bayes, a statistician had given a probabilistic theory known as Baye’s theorem
for probability , which describe the probability of an event or occurrence, based on
prior knowledge of events and expected outcome.
Baye’s Formula:
P(A|B) = P (B|A) P(A)
P(B)
• Where P(A) and P(B) , are prior probabilities or observed probabilities
independent of each other.
• P(A|B), is conditional probability (posterior probability), the likelihood of
event A occurring given that B is True.
• P(B|A), is conditional probability (posterior probability), the likelihood of
event B occurring given that A is True.
Example:
Finding out a patient probability of having cancer disease. Let say “smoking test” is the test
for diagnosing disease.
10% of Patients have cancer : P(A) = 0.1.
5% of Patient is a smoker: P(B) = 0.05.
Among those patients diagnosed with cancer , 7% are smokers i.e. P(B|A) .
Using Bayes theorem we can find the probability of having cancer if the patient is smoker.
P(A|B) = (0.07* 0.1)/ 0.05 = 0.14
If the patient is a smoker, their chances of having cancer is 14%
3. Naïve Baye’s Classifier
Naïve Baye’s classifier technique is particularly suited when the dimensionality of the
input is high. Naive Bayes classifiers are the family of simple “probabilistic classifiers”
based on applying Bayes’ theorem with strong (naïve) independence assumptions
between the features.
Total 60 objects. 40 Green and 20 Red.
Prior probability of Green=40/60
Prior probability of Red= 20/60
Continue…
Probability of X given Green = 1/40
Probability of X given Red=3/20
Posterior probability of X being Green : prior probability of Green * Likelihood of X given Green
4/6 x 1/40 =1/60
Posterior probability of X being Red : prior probability of Red * Likelihood of X given Red
2/6 x 3/20 = 1/20
Finally we classify X as ‘Red’ since its class membership
achieves the largest posterior probability.
Thanks For Watching.
Next Topic : Classification Algorithms Part-II.
About the Channel
This channel helps you to prepare for BSc IT and BSc computer science subjects.
In this channel we will learn Business Intelligence , A.I., Digital Electronics,
Internet OF Things Python programming , Data-Structure etc.
Which is useful for upcoming university exams.
Gmail: omega.teched@gmail.com
Social Media Handles:
omega.teched
megha_with
OMega TechEd

More Related Content

What's hot (20)

PPTX
Supervised and unsupervised learning
Paras Kohli
 
PDF
Introduction to Machine Learning Classifiers
Functional Imperative
 
PDF
Classification and Clustering
Eng Teong Cheah
 
PPTX
Logistic Regression | Logistic Regression In Python | Machine Learning Algori...
Simplilearn
 
PPT
Data Mining Concepts and Techniques, Chapter 10. Cluster Analysis: Basic Conc...
Salah Amean
 
PPTX
Classification Algorithm-II
Megha Sharma
 
PPTX
Backpropagation And Gradient Descent In Neural Networks | Neural Network Tuto...
Simplilearn
 
PDF
Naive Bayes
CloudxLab
 
PPTX
Data Mining: clustering and analysis
DataminingTools Inc
 
PDF
Decision tree
R A Akerkar
 
PPTX
Supervised and Unsupervised Learning In Machine Learning | Machine Learning T...
Simplilearn
 
PPTX
Kmeans
Nikita Goyal
 
PPTX
Machine Learning
Girish Khanzode
 
PPT
Machine Learning
Rahul Kumar
 
PDF
Introduction to Statistical Machine Learning
mahutte
 
PDF
Logistic regression in Machine Learning
Kuppusamy P
 
PPTX
CART – Classification & Regression Trees
Hemant Chetwani
 
PPTX
Data mining: Classification and prediction
DataminingTools Inc
 
PPTX
Naïve Bayes Classifier Algorithm.pptx
Shubham Jaybhaye
 
Supervised and unsupervised learning
Paras Kohli
 
Introduction to Machine Learning Classifiers
Functional Imperative
 
Classification and Clustering
Eng Teong Cheah
 
Logistic Regression | Logistic Regression In Python | Machine Learning Algori...
Simplilearn
 
Data Mining Concepts and Techniques, Chapter 10. Cluster Analysis: Basic Conc...
Salah Amean
 
Classification Algorithm-II
Megha Sharma
 
Backpropagation And Gradient Descent In Neural Networks | Neural Network Tuto...
Simplilearn
 
Naive Bayes
CloudxLab
 
Data Mining: clustering and analysis
DataminingTools Inc
 
Decision tree
R A Akerkar
 
Supervised and Unsupervised Learning In Machine Learning | Machine Learning T...
Simplilearn
 
Kmeans
Nikita Goyal
 
Machine Learning
Girish Khanzode
 
Machine Learning
Rahul Kumar
 
Introduction to Statistical Machine Learning
mahutte
 
Logistic regression in Machine Learning
Kuppusamy P
 
CART – Classification & Regression Trees
Hemant Chetwani
 
Data mining: Classification and prediction
DataminingTools Inc
 
Naïve Bayes Classifier Algorithm.pptx
Shubham Jaybhaye
 

Similar to Classification Algorithm. (20)

PPTX
Complements and Conditional Probability, and Bayes' Theorem
Long Beach City College
 
PPTX
2.statistical DEcision makig.pptx
ImpanaR2
 
DOCX
1 Probability Please read sections 3.1 – 3.3 in your .docx
aryan532920
 
PDF
Probability concepts for Data Analytics
SSaudia
 
PPTX
Introduction to Probability - NZ Level 2 2.12
Kate FLR
 
PPT
Lecture3
MAYANKSINGLA4395
 
PPTX
Chapter three and - Economics-introduction
ismailadan4
 
PPTX
Basic concepts of probability
Long Beach City College
 
PPTX
probability_statistics_presentation.pptx
vietnam5hayday
 
PDF
Statistics 1 (FPN) QP
Success Formula
 
PDF
Probability for engineering student module (2).pdf
WatrSat
 
PPTX
conditional probability from probability.pptx
22951a04p3
 
PPTX
Mncs 16-09-4주-변승규-introduction to the machine learning
Seung-gyu Byeon
 
PPTX
Data Science-entropy machine learning.pptx
ZainabShahzad9
 
PDF
Module 4 bayes classification
SatishH5
 
PPTX
Day 3.pptx
estelaabera
 
PPTX
Mathematics Assignment Help
Statistics Assignment Help
 
PPTX
Probability unit2.pptx
SNIGDHABADIDA2127755
 
PPTX
statistics assignment help
Statistics Homework Helper
 
PPT
9-Decision Tree Induction-23-01-2025.ppt
DarrinBright1
 
Complements and Conditional Probability, and Bayes' Theorem
Long Beach City College
 
2.statistical DEcision makig.pptx
ImpanaR2
 
1 Probability Please read sections 3.1 – 3.3 in your .docx
aryan532920
 
Probability concepts for Data Analytics
SSaudia
 
Introduction to Probability - NZ Level 2 2.12
Kate FLR
 
Chapter three and - Economics-introduction
ismailadan4
 
Basic concepts of probability
Long Beach City College
 
probability_statistics_presentation.pptx
vietnam5hayday
 
Statistics 1 (FPN) QP
Success Formula
 
Probability for engineering student module (2).pdf
WatrSat
 
conditional probability from probability.pptx
22951a04p3
 
Mncs 16-09-4주-변승규-introduction to the machine learning
Seung-gyu Byeon
 
Data Science-entropy machine learning.pptx
ZainabShahzad9
 
Module 4 bayes classification
SatishH5
 
Day 3.pptx
estelaabera
 
Mathematics Assignment Help
Statistics Assignment Help
 
Probability unit2.pptx
SNIGDHABADIDA2127755
 
statistics assignment help
Statistics Homework Helper
 
9-Decision Tree Induction-23-01-2025.ppt
DarrinBright1
 
Ad

More from Megha Sharma (20)

PPTX
Designing Printed Circuit boards, Software Choices, The Design Process
Megha Sharma
 
PPTX
Manufacturing PCB, Etching board, milling board, Third party manufacturing, a...
Megha Sharma
 
PPTX
Business Model, make thing, sell thing, subscription, customization, Key Reso...
Megha Sharma
 
PPTX
Funding an IOT startup, Venture Capital, Government funding, Crowdfunding, Le...
Megha Sharma
 
PPTX
Sketch, Iterate and Explore, Nondigital Methods.
Megha Sharma
 
PPTX
CNC Milling, Software, Repurposing and Recycling.
Megha Sharma
 
PPTX
3D printing, Types of 3D printing: FDM, Laser Sintering, Powder bed, LOM, DLP.
Megha Sharma
 
PPTX
Laser Cutting, Choosing a laser cutter, Software, Hinges and joints.
Megha Sharma
 
PPTX
Memory management, Types of memory, Making the most of your RAM.
Megha Sharma
 
PPTX
Performance and Battery Life, Libraries, Debugging.
Megha Sharma
 
PPTX
Prototyping Embedded Devices: Arduino, Developing on the Arduino.
Megha Sharma
 
PPTX
Raspberry-Pi, Developing on Raspberry Pi, Difference between Arduino & Raspbe...
Megha Sharma
 
PPTX
Open Source versus Closed Source in IOT in IOT
Megha Sharma
 
PPTX
Why closed? Why Open? Mixing open and closed source
Megha Sharma
 
PPTX
Model Performance Metrics. Accuracy, Precision, Recall
Megha Sharma
 
PPTX
Graceful Degradation and Affordance in IOT
Megha Sharma
 
PPTX
Web thinking connected device, Small Pieces Loosely joined.
Megha Sharma
 
PPTX
Production & Mass Personalization, Changing Embedded Platform, Physical proto...
Megha Sharma
 
PPTX
Whose data is it anyways? Public vs Private data collection.
Megha Sharma
 
PPTX
Thinking about Prototyping: Sketching, Familiarity, Cost versus Ease of proto...
Megha Sharma
 
Designing Printed Circuit boards, Software Choices, The Design Process
Megha Sharma
 
Manufacturing PCB, Etching board, milling board, Third party manufacturing, a...
Megha Sharma
 
Business Model, make thing, sell thing, subscription, customization, Key Reso...
Megha Sharma
 
Funding an IOT startup, Venture Capital, Government funding, Crowdfunding, Le...
Megha Sharma
 
Sketch, Iterate and Explore, Nondigital Methods.
Megha Sharma
 
CNC Milling, Software, Repurposing and Recycling.
Megha Sharma
 
3D printing, Types of 3D printing: FDM, Laser Sintering, Powder bed, LOM, DLP.
Megha Sharma
 
Laser Cutting, Choosing a laser cutter, Software, Hinges and joints.
Megha Sharma
 
Memory management, Types of memory, Making the most of your RAM.
Megha Sharma
 
Performance and Battery Life, Libraries, Debugging.
Megha Sharma
 
Prototyping Embedded Devices: Arduino, Developing on the Arduino.
Megha Sharma
 
Raspberry-Pi, Developing on Raspberry Pi, Difference between Arduino & Raspbe...
Megha Sharma
 
Open Source versus Closed Source in IOT in IOT
Megha Sharma
 
Why closed? Why Open? Mixing open and closed source
Megha Sharma
 
Model Performance Metrics. Accuracy, Precision, Recall
Megha Sharma
 
Graceful Degradation and Affordance in IOT
Megha Sharma
 
Web thinking connected device, Small Pieces Loosely joined.
Megha Sharma
 
Production & Mass Personalization, Changing Embedded Platform, Physical proto...
Megha Sharma
 
Whose data is it anyways? Public vs Private data collection.
Megha Sharma
 
Thinking about Prototyping: Sketching, Familiarity, Cost versus Ease of proto...
Megha Sharma
 
Ad

Recently uploaded (20)

PPTX
How to Setup Automatic Reordering Rule in Odoo 18 Inventory
Celine George
 
PPTX
How to Configure Taxes in Company Currency in Odoo 18 Accounting
Celine George
 
PPTX
Comparing Translational and Rotational Motion.pptx
AngeliqueTolentinoDe
 
PDF
The Power of Compound Interest (Stanford Initiative for Financial Decision-Ma...
Stanford IFDM
 
PDF
Learning Styles Inventory for Senior High School Students
Thelma Villaflores
 
PPTX
How to Configure Refusal of Applicants in Odoo 18 Recruitment
Celine George
 
PDF
Public Health For The 21st Century 1st Edition Judy Orme Jane Powell
trjnesjnqg7801
 
PDF
COM and NET Component Services 1st Edition Juval Löwy
kboqcyuw976
 
PDF
Gladiolous Cultivation practices by AKL.pdf
kushallamichhame
 
PDF
Lesson 1 - Nature of Inquiry and Research.pdf
marvinnbustamante1
 
PPTX
PLANNING A HOSPITAL AND NURSING UNIT.pptx
PRADEEP ABOTHU
 
PDF
Indian National movement PPT by Simanchala Sarab, Covering The INC(Formation,...
Simanchala Sarab, BABed(ITEP Secondary stage) in History student at GNDU Amritsar
 
PDF
Free eBook ~100 Common English Proverbs (ebook) pdf.pdf
OH TEIK BIN
 
PPTX
Natural Language processing using nltk.pptx
Ramakrishna Reddy Bijjam
 
PDF
Rapid Mathematics Assessment Score sheet for all Grade levels
DessaCletSantos
 
PDF
DIGESTION OF CARBOHYDRATES ,PROTEINS AND LIPIDS
raviralanaresh2
 
PDF
TechSoup Microsoft Copilot Nonprofit Use Cases and Live Demo - 2025.06.25.pdf
TechSoup
 
PDF
Nanotechnology and Functional Foods Effective Delivery of Bioactive Ingredien...
rmswlwcxai8321
 
PDF
Genomics Proteomics and Vaccines 1st Edition Guido Grandi (Editor)
kboqcyuw976
 
PPTX
Elo the Hero is an story about a young boy who became hero.
TeacherEmily1
 
How to Setup Automatic Reordering Rule in Odoo 18 Inventory
Celine George
 
How to Configure Taxes in Company Currency in Odoo 18 Accounting
Celine George
 
Comparing Translational and Rotational Motion.pptx
AngeliqueTolentinoDe
 
The Power of Compound Interest (Stanford Initiative for Financial Decision-Ma...
Stanford IFDM
 
Learning Styles Inventory for Senior High School Students
Thelma Villaflores
 
How to Configure Refusal of Applicants in Odoo 18 Recruitment
Celine George
 
Public Health For The 21st Century 1st Edition Judy Orme Jane Powell
trjnesjnqg7801
 
COM and NET Component Services 1st Edition Juval Löwy
kboqcyuw976
 
Gladiolous Cultivation practices by AKL.pdf
kushallamichhame
 
Lesson 1 - Nature of Inquiry and Research.pdf
marvinnbustamante1
 
PLANNING A HOSPITAL AND NURSING UNIT.pptx
PRADEEP ABOTHU
 
Indian National movement PPT by Simanchala Sarab, Covering The INC(Formation,...
Simanchala Sarab, BABed(ITEP Secondary stage) in History student at GNDU Amritsar
 
Free eBook ~100 Common English Proverbs (ebook) pdf.pdf
OH TEIK BIN
 
Natural Language processing using nltk.pptx
Ramakrishna Reddy Bijjam
 
Rapid Mathematics Assessment Score sheet for all Grade levels
DessaCletSantos
 
DIGESTION OF CARBOHYDRATES ,PROTEINS AND LIPIDS
raviralanaresh2
 
TechSoup Microsoft Copilot Nonprofit Use Cases and Live Demo - 2025.06.25.pdf
TechSoup
 
Nanotechnology and Functional Foods Effective Delivery of Bioactive Ingredien...
rmswlwcxai8321
 
Genomics Proteomics and Vaccines 1st Edition Guido Grandi (Editor)
kboqcyuw976
 
Elo the Hero is an story about a young boy who became hero.
TeacherEmily1
 

Classification Algorithm.

  • 2. BUSINESS INTELLIGENCE CLASSIFICATION ALGORITHMS Mrs. Megha Sharma M.Sc. Computer Science. B.Ed.
  • 3. K-Nearest Neighbours Decision trees Naive Baye’s Classifier Logistic Regression Artificial Neural Network Support Vector Machines
  • 4. 1. K-Nearest Neighbour  K-Nearest algorithm classifies data sets based on their similarity with neighbours. It works in a very simple way by taking into account the distance from known data point. After we gather the ‘k’ neighbours , we simply take the majority and classify the unknown data into that category. Class A: Class B: Unknown data :  When k=3  Data point belongs to class B
  • 5. 2. Decision Trees A decision tree represents classification. Decision trees predict the future based on the previous learning and input data sets. It taken multiple input values and returns back the probable output with the single value which is considered as a decision.  A decision tree starts with a single node, which branches into possible outcomes. Each of those outcomes lead to additional nodes, which branch off into other possibilities. Algorithm: 1. Begin the tree with the root node 2. Split into subsets. 3. If subset is pure (all yes or all no) stop the process. Else go to step 1 and repeat the process until we get pure subset.
  • 6. E.g. We want to predict whether Amit will play cricket or not? Weather : Rainy, Humidity : high, Wind : Strong. Day Weather Humidity wind Play 1 Sunny high weak yes 2 Sunny high strong yes 3 Cloudy high weak yes 4 Cloudy high weak Yes 5 Rainy high weak Yes 6 Rainy normal weak Yes 7 Cloudy normal strong No 8 Cloudy normal weak No 9 Sunny normal normal Yes 10 rainy normal strong No 11 Cloudy high strong yes
  • 7. Weather Rainy Cloudy Sunny Day humidity wind play 3 high weak yes 4 high weak Yes 7 normal strong No 8 normal weak No 11 high strong yes Day humidity wind play 1 high weak yes 2 high strong yes 9 normal normal Yes Day humidity wind play 5 high weak Yes 6 normal weak Yes 10 normal strong No All yes no need to divide further
  • 8. Weather Rainy Cloudy Sunny High Humidity Normal Strong Weak Wind YES NO YES NO Weather : Rainy, Humidity : High, Wind : Strong. NO
  • 9. Concept of Probability Probability is a measure of the likelihood of an event to occur. P(A) = Number of favorable outcomes to A Total number of outcomes Number of red balls= 6 Number of Blue balls= 4 Total no of balls= 10 P(R): The probability of selecting a red ball is = 6/10 P(B): The probability of selecting a blue ball is = 4/10
  • 10. Box-A Box-B Conditional Probability Box A 1 Red 4 Blue Total 5 Box B 3 Red 2 Blue Total 5 Total 4 Red 6 Blue 10 P(R) = 4/10 P (B)= 6/10 Probability of getting red ball if ball is drawn from Box A. P(R|A) = 1/5 Similarly Probability of getting red ball if ball is drawn compulsorily from box B. P(R|B)= 3/5
  • 11. Bayesian Methods Thomas Bayes, a statistician had given a probabilistic theory known as Baye’s theorem for probability , which describe the probability of an event or occurrence, based on prior knowledge of events and expected outcome. Baye’s Formula: P(A|B) = P (B|A) P(A) P(B) • Where P(A) and P(B) , are prior probabilities or observed probabilities independent of each other. • P(A|B), is conditional probability (posterior probability), the likelihood of event A occurring given that B is True. • P(B|A), is conditional probability (posterior probability), the likelihood of event B occurring given that A is True.
  • 12. Example: Finding out a patient probability of having cancer disease. Let say “smoking test” is the test for diagnosing disease. 10% of Patients have cancer : P(A) = 0.1. 5% of Patient is a smoker: P(B) = 0.05. Among those patients diagnosed with cancer , 7% are smokers i.e. P(B|A) . Using Bayes theorem we can find the probability of having cancer if the patient is smoker. P(A|B) = (0.07* 0.1)/ 0.05 = 0.14 If the patient is a smoker, their chances of having cancer is 14%
  • 13. 3. Naïve Baye’s Classifier Naïve Baye’s classifier technique is particularly suited when the dimensionality of the input is high. Naive Bayes classifiers are the family of simple “probabilistic classifiers” based on applying Bayes’ theorem with strong (naïve) independence assumptions between the features. Total 60 objects. 40 Green and 20 Red. Prior probability of Green=40/60 Prior probability of Red= 20/60
  • 14. Continue… Probability of X given Green = 1/40 Probability of X given Red=3/20 Posterior probability of X being Green : prior probability of Green * Likelihood of X given Green 4/6 x 1/40 =1/60 Posterior probability of X being Red : prior probability of Red * Likelihood of X given Red 2/6 x 3/20 = 1/20 Finally we classify X as ‘Red’ since its class membership achieves the largest posterior probability.
  • 15. Thanks For Watching. Next Topic : Classification Algorithms Part-II.
  • 16. About the Channel This channel helps you to prepare for BSc IT and BSc computer science subjects. In this channel we will learn Business Intelligence , A.I., Digital Electronics, Internet OF Things Python programming , Data-Structure etc. Which is useful for upcoming university exams. Gmail: [email protected] Social Media Handles: omega.teched megha_with OMega TechEd