SlideShare a Scribd company logo
www.edureka.co/data-scienceEdureka’s Data Science Certification Training
Decision Tree
www.edureka.co/data-scienceEdureka’s Data Science Certification Training
What Will You Learn Today?
ClassificationMachine Learning Types Of Classifiers
What Is Decision Tree? How Decision Tree Works? Demo In R: Diabetes
Prevention Use Case
1 2 3
4 65
www.edureka.co/data-scienceEdureka’s Data Science Certification Training
Machine Learning
www.edureka.co/data-scienceEdureka’s Data Science Certification Training
Introduction To Machine Learning
Machine learning is a type of artificial intelligence (AI) that provides computers with the ability to learn without
being explicitly programmed.
Training Data Learn
Algorithm
Build Model Perform
Feedback
www.edureka.co/data-scienceEdureka’s Data Science Certification Training
Machine Learning - Example
 Amazon has huge amount of consumer purchasing
data.
 The data consists of consumer demographics (age,
sex, location), purchasing history, past browsing
history.
 Based on this data, Amazon segments its
customers, draws a pattern and recommends the
right product to the right customer at the right
time.
www.edureka.co/data-scienceEdureka’s Data Science Certification Training
Classification
www.edureka.co/data-scienceEdureka’s Data Science Certification Training
Introduction To Classification
 Classification is the problem of
identifying to which set of categories a
new observation belongs.
 It is a supervised learning model as the
classifier already has a set of classified
examples and from these examples, the
classifier learns to assign unseen new
examples.
 Example: Assigning a given email
into "spam" or "non-spam" category.
Is this A or B ?
www.edureka.co/data-scienceEdureka’s Data Science Certification Training
Classification - Example
Feed the classifier with training data set and predefined labels.
It will learn to categorize particular data under a specific label.
How to train my
model to identify
spam mails from
genuine mails?
Source IP Address
Phrases in the text
Subject Line
HTML Tags
www.edureka.co/data-scienceEdureka’s Data Science Certification Training
Classification Use Cases
Banking
Remote sensing
Medicine
Banking
Identification of loan risk applicants by their
probability of defaulting payments.
Medicine
Identification of at-risk patients and disease trends.
Remote sensing
Identification of areas of similar land use in a GIS
database.
Marketing
Identifying customer churn.
Use-cases
Marketing
www.edureka.co/data-scienceEdureka’s Data Science Certification Training
Types Of Classifiers
Decision Tree
• Decision tree builds classification
models in the form of a tree
structure.
• It breaks down a dataset into
smaller and smaller subsets.
• Random Forest is an ensemble
classifier made using many
decision tree models.
• Ensemble models combine the
results from different models.
Random Forest Naïve Bayes
• It is a classification technique
based on Bayes' Theorem with an
assumption of independence
among attributes.
www.edureka.co/data-scienceEdureka’s Data Science Certification Training
What is Decision Tree?
www.edureka.co/data-scienceEdureka’s Data Science Certification Training
What Is Decision Tree?
 A decision tree uses a tree structure to specify sequences of decisions and consequences.
 A decision tree employs a structure of nodes and branches.
 The depth of a node is the minimum number of steps required to reach the node from the root.
 Eventually, a final point is reached and a prediction is made.
Gender
AgeIncome
Yes No Yes No
Root Node
Internal Node
Branch NodeDepth=1
Female Male
<=40 >40
Leaf Node
<=45000 >45000
www.edureka.co/data-scienceEdureka’s Data Science Certification Training
Use Case - Credit Risk Detection
 To minimize loss, the bank needs a
decision rule to predict whom to give
approval of the loan.
 An applicant’s demographic (income,
debts, credit history) and socio-economic
profiles are considered.
 Data science can help banks recognize
behavior patterns and provide a
complete view of individual customers.
www.edureka.co/data-scienceEdureka’s Data Science Certification Training
How Decision Tree Works?
www.edureka.co/data-scienceEdureka’s Data Science Certification Training
How Decision Tree Works?
Let’s take an example,
We have taken dataset consisting of:
• Weather information of last 14 days
• Whether match was played or not on that particular day
Now using the decision tree we need to predict whether the
game will happen if the weather condition is
Outlook = Rain
Humidity = High
Wind = Weak
Play = ?
www.edureka.co/data-scienceEdureka’s Data Science Certification Training
How Decision Tree Works?
From our data, we will choose one variable “Outlook” and will see how it affects the variable “Play”
Day Outlook Humidity Wind Play
D1 Sunny High Weak No
D2 Sunny High Strong No
D3 Overcast High Weak Yes
D4 Rain High Weak Yes
D5 Rain Normal Weak Yes
D6 Rain Normal Strong No
D7 Overcast Normal Strong Yes
D8 Sunny High Weak No
D9 Sunny Normal Weak Yes
D10 Rain Normal Weak Yes
D11 Sunny Normal Strong Yes
D12 Overcast High Strong Yes
D13 Overcast Normal Weak Yes
D14 Rain High Strong No
Outlook
Play: 9 Yes, 5 No
Sunny Overcast Rain
There are 3 types
of Outlook Here
www.edureka.co/data-scienceEdureka’s Data Science Certification Training
How Decision Tree Works?
We can further divide our data based on Outlook.
Outlook
Overcast
Sunny Rain
Day Outlook Humidity Wind
D1 Sunny High Weak
D2 Sunny High Strong
D8 Sunny High Weak
D9 Sunny Normal Weak
D11 Sunny Normal Strong
2 Yes / 3 No
Split further
Pure subset
Will play
3 Yes / 2 No
Split further
Day Outlook Humidity Wind
D4 Rain High Weak
D5 Rain Normal Weak
D6 Rain Normal Strong
D10 Rain Normal Weak
D14 Rain High Strong
We will split the data until we get pure subsets at every branch
9 Yes / 5 No
Day Outlook Humidity Wind
D3 Overcast High Weak
D7 Overcast Normal Strong
D12 Overcast High Strong
D13 Overcast Normal Weak
www.edureka.co/data-scienceEdureka’s Data Science Certification Training
How Decision Tree Works?
We will use Humidity column to split the subset “Sunny” further.
Will playWill not play
Outlook
Overcast
Sunny Rain
Humidity
NormalHigh
Day Humidity Wind
D1 High Weak
D2 High Strong
D8 High Weak
Pure subset
Day Humidity Wind
D9 Normal Weak
D11 Normal Strong
Pure subset 3 Yes / 2 No
Split further
Day Outlook Humidity Wind
D4 Rain High Weak
D5 Rain Normal Weak
D6 Rain Normal Strong
D10 Rain Normal Weak
D14 Rain High Strong
Pure subset
Will play
Day Outlook Humidity Wind
D3 Overcast High Weak
D7 Overcast Normal Strong
D12 Overcast High Strong
D13 Overcast Normal Weak
www.edureka.co/data-scienceEdureka’s Data Science Certification Training
How Decision Tree Works?
We will use Humidity column to split the subset “Sunny” further.
Will playWill not play
Outlook
Overcast
Sunny Rain
Humidity
NormalHigh Weak Strong
Wind
Will play Will not play
Day Humidity Wind
D1 High Weak
D2 High Strong
D8 High Weak
Pure subset
Day Humidity Wind
D9 Normal Weak
D11 Normal Strong
Pure subset
Day Humidity Wind
D4 High Weak
D5 Normal Weak
D10 Normal Weak
Pure subset
Day Humidity Wind
D6 Normal Strong
D14 High Strong
Pure subset
Pure subset
Will play
Day Outlook Humidity Wind
D3 Overcast High Weak
D7 Overcast Normal Strong
D12 Overcast High Strong
D13 Overcast Normal Weak
www.edureka.co/data-scienceEdureka’s Data Science Certification Training
How Decision Tree Works?
We will use Humidity column to split the subset “Sunny” further.
Will playWill not play
Outlook
Overcast
Sunny Rain
Humidity
NormalHigh Weak Strong
Wind
Will play Will not play
Will play
www.edureka.co/data-scienceEdureka’s Data Science Certification Training
How will I know which
attribute to take?
I’ll show you how
www.edureka.co/data-scienceEdureka’s Data Science Certification Training
Problem – Client Subscription
Consider the case of a bank that wants to market its products to the appropriate customers.
Given the demographics of clients and their reactions to previous campaign phone calls, the bank's goal is to predict
which clients would subscribe.
The attributes are:
• Job
• Marital status
• Education
• Housing
• Loan
• Contact
• Poutcome
www.edureka.co/data-scienceEdureka’s Data Science Certification Training
How To Choose An Attribute?
 A common way to identify the most
informative attribute is to use entropy-based
methods.
 The entropy methods select the most
informative attribute.
 Entropy (H) can be calculated as,
x = Datapoint
p(x) = Probability of x
H = Entropy of x
www.edureka.co/data-scienceEdureka’s Data Science Certification Training
How To Choose An Attribute?
Now, let’s do some mathematics on it
 Therefore, the root is only 10.55% pure on the subscribed = yes class.
 Conversely, it is 89.45% pure on the subscribed = no class.
P(subscribed=yes) = 0.1055
P(subscribed=no) = 0.8945
Hsubscribed = −0.1055·log20.1055–0.8945·log20.8945
≈ 0.4862
P(subscribed = yes) = 1-1789/2000 =10.55%
Let’s say, the overall fraction of the clients who have not subscribed to is 1,789 out
of the total population of 2,000.
www.edureka.co/data-scienceEdureka’s Data Science Certification Training
How To Choose An Attribute?
Conditional entropy is,
Hsubscribed|contact = 0.4661
Calculating conditional entropy for subscribed|contact gives us following result.
www.edureka.co/data-scienceEdureka’s Data Science Certification Training
How To Choose An Attribute?
 The information gain of an attribute A is defined as
the difference between the base entropy (HS) and
the conditional entropy of the attribute (HS|A).
 Attribute poutcome has the most information
gain and is the most informative variable.
Therefore, poutcome is chosen for the first split of
the decision tree.
InfoGainA = HS – HS|A
www.edureka.co/data-scienceEdureka’s Data Science Certification Training
How To Choose An Attribute?
Finally, we get the following decision tree
Poutcome
EducationNo
Job Yes
Root Node
Branch Node
Failure, Other,
Unknown
Secondary,
tertiary
Success
Internal Node
Primary,
Unknown
Leaf Node
Admin, blue-collar,
management,
technician
Self-employed,
student, unemployed
No Yes
www.edureka.co/data-scienceEdureka’s Data Science Certification Training
Decision Tree - Pros And Cons
www.edureka.co/data-scienceEdureka’s Data Science Certification Training
Demo
www.edureka.co/data-scienceEdureka’s Data Science Certification Training
What if we could predict the
occurrence of diabetes and
take appropriate measures
beforehand to prevent it?
Sure! Let me take you
through the steps to
predict the vulnerable
patients.
www.edureka.co/data-scienceEdureka’s Data Science Certification Training
Demo
Data Acquisition
Divide dataset
Implement model
Visualize
Model Validation
Doctor gets the following data from the medical history of the patient.
www.edureka.co/data-scienceEdureka’s Data Science Certification Training
Demo
Data Acquisition
Divide dataset
Implement model
Visualize
Model Validation
We will divide our entire dataset into two subsets as:
• Training dataset -> to train the model
• Testing dataset -> to validate and make predictions
www.edureka.co/data-scienceEdureka’s Data Science Certification Training
Demo
Data Acquisition
Divide dataset
Implement model
Visualize
Model Validation
 Here, we implement decision tree in R using following commands.
www.edureka.co/data-scienceEdureka’s Data Science Certification Training
Demo
Data Acquisition
Divide dataset
Implement model
Visualize
Model Validation
 We get the output as follows but this is not easy to understand, so let’s
visualize it for better understanding.
www.edureka.co/data-scienceEdureka’s Data Science Certification Training
Demo
Data Acquisition
Divide dataset
Implement model
Visualize
Model Validation
For plotting we can use the following commands
> plot(diabet_model,margin = 0.1)
> text(diabet_model,use.n= TRUE,pretty = TRUE,cex =0.6)
glucose_conc< 154.5
Diabetes_pedigree_fn<0.315glucose_conc< 131
blood_pressure>=72
NO
68/18 NO
12/3
YES
5/11
glucose_conc< 100.5
NO
107/3
BMI <26.35 Age >=53.5
NO
6/4
YES
9/65
NO
93/13
Age <30.5
Age >=53.5
NO
5/2
YES
13/39
NO
35/18
www.edureka.co/data-scienceEdureka’s Data Science Certification Training
Demo
Data Acquisition
Divide dataset
Implement model
Visualize
Model Validation
Now, we can use our model to predict the output of our testing dataset.
We can use the following code for predicting the output.
 pred_diabet<-predict(diabet_model,newdata = diabet_test,type ="class")
 pred_diabet
www.edureka.co/data-scienceEdureka’s Data Science Certification Training
Demo
Data Acquisition
Divide dataset
Implement model
Visualize
Model Validation
We get the following output for our testing dataset where:
“YES” means the probability of patient being vulnerable to diabetes is positive
“NO” means the probability of patient being vulnerable to diabetes is negative.
www.edureka.co/data-scienceEdureka’s Data Science Certification Training
Demo
Data Acquisition
Divide dataset
Implement model
Visualize
Model Validation
 library(caret)
 confusionMatrix(table(pred_diabet,diabet_test$is_diabetic))
We can create confusion matrix for the model using the library caret to
know how good is our model.
www.edureka.co/data-scienceEdureka’s Data Science Certification Training
Demo
Data acquisition
Divide dataset
Implement model
Visualize
Accuracy = 71.13%
The accuracy (or the overall success rate) is a metric defining the rate at
which a model has classified the records correctly. A good model should
have a high accuracy score.
Data Acquisition
Divide dataset
Implement model
Visualize
Model Validation
www.edureka.co/data-scienceEdureka’s Data Science Certification Training
Course Details
Go to www.edureka.co/data-science
Get Edureka Certified in Data Science Today!
What our learners have to say about us!
Shravan Reddy says- “I would like to recommend any one who
wants to be a Data Scientist just one place: Edureka. Explanations
are clean, clear, easy to understand. Their support team works
very well.. I took the Data Science course and I'm going to take
Machine Learning with Mahout and then Big Data and Hadoop”.
Gnana Sekhar says - “Edureka Data science course provided me a very
good mixture of theoretical and practical training. LMS pre recorded
sessions and assignments were very good as there is a lot of
information in them that will help me in my job. Edureka is my
teaching GURU now...Thanks EDUREKA.”
Balu Samaga says - “It was a great experience to undergo and get
certified in the Data Science course from Edureka. Quality of the
training materials, assignments, project, support and other
infrastructures are a top notch.”
www.edureka.co/data-scienceEdureka’s Data Science Certification Training

More Related Content

What's hot (20)

Data Science With Python | Python For Data Science | Python Data Science Cour...
Data Science With Python | Python For Data Science | Python Data Science Cour...Data Science With Python | Python For Data Science | Python Data Science Cour...
Data Science With Python | Python For Data Science | Python Data Science Cour...
Simplilearn
 
Presentation on supervised learning
Presentation on supervised learningPresentation on supervised learning
Presentation on supervised learning
Tonmoy Bhagawati
 
Hands-On Machine Learning with Scikit-Learn and TensorFlow - Chapter8
Hands-On Machine Learning with Scikit-Learn and TensorFlow - Chapter8Hands-On Machine Learning with Scikit-Learn and TensorFlow - Chapter8
Hands-On Machine Learning with Scikit-Learn and TensorFlow - Chapter8
Hakky St
 
Linear Regression Analysis | Linear Regression in Python | Machine Learning A...
Linear Regression Analysis | Linear Regression in Python | Machine Learning A...Linear Regression Analysis | Linear Regression in Python | Machine Learning A...
Linear Regression Analysis | Linear Regression in Python | Machine Learning A...
Simplilearn
 
Classification in data mining
Classification in data mining Classification in data mining
Classification in data mining
Sulman Ahmed
 
Decision Tree Algorithm With Example | Decision Tree In Machine Learning | Da...
Decision Tree Algorithm With Example | Decision Tree In Machine Learning | Da...Decision Tree Algorithm With Example | Decision Tree In Machine Learning | Da...
Decision Tree Algorithm With Example | Decision Tree In Machine Learning | Da...
Simplilearn
 
Decision Tree Learning
Decision Tree LearningDecision Tree Learning
Decision Tree Learning
Md. Ariful Hoque
 
Random Forest Algorithm - Random Forest Explained | Random Forest In Machine ...
Random Forest Algorithm - Random Forest Explained | Random Forest In Machine ...Random Forest Algorithm - Random Forest Explained | Random Forest In Machine ...
Random Forest Algorithm - Random Forest Explained | Random Forest In Machine ...
Simplilearn
 
Dimension reduction techniques[Feature Selection]
Dimension reduction techniques[Feature Selection]Dimension reduction techniques[Feature Selection]
Dimension reduction techniques[Feature Selection]
AAKANKSHA JAIN
 
Isolation Forest
Isolation ForestIsolation Forest
Isolation Forest
Konkuk University, Korea
 
Machine Learning - Ensemble Methods
Machine Learning - Ensemble MethodsMachine Learning - Ensemble Methods
Machine Learning - Ensemble Methods
Andrew Ferlitsch
 
Machine Learning Tutorial Part - 1 | Machine Learning Tutorial For Beginners ...
Machine Learning Tutorial Part - 1 | Machine Learning Tutorial For Beginners ...Machine Learning Tutorial Part - 1 | Machine Learning Tutorial For Beginners ...
Machine Learning Tutorial Part - 1 | Machine Learning Tutorial For Beginners ...
Simplilearn
 
K means Clustering Algorithm
K means Clustering AlgorithmK means Clustering Algorithm
K means Clustering Algorithm
Kasun Ranga Wijeweera
 
From decision trees to random forests
From decision trees to random forestsFrom decision trees to random forests
From decision trees to random forests
Viet-Trung TRAN
 
Presentation on K-Means Clustering
Presentation on K-Means ClusteringPresentation on K-Means Clustering
Presentation on K-Means Clustering
Pabna University of Science & Technology
 
Data science applications and usecases
Data science applications and usecasesData science applications and usecases
Data science applications and usecases
Sreenatha Reddy K R
 
Hierarchical Clustering | Hierarchical Clustering in R |Hierarchical Clusteri...
Hierarchical Clustering | Hierarchical Clustering in R |Hierarchical Clusteri...Hierarchical Clustering | Hierarchical Clustering in R |Hierarchical Clusteri...
Hierarchical Clustering | Hierarchical Clustering in R |Hierarchical Clusteri...
Simplilearn
 
Decision tree
Decision treeDecision tree
Decision tree
ShraddhaPandey45
 
Dbscan algorithom
Dbscan algorithomDbscan algorithom
Dbscan algorithom
Mahbubur Rahman Shimul
 
Building a performing Machine Learning model from A to Z
Building a performing Machine Learning model from A to ZBuilding a performing Machine Learning model from A to Z
Building a performing Machine Learning model from A to Z
Charles Vestur
 
Data Science With Python | Python For Data Science | Python Data Science Cour...
Data Science With Python | Python For Data Science | Python Data Science Cour...Data Science With Python | Python For Data Science | Python Data Science Cour...
Data Science With Python | Python For Data Science | Python Data Science Cour...
Simplilearn
 
Presentation on supervised learning
Presentation on supervised learningPresentation on supervised learning
Presentation on supervised learning
Tonmoy Bhagawati
 
Hands-On Machine Learning with Scikit-Learn and TensorFlow - Chapter8
Hands-On Machine Learning with Scikit-Learn and TensorFlow - Chapter8Hands-On Machine Learning with Scikit-Learn and TensorFlow - Chapter8
Hands-On Machine Learning with Scikit-Learn and TensorFlow - Chapter8
Hakky St
 
Linear Regression Analysis | Linear Regression in Python | Machine Learning A...
Linear Regression Analysis | Linear Regression in Python | Machine Learning A...Linear Regression Analysis | Linear Regression in Python | Machine Learning A...
Linear Regression Analysis | Linear Regression in Python | Machine Learning A...
Simplilearn
 
Classification in data mining
Classification in data mining Classification in data mining
Classification in data mining
Sulman Ahmed
 
Decision Tree Algorithm With Example | Decision Tree In Machine Learning | Da...
Decision Tree Algorithm With Example | Decision Tree In Machine Learning | Da...Decision Tree Algorithm With Example | Decision Tree In Machine Learning | Da...
Decision Tree Algorithm With Example | Decision Tree In Machine Learning | Da...
Simplilearn
 
Random Forest Algorithm - Random Forest Explained | Random Forest In Machine ...
Random Forest Algorithm - Random Forest Explained | Random Forest In Machine ...Random Forest Algorithm - Random Forest Explained | Random Forest In Machine ...
Random Forest Algorithm - Random Forest Explained | Random Forest In Machine ...
Simplilearn
 
Dimension reduction techniques[Feature Selection]
Dimension reduction techniques[Feature Selection]Dimension reduction techniques[Feature Selection]
Dimension reduction techniques[Feature Selection]
AAKANKSHA JAIN
 
Machine Learning - Ensemble Methods
Machine Learning - Ensemble MethodsMachine Learning - Ensemble Methods
Machine Learning - Ensemble Methods
Andrew Ferlitsch
 
Machine Learning Tutorial Part - 1 | Machine Learning Tutorial For Beginners ...
Machine Learning Tutorial Part - 1 | Machine Learning Tutorial For Beginners ...Machine Learning Tutorial Part - 1 | Machine Learning Tutorial For Beginners ...
Machine Learning Tutorial Part - 1 | Machine Learning Tutorial For Beginners ...
Simplilearn
 
From decision trees to random forests
From decision trees to random forestsFrom decision trees to random forests
From decision trees to random forests
Viet-Trung TRAN
 
Data science applications and usecases
Data science applications and usecasesData science applications and usecases
Data science applications and usecases
Sreenatha Reddy K R
 
Hierarchical Clustering | Hierarchical Clustering in R |Hierarchical Clusteri...
Hierarchical Clustering | Hierarchical Clustering in R |Hierarchical Clusteri...Hierarchical Clustering | Hierarchical Clustering in R |Hierarchical Clusteri...
Hierarchical Clustering | Hierarchical Clustering in R |Hierarchical Clusteri...
Simplilearn
 
Building a performing Machine Learning model from A to Z
Building a performing Machine Learning model from A to ZBuilding a performing Machine Learning model from A to Z
Building a performing Machine Learning model from A to Z
Charles Vestur
 

Similar to Decision Tree Algorithm & Analysis | Machine Learning Algorithm | Data Science Training | Edureka (20)

Lect9 Decision tree
Lect9 Decision treeLect9 Decision tree
Lect9 Decision tree
hktripathy
 
Top 5 algorithms used in Data Science
Top 5 algorithms used in Data ScienceTop 5 algorithms used in Data Science
Top 5 algorithms used in Data Science
Edureka!
 
classification in data warehouse and mining
classification in data warehouse and miningclassification in data warehouse and mining
classification in data warehouse and mining
anjanasharma77573
 
supervised machine learning algorithms support vector machine
supervised machine learning algorithms support vector machinesupervised machine learning algorithms support vector machine
supervised machine learning algorithms support vector machine
pranalisonawane8600
 
A Survey Ondecision Tree Learning Algorithms for Knowledge Discovery
A Survey Ondecision Tree Learning Algorithms for Knowledge DiscoveryA Survey Ondecision Tree Learning Algorithms for Knowledge Discovery
A Survey Ondecision Tree Learning Algorithms for Knowledge Discovery
IJERA Editor
 
Lecture 5 Decision tree.pdf
Lecture 5 Decision tree.pdfLecture 5 Decision tree.pdf
Lecture 5 Decision tree.pdf
ssuser4c50a9
 
Classification Using Decision tree
Classification Using Decision treeClassification Using Decision tree
Classification Using Decision tree
Mohd. Noor Abdul Hamid
 
decision tree.pdf
decision tree.pdfdecision tree.pdf
decision tree.pdf
DivitGoyal2
 
Chapter4-ML.pptx slide for concept of mechanic learning
Chapter4-ML.pptx slide  for concept of mechanic learningChapter4-ML.pptx slide  for concept of mechanic learning
Chapter4-ML.pptx slide for concept of mechanic learning
Hina636704
 
BIM Data Mining Unit3 by Tekendra Nath Yogi
 BIM Data Mining Unit3 by Tekendra Nath Yogi BIM Data Mining Unit3 by Tekendra Nath Yogi
BIM Data Mining Unit3 by Tekendra Nath Yogi
Tekendra Nath Yogi
 
decisiontree-110906040745-phpapp01.pptx
decisiontree-110906040745-phpapp01.pptxdecisiontree-110906040745-phpapp01.pptx
decisiontree-110906040745-phpapp01.pptx
ABINASHPADHY6
 
Classification techniques in data mining
Classification techniques in data miningClassification techniques in data mining
Classification techniques in data mining
Kamal Acharya
 
Business Analytics using R.ppt
Business Analytics using R.pptBusiness Analytics using R.ppt
Business Analytics using R.ppt
Rohit Raj
 
Classification Using Decision Trees and RulesChapter 5.docx
Classification Using Decision Trees and RulesChapter 5.docxClassification Using Decision Trees and RulesChapter 5.docx
Classification Using Decision Trees and RulesChapter 5.docx
monicafrancis71118
 
DecisionTree.pptx for btech cse student
DecisionTree.pptx for  btech cse studentDecisionTree.pptx for  btech cse student
DecisionTree.pptx for btech cse student
MaushamkumarRay
 
Random Forest Tutorial | Random Forest in R | Machine Learning | Data Science...
Random Forest Tutorial | Random Forest in R | Machine Learning | Data Science...Random Forest Tutorial | Random Forest in R | Machine Learning | Data Science...
Random Forest Tutorial | Random Forest in R | Machine Learning | Data Science...
Edureka!
 
ai4.ppt
ai4.pptai4.ppt
ai4.ppt
akshatsharma823122
 
Introduction to ML and Decision Tree
Introduction to ML and Decision TreeIntroduction to ML and Decision Tree
Introduction to ML and Decision Tree
Suman Debnath
 
ai4.ppt
ai4.pptai4.ppt
ai4.ppt
atul404633
 
ai4.ppt
ai4.pptai4.ppt
ai4.ppt
ssuser448ad3
 
Lect9 Decision tree
Lect9 Decision treeLect9 Decision tree
Lect9 Decision tree
hktripathy
 
Top 5 algorithms used in Data Science
Top 5 algorithms used in Data ScienceTop 5 algorithms used in Data Science
Top 5 algorithms used in Data Science
Edureka!
 
classification in data warehouse and mining
classification in data warehouse and miningclassification in data warehouse and mining
classification in data warehouse and mining
anjanasharma77573
 
supervised machine learning algorithms support vector machine
supervised machine learning algorithms support vector machinesupervised machine learning algorithms support vector machine
supervised machine learning algorithms support vector machine
pranalisonawane8600
 
A Survey Ondecision Tree Learning Algorithms for Knowledge Discovery
A Survey Ondecision Tree Learning Algorithms for Knowledge DiscoveryA Survey Ondecision Tree Learning Algorithms for Knowledge Discovery
A Survey Ondecision Tree Learning Algorithms for Knowledge Discovery
IJERA Editor
 
Lecture 5 Decision tree.pdf
Lecture 5 Decision tree.pdfLecture 5 Decision tree.pdf
Lecture 5 Decision tree.pdf
ssuser4c50a9
 
decision tree.pdf
decision tree.pdfdecision tree.pdf
decision tree.pdf
DivitGoyal2
 
Chapter4-ML.pptx slide for concept of mechanic learning
Chapter4-ML.pptx slide  for concept of mechanic learningChapter4-ML.pptx slide  for concept of mechanic learning
Chapter4-ML.pptx slide for concept of mechanic learning
Hina636704
 
BIM Data Mining Unit3 by Tekendra Nath Yogi
 BIM Data Mining Unit3 by Tekendra Nath Yogi BIM Data Mining Unit3 by Tekendra Nath Yogi
BIM Data Mining Unit3 by Tekendra Nath Yogi
Tekendra Nath Yogi
 
decisiontree-110906040745-phpapp01.pptx
decisiontree-110906040745-phpapp01.pptxdecisiontree-110906040745-phpapp01.pptx
decisiontree-110906040745-phpapp01.pptx
ABINASHPADHY6
 
Classification techniques in data mining
Classification techniques in data miningClassification techniques in data mining
Classification techniques in data mining
Kamal Acharya
 
Business Analytics using R.ppt
Business Analytics using R.pptBusiness Analytics using R.ppt
Business Analytics using R.ppt
Rohit Raj
 
Classification Using Decision Trees and RulesChapter 5.docx
Classification Using Decision Trees and RulesChapter 5.docxClassification Using Decision Trees and RulesChapter 5.docx
Classification Using Decision Trees and RulesChapter 5.docx
monicafrancis71118
 
DecisionTree.pptx for btech cse student
DecisionTree.pptx for  btech cse studentDecisionTree.pptx for  btech cse student
DecisionTree.pptx for btech cse student
MaushamkumarRay
 
Random Forest Tutorial | Random Forest in R | Machine Learning | Data Science...
Random Forest Tutorial | Random Forest in R | Machine Learning | Data Science...Random Forest Tutorial | Random Forest in R | Machine Learning | Data Science...
Random Forest Tutorial | Random Forest in R | Machine Learning | Data Science...
Edureka!
 
Introduction to ML and Decision Tree
Introduction to ML and Decision TreeIntroduction to ML and Decision Tree
Introduction to ML and Decision Tree
Suman Debnath
 
Ad

More from Edureka! (20)

What to learn during the 21 days Lockdown | Edureka
What to learn during the 21 days Lockdown | EdurekaWhat to learn during the 21 days Lockdown | Edureka
What to learn during the 21 days Lockdown | Edureka
Edureka!
 
Top 10 Dying Programming Languages in 2020 | Edureka
Top 10 Dying Programming Languages in 2020 | EdurekaTop 10 Dying Programming Languages in 2020 | Edureka
Top 10 Dying Programming Languages in 2020 | Edureka
Edureka!
 
Top 5 Trending Business Intelligence Tools | Edureka
Top 5 Trending Business Intelligence Tools | EdurekaTop 5 Trending Business Intelligence Tools | Edureka
Top 5 Trending Business Intelligence Tools | Edureka
Edureka!
 
Tableau Tutorial for Data Science | Edureka
Tableau Tutorial for Data Science | EdurekaTableau Tutorial for Data Science | Edureka
Tableau Tutorial for Data Science | Edureka
Edureka!
 
Python Programming Tutorial | Edureka
Python Programming Tutorial | EdurekaPython Programming Tutorial | Edureka
Python Programming Tutorial | Edureka
Edureka!
 
Top 5 PMP Certifications | Edureka
Top 5 PMP Certifications | EdurekaTop 5 PMP Certifications | Edureka
Top 5 PMP Certifications | Edureka
Edureka!
 
Top Maven Interview Questions in 2020 | Edureka
Top Maven Interview Questions in 2020 | EdurekaTop Maven Interview Questions in 2020 | Edureka
Top Maven Interview Questions in 2020 | Edureka
Edureka!
 
Linux Mint Tutorial | Edureka
Linux Mint Tutorial | EdurekaLinux Mint Tutorial | Edureka
Linux Mint Tutorial | Edureka
Edureka!
 
How to Deploy Java Web App in AWS| Edureka
How to Deploy Java Web App in AWS| EdurekaHow to Deploy Java Web App in AWS| Edureka
How to Deploy Java Web App in AWS| Edureka
Edureka!
 
Importance of Digital Marketing | Edureka
Importance of Digital Marketing | EdurekaImportance of Digital Marketing | Edureka
Importance of Digital Marketing | Edureka
Edureka!
 
RPA in 2020 | Edureka
RPA in 2020 | EdurekaRPA in 2020 | Edureka
RPA in 2020 | Edureka
Edureka!
 
Email Notifications in Jenkins | Edureka
Email Notifications in Jenkins | EdurekaEmail Notifications in Jenkins | Edureka
Email Notifications in Jenkins | Edureka
Edureka!
 
EA Algorithm in Machine Learning | Edureka
EA Algorithm in Machine Learning | EdurekaEA Algorithm in Machine Learning | Edureka
EA Algorithm in Machine Learning | Edureka
Edureka!
 
Cognitive AI Tutorial | Edureka
Cognitive AI Tutorial | EdurekaCognitive AI Tutorial | Edureka
Cognitive AI Tutorial | Edureka
Edureka!
 
AWS Cloud Practitioner Tutorial | Edureka
AWS Cloud Practitioner Tutorial | EdurekaAWS Cloud Practitioner Tutorial | Edureka
AWS Cloud Practitioner Tutorial | Edureka
Edureka!
 
Blue Prism Top Interview Questions | Edureka
Blue Prism Top Interview Questions | EdurekaBlue Prism Top Interview Questions | Edureka
Blue Prism Top Interview Questions | Edureka
Edureka!
 
Big Data on AWS Tutorial | Edureka
Big Data on AWS Tutorial | Edureka Big Data on AWS Tutorial | Edureka
Big Data on AWS Tutorial | Edureka
Edureka!
 
A star algorithm | A* Algorithm in Artificial Intelligence | Edureka
A star algorithm | A* Algorithm in Artificial Intelligence | EdurekaA star algorithm | A* Algorithm in Artificial Intelligence | Edureka
A star algorithm | A* Algorithm in Artificial Intelligence | Edureka
Edureka!
 
Kubernetes Installation on Ubuntu | Edureka
Kubernetes Installation on Ubuntu | EdurekaKubernetes Installation on Ubuntu | Edureka
Kubernetes Installation on Ubuntu | Edureka
Edureka!
 
Introduction to DevOps | Edureka
Introduction to DevOps | EdurekaIntroduction to DevOps | Edureka
Introduction to DevOps | Edureka
Edureka!
 
What to learn during the 21 days Lockdown | Edureka
What to learn during the 21 days Lockdown | EdurekaWhat to learn during the 21 days Lockdown | Edureka
What to learn during the 21 days Lockdown | Edureka
Edureka!
 
Top 10 Dying Programming Languages in 2020 | Edureka
Top 10 Dying Programming Languages in 2020 | EdurekaTop 10 Dying Programming Languages in 2020 | Edureka
Top 10 Dying Programming Languages in 2020 | Edureka
Edureka!
 
Top 5 Trending Business Intelligence Tools | Edureka
Top 5 Trending Business Intelligence Tools | EdurekaTop 5 Trending Business Intelligence Tools | Edureka
Top 5 Trending Business Intelligence Tools | Edureka
Edureka!
 
Tableau Tutorial for Data Science | Edureka
Tableau Tutorial for Data Science | EdurekaTableau Tutorial for Data Science | Edureka
Tableau Tutorial for Data Science | Edureka
Edureka!
 
Python Programming Tutorial | Edureka
Python Programming Tutorial | EdurekaPython Programming Tutorial | Edureka
Python Programming Tutorial | Edureka
Edureka!
 
Top 5 PMP Certifications | Edureka
Top 5 PMP Certifications | EdurekaTop 5 PMP Certifications | Edureka
Top 5 PMP Certifications | Edureka
Edureka!
 
Top Maven Interview Questions in 2020 | Edureka
Top Maven Interview Questions in 2020 | EdurekaTop Maven Interview Questions in 2020 | Edureka
Top Maven Interview Questions in 2020 | Edureka
Edureka!
 
Linux Mint Tutorial | Edureka
Linux Mint Tutorial | EdurekaLinux Mint Tutorial | Edureka
Linux Mint Tutorial | Edureka
Edureka!
 
How to Deploy Java Web App in AWS| Edureka
How to Deploy Java Web App in AWS| EdurekaHow to Deploy Java Web App in AWS| Edureka
How to Deploy Java Web App in AWS| Edureka
Edureka!
 
Importance of Digital Marketing | Edureka
Importance of Digital Marketing | EdurekaImportance of Digital Marketing | Edureka
Importance of Digital Marketing | Edureka
Edureka!
 
RPA in 2020 | Edureka
RPA in 2020 | EdurekaRPA in 2020 | Edureka
RPA in 2020 | Edureka
Edureka!
 
Email Notifications in Jenkins | Edureka
Email Notifications in Jenkins | EdurekaEmail Notifications in Jenkins | Edureka
Email Notifications in Jenkins | Edureka
Edureka!
 
EA Algorithm in Machine Learning | Edureka
EA Algorithm in Machine Learning | EdurekaEA Algorithm in Machine Learning | Edureka
EA Algorithm in Machine Learning | Edureka
Edureka!
 
Cognitive AI Tutorial | Edureka
Cognitive AI Tutorial | EdurekaCognitive AI Tutorial | Edureka
Cognitive AI Tutorial | Edureka
Edureka!
 
AWS Cloud Practitioner Tutorial | Edureka
AWS Cloud Practitioner Tutorial | EdurekaAWS Cloud Practitioner Tutorial | Edureka
AWS Cloud Practitioner Tutorial | Edureka
Edureka!
 
Blue Prism Top Interview Questions | Edureka
Blue Prism Top Interview Questions | EdurekaBlue Prism Top Interview Questions | Edureka
Blue Prism Top Interview Questions | Edureka
Edureka!
 
Big Data on AWS Tutorial | Edureka
Big Data on AWS Tutorial | Edureka Big Data on AWS Tutorial | Edureka
Big Data on AWS Tutorial | Edureka
Edureka!
 
A star algorithm | A* Algorithm in Artificial Intelligence | Edureka
A star algorithm | A* Algorithm in Artificial Intelligence | EdurekaA star algorithm | A* Algorithm in Artificial Intelligence | Edureka
A star algorithm | A* Algorithm in Artificial Intelligence | Edureka
Edureka!
 
Kubernetes Installation on Ubuntu | Edureka
Kubernetes Installation on Ubuntu | EdurekaKubernetes Installation on Ubuntu | Edureka
Kubernetes Installation on Ubuntu | Edureka
Edureka!
 
Introduction to DevOps | Edureka
Introduction to DevOps | EdurekaIntroduction to DevOps | Edureka
Introduction to DevOps | Edureka
Edureka!
 
Ad

Recently uploaded (20)

Ch01_Introduction_to_Information_Securit
Ch01_Introduction_to_Information_SecuritCh01_Introduction_to_Information_Securit
Ch01_Introduction_to_Information_Securit
KawukiDerrick
 
apidays New York 2025 - Fast, Repeatable, Secure: Pick 3 with FINOS CCC by Le...
apidays New York 2025 - Fast, Repeatable, Secure: Pick 3 with FINOS CCC by Le...apidays New York 2025 - Fast, Repeatable, Secure: Pick 3 with FINOS CCC by Le...
apidays New York 2025 - Fast, Repeatable, Secure: Pick 3 with FINOS CCC by Le...
apidays
 
apidays New York 2025 - Spring Modulith Design for Microservices by Renjith R...
apidays New York 2025 - Spring Modulith Design for Microservices by Renjith R...apidays New York 2025 - Spring Modulith Design for Microservices by Renjith R...
apidays New York 2025 - Spring Modulith Design for Microservices by Renjith R...
apidays
 
apidays New York 2025 - Life is But a (Data) Stream by Sandon Jacobs (Confluent)
apidays New York 2025 - Life is But a (Data) Stream by Sandon Jacobs (Confluent)apidays New York 2025 - Life is But a (Data) Stream by Sandon Jacobs (Confluent)
apidays New York 2025 - Life is But a (Data) Stream by Sandon Jacobs (Confluent)
apidays
 
apidays New York 2025 - Breaking Barriers: Lessons Learned from API Integrati...
apidays New York 2025 - Breaking Barriers: Lessons Learned from API Integrati...apidays New York 2025 - Breaking Barriers: Lessons Learned from API Integrati...
apidays New York 2025 - Breaking Barriers: Lessons Learned from API Integrati...
apidays
 
apidays New York 2025 - Unifying OpenAPI & AsyncAPI by Naresh Jain & Hari Kri...
apidays New York 2025 - Unifying OpenAPI & AsyncAPI by Naresh Jain & Hari Kri...apidays New York 2025 - Unifying OpenAPI & AsyncAPI by Naresh Jain & Hari Kri...
apidays New York 2025 - Unifying OpenAPI & AsyncAPI by Naresh Jain & Hari Kri...
apidays
 
THE FRIEDMAN TEST ( Biostatics B. Pharm)
THE FRIEDMAN TEST ( Biostatics B. Pharm)THE FRIEDMAN TEST ( Biostatics B. Pharm)
THE FRIEDMAN TEST ( Biostatics B. Pharm)
JishuHaldar
 
apidays New York 2025 - Building Scalable AI Systems by Sai Prasad Veluru (Ap...
apidays New York 2025 - Building Scalable AI Systems by Sai Prasad Veluru (Ap...apidays New York 2025 - Building Scalable AI Systems by Sai Prasad Veluru (Ap...
apidays New York 2025 - Building Scalable AI Systems by Sai Prasad Veluru (Ap...
apidays
 
apidays New York 2025 - The Challenge is Not the Pattern, But the Best Integr...
apidays New York 2025 - The Challenge is Not the Pattern, But the Best Integr...apidays New York 2025 - The Challenge is Not the Pattern, But the Best Integr...
apidays New York 2025 - The Challenge is Not the Pattern, But the Best Integr...
apidays
 
Report_Government Authorities_Index_ENG_FIN.pdf
Report_Government Authorities_Index_ENG_FIN.pdfReport_Government Authorities_Index_ENG_FIN.pdf
Report_Government Authorities_Index_ENG_FIN.pdf
OlhaTatokhina1
 
apidays New York 2025 - Two tales of API Change Management by Eric Koleda (Coda)
apidays New York 2025 - Two tales of API Change Management by Eric Koleda (Coda)apidays New York 2025 - Two tales of API Change Management by Eric Koleda (Coda)
apidays New York 2025 - Two tales of API Change Management by Eric Koleda (Coda)
apidays
 
BE PROGRAMjwjwjwjsjsjsjsME TEMPLATE.pptx
BE PROGRAMjwjwjwjsjsjsjsME TEMPLATE.pptxBE PROGRAMjwjwjwjsjsjsjsME TEMPLATE.pptx
BE PROGRAMjwjwjwjsjsjsjsME TEMPLATE.pptx
AaronBaluyut
 
METHODS OF DATA COLLECTION (Research methodology)
METHODS OF DATA COLLECTION (Research methodology)METHODS OF DATA COLLECTION (Research methodology)
METHODS OF DATA COLLECTION (Research methodology)
anwesha248
 
SAP_S4HANA_EWM_Food_Processing_Industry.pptx
SAP_S4HANA_EWM_Food_Processing_Industry.pptxSAP_S4HANA_EWM_Food_Processing_Industry.pptx
SAP_S4HANA_EWM_Food_Processing_Industry.pptx
vemulavenu484
 
2.5-DESPATCH-ORDINARY MAILS.pptxlminub7b7t6f7h7t6f6g7g6fg
2.5-DESPATCH-ORDINARY MAILS.pptxlminub7b7t6f7h7t6f6g7g6fg2.5-DESPATCH-ORDINARY MAILS.pptxlminub7b7t6f7h7t6f6g7g6fg
2.5-DESPATCH-ORDINARY MAILS.pptxlminub7b7t6f7h7t6f6g7g6fg
mk1227103
 
1-2. Lab Introduction to Linux environment.ppt
1-2. Lab Introduction to Linux environment.ppt1-2. Lab Introduction to Linux environment.ppt
1-2. Lab Introduction to Linux environment.ppt
Wahajch
 
Pause Travail 22 Hostiou Girard 12 juin 2025.pdf
Pause Travail 22 Hostiou Girard 12 juin 2025.pdfPause Travail 22 Hostiou Girard 12 juin 2025.pdf
Pause Travail 22 Hostiou Girard 12 juin 2025.pdf
Institut de l'Elevage - Idele
 
Part Departement Head Presentation for Business
Part Departement Head Presentation for BusinessPart Departement Head Presentation for Business
Part Departement Head Presentation for Business
Rizki229625
 
apidays New York 2025 - Computers are still dumb by Ben Morss (DeepL)
apidays New York 2025 - Computers are still dumb by Ben Morss (DeepL)apidays New York 2025 - Computers are still dumb by Ben Morss (DeepL)
apidays New York 2025 - Computers are still dumb by Ben Morss (DeepL)
apidays
 
MEDIA_LITERACY_INDEX_OF_EDUCATORS_ENG.pdf
MEDIA_LITERACY_INDEX_OF_EDUCATORS_ENG.pdfMEDIA_LITERACY_INDEX_OF_EDUCATORS_ENG.pdf
MEDIA_LITERACY_INDEX_OF_EDUCATORS_ENG.pdf
OlhaTatokhina1
 
Ch01_Introduction_to_Information_Securit
Ch01_Introduction_to_Information_SecuritCh01_Introduction_to_Information_Securit
Ch01_Introduction_to_Information_Securit
KawukiDerrick
 
apidays New York 2025 - Fast, Repeatable, Secure: Pick 3 with FINOS CCC by Le...
apidays New York 2025 - Fast, Repeatable, Secure: Pick 3 with FINOS CCC by Le...apidays New York 2025 - Fast, Repeatable, Secure: Pick 3 with FINOS CCC by Le...
apidays New York 2025 - Fast, Repeatable, Secure: Pick 3 with FINOS CCC by Le...
apidays
 
apidays New York 2025 - Spring Modulith Design for Microservices by Renjith R...
apidays New York 2025 - Spring Modulith Design for Microservices by Renjith R...apidays New York 2025 - Spring Modulith Design for Microservices by Renjith R...
apidays New York 2025 - Spring Modulith Design for Microservices by Renjith R...
apidays
 
apidays New York 2025 - Life is But a (Data) Stream by Sandon Jacobs (Confluent)
apidays New York 2025 - Life is But a (Data) Stream by Sandon Jacobs (Confluent)apidays New York 2025 - Life is But a (Data) Stream by Sandon Jacobs (Confluent)
apidays New York 2025 - Life is But a (Data) Stream by Sandon Jacobs (Confluent)
apidays
 
apidays New York 2025 - Breaking Barriers: Lessons Learned from API Integrati...
apidays New York 2025 - Breaking Barriers: Lessons Learned from API Integrati...apidays New York 2025 - Breaking Barriers: Lessons Learned from API Integrati...
apidays New York 2025 - Breaking Barriers: Lessons Learned from API Integrati...
apidays
 
apidays New York 2025 - Unifying OpenAPI & AsyncAPI by Naresh Jain & Hari Kri...
apidays New York 2025 - Unifying OpenAPI & AsyncAPI by Naresh Jain & Hari Kri...apidays New York 2025 - Unifying OpenAPI & AsyncAPI by Naresh Jain & Hari Kri...
apidays New York 2025 - Unifying OpenAPI & AsyncAPI by Naresh Jain & Hari Kri...
apidays
 
THE FRIEDMAN TEST ( Biostatics B. Pharm)
THE FRIEDMAN TEST ( Biostatics B. Pharm)THE FRIEDMAN TEST ( Biostatics B. Pharm)
THE FRIEDMAN TEST ( Biostatics B. Pharm)
JishuHaldar
 
apidays New York 2025 - Building Scalable AI Systems by Sai Prasad Veluru (Ap...
apidays New York 2025 - Building Scalable AI Systems by Sai Prasad Veluru (Ap...apidays New York 2025 - Building Scalable AI Systems by Sai Prasad Veluru (Ap...
apidays New York 2025 - Building Scalable AI Systems by Sai Prasad Veluru (Ap...
apidays
 
apidays New York 2025 - The Challenge is Not the Pattern, But the Best Integr...
apidays New York 2025 - The Challenge is Not the Pattern, But the Best Integr...apidays New York 2025 - The Challenge is Not the Pattern, But the Best Integr...
apidays New York 2025 - The Challenge is Not the Pattern, But the Best Integr...
apidays
 
Report_Government Authorities_Index_ENG_FIN.pdf
Report_Government Authorities_Index_ENG_FIN.pdfReport_Government Authorities_Index_ENG_FIN.pdf
Report_Government Authorities_Index_ENG_FIN.pdf
OlhaTatokhina1
 
apidays New York 2025 - Two tales of API Change Management by Eric Koleda (Coda)
apidays New York 2025 - Two tales of API Change Management by Eric Koleda (Coda)apidays New York 2025 - Two tales of API Change Management by Eric Koleda (Coda)
apidays New York 2025 - Two tales of API Change Management by Eric Koleda (Coda)
apidays
 
BE PROGRAMjwjwjwjsjsjsjsME TEMPLATE.pptx
BE PROGRAMjwjwjwjsjsjsjsME TEMPLATE.pptxBE PROGRAMjwjwjwjsjsjsjsME TEMPLATE.pptx
BE PROGRAMjwjwjwjsjsjsjsME TEMPLATE.pptx
AaronBaluyut
 
METHODS OF DATA COLLECTION (Research methodology)
METHODS OF DATA COLLECTION (Research methodology)METHODS OF DATA COLLECTION (Research methodology)
METHODS OF DATA COLLECTION (Research methodology)
anwesha248
 
SAP_S4HANA_EWM_Food_Processing_Industry.pptx
SAP_S4HANA_EWM_Food_Processing_Industry.pptxSAP_S4HANA_EWM_Food_Processing_Industry.pptx
SAP_S4HANA_EWM_Food_Processing_Industry.pptx
vemulavenu484
 
2.5-DESPATCH-ORDINARY MAILS.pptxlminub7b7t6f7h7t6f6g7g6fg
2.5-DESPATCH-ORDINARY MAILS.pptxlminub7b7t6f7h7t6f6g7g6fg2.5-DESPATCH-ORDINARY MAILS.pptxlminub7b7t6f7h7t6f6g7g6fg
2.5-DESPATCH-ORDINARY MAILS.pptxlminub7b7t6f7h7t6f6g7g6fg
mk1227103
 
1-2. Lab Introduction to Linux environment.ppt
1-2. Lab Introduction to Linux environment.ppt1-2. Lab Introduction to Linux environment.ppt
1-2. Lab Introduction to Linux environment.ppt
Wahajch
 
Part Departement Head Presentation for Business
Part Departement Head Presentation for BusinessPart Departement Head Presentation for Business
Part Departement Head Presentation for Business
Rizki229625
 
apidays New York 2025 - Computers are still dumb by Ben Morss (DeepL)
apidays New York 2025 - Computers are still dumb by Ben Morss (DeepL)apidays New York 2025 - Computers are still dumb by Ben Morss (DeepL)
apidays New York 2025 - Computers are still dumb by Ben Morss (DeepL)
apidays
 
MEDIA_LITERACY_INDEX_OF_EDUCATORS_ENG.pdf
MEDIA_LITERACY_INDEX_OF_EDUCATORS_ENG.pdfMEDIA_LITERACY_INDEX_OF_EDUCATORS_ENG.pdf
MEDIA_LITERACY_INDEX_OF_EDUCATORS_ENG.pdf
OlhaTatokhina1
 

Decision Tree Algorithm & Analysis | Machine Learning Algorithm | Data Science Training | Edureka

  • 1. www.edureka.co/data-scienceEdureka’s Data Science Certification Training Decision Tree
  • 2. www.edureka.co/data-scienceEdureka’s Data Science Certification Training What Will You Learn Today? ClassificationMachine Learning Types Of Classifiers What Is Decision Tree? How Decision Tree Works? Demo In R: Diabetes Prevention Use Case 1 2 3 4 65
  • 3. www.edureka.co/data-scienceEdureka’s Data Science Certification Training Machine Learning
  • 4. www.edureka.co/data-scienceEdureka’s Data Science Certification Training Introduction To Machine Learning Machine learning is a type of artificial intelligence (AI) that provides computers with the ability to learn without being explicitly programmed. Training Data Learn Algorithm Build Model Perform Feedback
  • 5. www.edureka.co/data-scienceEdureka’s Data Science Certification Training Machine Learning - Example  Amazon has huge amount of consumer purchasing data.  The data consists of consumer demographics (age, sex, location), purchasing history, past browsing history.  Based on this data, Amazon segments its customers, draws a pattern and recommends the right product to the right customer at the right time.
  • 6. www.edureka.co/data-scienceEdureka’s Data Science Certification Training Classification
  • 7. www.edureka.co/data-scienceEdureka’s Data Science Certification Training Introduction To Classification  Classification is the problem of identifying to which set of categories a new observation belongs.  It is a supervised learning model as the classifier already has a set of classified examples and from these examples, the classifier learns to assign unseen new examples.  Example: Assigning a given email into "spam" or "non-spam" category. Is this A or B ?
  • 8. www.edureka.co/data-scienceEdureka’s Data Science Certification Training Classification - Example Feed the classifier with training data set and predefined labels. It will learn to categorize particular data under a specific label. How to train my model to identify spam mails from genuine mails? Source IP Address Phrases in the text Subject Line HTML Tags
  • 9. www.edureka.co/data-scienceEdureka’s Data Science Certification Training Classification Use Cases Banking Remote sensing Medicine Banking Identification of loan risk applicants by their probability of defaulting payments. Medicine Identification of at-risk patients and disease trends. Remote sensing Identification of areas of similar land use in a GIS database. Marketing Identifying customer churn. Use-cases Marketing
  • 10. www.edureka.co/data-scienceEdureka’s Data Science Certification Training Types Of Classifiers Decision Tree • Decision tree builds classification models in the form of a tree structure. • It breaks down a dataset into smaller and smaller subsets. • Random Forest is an ensemble classifier made using many decision tree models. • Ensemble models combine the results from different models. Random Forest Naïve Bayes • It is a classification technique based on Bayes' Theorem with an assumption of independence among attributes.
  • 11. www.edureka.co/data-scienceEdureka’s Data Science Certification Training What is Decision Tree?
  • 12. www.edureka.co/data-scienceEdureka’s Data Science Certification Training What Is Decision Tree?  A decision tree uses a tree structure to specify sequences of decisions and consequences.  A decision tree employs a structure of nodes and branches.  The depth of a node is the minimum number of steps required to reach the node from the root.  Eventually, a final point is reached and a prediction is made. Gender AgeIncome Yes No Yes No Root Node Internal Node Branch NodeDepth=1 Female Male <=40 >40 Leaf Node <=45000 >45000
  • 13. www.edureka.co/data-scienceEdureka’s Data Science Certification Training Use Case - Credit Risk Detection  To minimize loss, the bank needs a decision rule to predict whom to give approval of the loan.  An applicant’s demographic (income, debts, credit history) and socio-economic profiles are considered.  Data science can help banks recognize behavior patterns and provide a complete view of individual customers.
  • 14. www.edureka.co/data-scienceEdureka’s Data Science Certification Training How Decision Tree Works?
  • 15. www.edureka.co/data-scienceEdureka’s Data Science Certification Training How Decision Tree Works? Let’s take an example, We have taken dataset consisting of: • Weather information of last 14 days • Whether match was played or not on that particular day Now using the decision tree we need to predict whether the game will happen if the weather condition is Outlook = Rain Humidity = High Wind = Weak Play = ?
  • 16. www.edureka.co/data-scienceEdureka’s Data Science Certification Training How Decision Tree Works? From our data, we will choose one variable “Outlook” and will see how it affects the variable “Play” Day Outlook Humidity Wind Play D1 Sunny High Weak No D2 Sunny High Strong No D3 Overcast High Weak Yes D4 Rain High Weak Yes D5 Rain Normal Weak Yes D6 Rain Normal Strong No D7 Overcast Normal Strong Yes D8 Sunny High Weak No D9 Sunny Normal Weak Yes D10 Rain Normal Weak Yes D11 Sunny Normal Strong Yes D12 Overcast High Strong Yes D13 Overcast Normal Weak Yes D14 Rain High Strong No Outlook Play: 9 Yes, 5 No Sunny Overcast Rain There are 3 types of Outlook Here
  • 17. www.edureka.co/data-scienceEdureka’s Data Science Certification Training How Decision Tree Works? We can further divide our data based on Outlook. Outlook Overcast Sunny Rain Day Outlook Humidity Wind D1 Sunny High Weak D2 Sunny High Strong D8 Sunny High Weak D9 Sunny Normal Weak D11 Sunny Normal Strong 2 Yes / 3 No Split further Pure subset Will play 3 Yes / 2 No Split further Day Outlook Humidity Wind D4 Rain High Weak D5 Rain Normal Weak D6 Rain Normal Strong D10 Rain Normal Weak D14 Rain High Strong We will split the data until we get pure subsets at every branch 9 Yes / 5 No Day Outlook Humidity Wind D3 Overcast High Weak D7 Overcast Normal Strong D12 Overcast High Strong D13 Overcast Normal Weak
  • 18. www.edureka.co/data-scienceEdureka’s Data Science Certification Training How Decision Tree Works? We will use Humidity column to split the subset “Sunny” further. Will playWill not play Outlook Overcast Sunny Rain Humidity NormalHigh Day Humidity Wind D1 High Weak D2 High Strong D8 High Weak Pure subset Day Humidity Wind D9 Normal Weak D11 Normal Strong Pure subset 3 Yes / 2 No Split further Day Outlook Humidity Wind D4 Rain High Weak D5 Rain Normal Weak D6 Rain Normal Strong D10 Rain Normal Weak D14 Rain High Strong Pure subset Will play Day Outlook Humidity Wind D3 Overcast High Weak D7 Overcast Normal Strong D12 Overcast High Strong D13 Overcast Normal Weak
  • 19. www.edureka.co/data-scienceEdureka’s Data Science Certification Training How Decision Tree Works? We will use Humidity column to split the subset “Sunny” further. Will playWill not play Outlook Overcast Sunny Rain Humidity NormalHigh Weak Strong Wind Will play Will not play Day Humidity Wind D1 High Weak D2 High Strong D8 High Weak Pure subset Day Humidity Wind D9 Normal Weak D11 Normal Strong Pure subset Day Humidity Wind D4 High Weak D5 Normal Weak D10 Normal Weak Pure subset Day Humidity Wind D6 Normal Strong D14 High Strong Pure subset Pure subset Will play Day Outlook Humidity Wind D3 Overcast High Weak D7 Overcast Normal Strong D12 Overcast High Strong D13 Overcast Normal Weak
  • 20. www.edureka.co/data-scienceEdureka’s Data Science Certification Training How Decision Tree Works? We will use Humidity column to split the subset “Sunny” further. Will playWill not play Outlook Overcast Sunny Rain Humidity NormalHigh Weak Strong Wind Will play Will not play Will play
  • 21. www.edureka.co/data-scienceEdureka’s Data Science Certification Training How will I know which attribute to take? I’ll show you how
  • 22. www.edureka.co/data-scienceEdureka’s Data Science Certification Training Problem – Client Subscription Consider the case of a bank that wants to market its products to the appropriate customers. Given the demographics of clients and their reactions to previous campaign phone calls, the bank's goal is to predict which clients would subscribe. The attributes are: • Job • Marital status • Education • Housing • Loan • Contact • Poutcome
  • 23. www.edureka.co/data-scienceEdureka’s Data Science Certification Training How To Choose An Attribute?  A common way to identify the most informative attribute is to use entropy-based methods.  The entropy methods select the most informative attribute.  Entropy (H) can be calculated as, x = Datapoint p(x) = Probability of x H = Entropy of x
  • 24. www.edureka.co/data-scienceEdureka’s Data Science Certification Training How To Choose An Attribute? Now, let’s do some mathematics on it  Therefore, the root is only 10.55% pure on the subscribed = yes class.  Conversely, it is 89.45% pure on the subscribed = no class. P(subscribed=yes) = 0.1055 P(subscribed=no) = 0.8945 Hsubscribed = −0.1055·log20.1055–0.8945·log20.8945 ≈ 0.4862 P(subscribed = yes) = 1-1789/2000 =10.55% Let’s say, the overall fraction of the clients who have not subscribed to is 1,789 out of the total population of 2,000.
  • 25. www.edureka.co/data-scienceEdureka’s Data Science Certification Training How To Choose An Attribute? Conditional entropy is, Hsubscribed|contact = 0.4661 Calculating conditional entropy for subscribed|contact gives us following result.
  • 26. www.edureka.co/data-scienceEdureka’s Data Science Certification Training How To Choose An Attribute?  The information gain of an attribute A is defined as the difference between the base entropy (HS) and the conditional entropy of the attribute (HS|A).  Attribute poutcome has the most information gain and is the most informative variable. Therefore, poutcome is chosen for the first split of the decision tree. InfoGainA = HS – HS|A
  • 27. www.edureka.co/data-scienceEdureka’s Data Science Certification Training How To Choose An Attribute? Finally, we get the following decision tree Poutcome EducationNo Job Yes Root Node Branch Node Failure, Other, Unknown Secondary, tertiary Success Internal Node Primary, Unknown Leaf Node Admin, blue-collar, management, technician Self-employed, student, unemployed No Yes
  • 28. www.edureka.co/data-scienceEdureka’s Data Science Certification Training Decision Tree - Pros And Cons
  • 30. www.edureka.co/data-scienceEdureka’s Data Science Certification Training What if we could predict the occurrence of diabetes and take appropriate measures beforehand to prevent it? Sure! Let me take you through the steps to predict the vulnerable patients.
  • 31. www.edureka.co/data-scienceEdureka’s Data Science Certification Training Demo Data Acquisition Divide dataset Implement model Visualize Model Validation Doctor gets the following data from the medical history of the patient.
  • 32. www.edureka.co/data-scienceEdureka’s Data Science Certification Training Demo Data Acquisition Divide dataset Implement model Visualize Model Validation We will divide our entire dataset into two subsets as: • Training dataset -> to train the model • Testing dataset -> to validate and make predictions
  • 33. www.edureka.co/data-scienceEdureka’s Data Science Certification Training Demo Data Acquisition Divide dataset Implement model Visualize Model Validation  Here, we implement decision tree in R using following commands.
  • 34. www.edureka.co/data-scienceEdureka’s Data Science Certification Training Demo Data Acquisition Divide dataset Implement model Visualize Model Validation  We get the output as follows but this is not easy to understand, so let’s visualize it for better understanding.
  • 35. www.edureka.co/data-scienceEdureka’s Data Science Certification Training Demo Data Acquisition Divide dataset Implement model Visualize Model Validation For plotting we can use the following commands > plot(diabet_model,margin = 0.1) > text(diabet_model,use.n= TRUE,pretty = TRUE,cex =0.6) glucose_conc< 154.5 Diabetes_pedigree_fn<0.315glucose_conc< 131 blood_pressure>=72 NO 68/18 NO 12/3 YES 5/11 glucose_conc< 100.5 NO 107/3 BMI <26.35 Age >=53.5 NO 6/4 YES 9/65 NO 93/13 Age <30.5 Age >=53.5 NO 5/2 YES 13/39 NO 35/18
  • 36. www.edureka.co/data-scienceEdureka’s Data Science Certification Training Demo Data Acquisition Divide dataset Implement model Visualize Model Validation Now, we can use our model to predict the output of our testing dataset. We can use the following code for predicting the output.  pred_diabet<-predict(diabet_model,newdata = diabet_test,type ="class")  pred_diabet
  • 37. www.edureka.co/data-scienceEdureka’s Data Science Certification Training Demo Data Acquisition Divide dataset Implement model Visualize Model Validation We get the following output for our testing dataset where: “YES” means the probability of patient being vulnerable to diabetes is positive “NO” means the probability of patient being vulnerable to diabetes is negative.
  • 38. www.edureka.co/data-scienceEdureka’s Data Science Certification Training Demo Data Acquisition Divide dataset Implement model Visualize Model Validation  library(caret)  confusionMatrix(table(pred_diabet,diabet_test$is_diabetic)) We can create confusion matrix for the model using the library caret to know how good is our model.
  • 39. www.edureka.co/data-scienceEdureka’s Data Science Certification Training Demo Data acquisition Divide dataset Implement model Visualize Accuracy = 71.13% The accuracy (or the overall success rate) is a metric defining the rate at which a model has classified the records correctly. A good model should have a high accuracy score. Data Acquisition Divide dataset Implement model Visualize Model Validation
  • 40. www.edureka.co/data-scienceEdureka’s Data Science Certification Training Course Details Go to www.edureka.co/data-science Get Edureka Certified in Data Science Today! What our learners have to say about us! Shravan Reddy says- “I would like to recommend any one who wants to be a Data Scientist just one place: Edureka. Explanations are clean, clear, easy to understand. Their support team works very well.. I took the Data Science course and I'm going to take Machine Learning with Mahout and then Big Data and Hadoop”. Gnana Sekhar says - “Edureka Data science course provided me a very good mixture of theoretical and practical training. LMS pre recorded sessions and assignments were very good as there is a lot of information in them that will help me in my job. Edureka is my teaching GURU now...Thanks EDUREKA.” Balu Samaga says - “It was a great experience to undergo and get certified in the Data Science course from Edureka. Quality of the training materials, assignments, project, support and other infrastructures are a top notch.”

Editor's Notes