3) Code For ID3 Algorithm Implementation

The document loads and analyzes Iris flower data using Python libraries like Pandas and Seaborn. It uploads an Iris CSV file, loads the data into a Pandas dataframe, then performs various visualizations and analyses. These include scatter plots of features colored by species, box plots, density plots, and pair plots to understand relationships between features and species. It also fits a decision tree classifier to the data and plots the tree.

Uploaded by

Prajith Sprinťèř


import pandas as pd

from google.colab import files
uploaded = files.upload()

Saving Iris.csv to Iris (1).csv

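import io
# Read the first (only) uploaded file. Colab may rename a re-uploaded file
# (here to "Iris (1).csv"), so avoid hard-coding the dictionary key.
Iris = pd.read_csv(io.BytesIO(next(iter(uploaded.values()))))
Iris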

      Id  SepalLengthCm  SepalWidthCm  PetalLengthCm  PetalWidthCm         Species
0      1            5.1           3.5            1.4           0.2     Iris-setosa
1      2            4.9           3.0            1.4           0.2     Iris-setosa
2      3            4.7           3.2            1.3           0.2     Iris-setosa
3      4            4.6           3.1            1.5           0.2     Iris-setosa
4      5            5.0           3.6            1.4           0.2     Iris-setosa
..   ...            ...           ...            ...           ...             ...
145  146            6.7           3.0            5.2           2.3  Iris-virginica
146  147            6.3           2.5            5.0           1.9  Iris-virginica
147  148            6.5           3.0            5.2           2.0  Iris-virginica
148  149            6.2           3.4            5.4           2.3  Iris-virginica
149  150            5.9           3.0            5.1           1.8  Iris-virginica

150 rows × 6 columns

import pandas as pd

# We'll also import seaborn, a Python graphing library
import warnings # current version of seaborn generates a bunch of warnings that we'll ignore
warnings.filterwarnings("ignore")
import seaborn as sns
import matplotlib.pyplot as plt
sns.set(style="white", color_codes=True)

Iris.head()

   Id  SepalLengthCm  SepalWidthCm  PetalLengthCm  PetalWidthCm      Species
0   1            5.1           3.5            1.4           0.2  Iris-setosa
1   2            4.9           3.0            1.4           0.2  Iris-setosa
2   3            4.7           3.2            1.3           0.2  Iris-setosa
3   4            4.6           3.1            1.5           0.2  Iris-setosa
4   5            5.0           3.6            1.4           0.2  Iris-setosa

Iris["Species"].value_counts()

Iris-setosa        50
Iris-versicolor    50
Iris-virginica     50
Name: Species, dtype: int64

# The first way we can plot things is using the .plot extension from Pandas dataframes
# We'll use this to make a scatterplot of the Iris features.
Iris.plot(kind="scatter", x="SepalLengthCm", y="SepalWidthCm")

*c* argument looks like a single numeric RGB or RGBA sequence, which should be avoide
<matplotlib.axes._subplots.AxesSubplot at 0x7fb55c046750>

# We can also use the seaborn library to make a similar plot
# A seaborn jointplot shows bivariate scatterplots and univariate histograms in the same figure
# (recent seaborn releases use `height` where older ones used `size`)
sns.jointplot(x="SepalLengthCm", y="SepalWidthCm", data=Iris, height=10)
<seaborn.axisgrid.JointGrid at 0x7fb55bf27150>

# One piece of information missing in the plots above is what species each plant is
# We'll use seaborn's FacetGrid to color the scatterplot by species
sns.FacetGrid(Iris, hue="Species", height=5) \
.map(plt.scatter, "SepalLengthCm", "SepalWidthCm") \
.add_legend()

<seaborn.axisgrid.FacetGrid at 0x7fb55bd81710>

# We can look at an individual feature in Seaborn through a boxplot
sns.boxplot(x="Species", y="PetalLengthCm", data=Iris)
<matplotlib.axes._subplots.AxesSubplot at 0x7fb55bc8ced0>

# A final seaborn plot useful for looking at univariate relations is the kdeplot,
# which creates and visualizes a kernel density estimate of the underlying feature
sns.FacetGrid(Iris, hue="Species", height=6) \
.map(sns.kdeplot, "SepalLengthCm") \
.add_legend()

<seaborn.axisgrid.FacetGrid at 0x7fb5657b0350>

# Another useful seaborn plot is the pairplot, which shows the bivariate relation
# between each pair of features
#
# From the pairplot, we'll see that the Iris-setosa species is separated from the other
# two across all feature combinations
sns.pairplot(Iris.drop("Id", axis=1), hue="Species", height=3)
<seaborn.axisgrid.PairGrid at 0x7fb56579ae50>
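
The separation noted in the pairplot comment can also be checked numerically. The sketch below is an illustration rather than part of the original notebook. It assumes scikit-learn's bundled copy of the Iris data (not the uploaded CSV), and the names `iris_check`, `setosa_max`, and `others_min` are made up for this example. It compares the largest setosa petal length with the smallest petal length among the other two species.

```python
from sklearn.datasets import load_iris

# Assumption: use scikit-learn's bundled Iris data instead of the uploaded CSV.
iris_check = load_iris()
petal_length = iris_check.data[:, 2]   # column 2 is "petal length (cm)"
labels = iris_check.target             # 0 = setosa, 1 = versicolor, 2 = virginica

setosa_max = petal_length[labels == 0].max()
others_min = petal_length[labels != 0].min()

# If the setosa maximum lies below the minimum of the other species,
# a single threshold on petal length isolates setosa completely.
separated = setosa_max < others_min
print(setosa_max, others_min, separated)
```

A gap between the two values means one threshold on petal length is enough to split setosa from the rest, which is consistent with the first split the decision tree chooses later on.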

from sklearn.datasets import load_iris
from sklearn import tree
iris = load_iris()
X, y = iris.data, iris.target
clf = tree.DecisionTreeClassifier()
clf = clf.fit(X, y)
clf

DecisionTreeClassifier(ccp_alpha=0.0, class_weight=None, criterion='gini',
                       max_depth=None, max_features=None, max_leaf_nodes=None,
                       min_impurity_decrease=0.0, min_impurity_split=None,
                       min_samples_leaf=1, min_samples_split=2,
                       min_weight_fraction_leaf=0.0, presort='deprecated',
                       random_state=None, splitter='best')
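
The output above shows criterion='gini'. scikit-learn's DecisionTreeClassifier is an optimized CART implementation, not ID3. A rough way to get closer to the information-gain idea behind ID3 is to switch the split criterion to entropy. The sketch below assumes that is what is wanted. The name `clf_entropy` and the fixed `random_state` are choices made for this example only, and the resulting tree still uses binary splits on numeric thresholds, so it is not a true ID3 tree.

```python
import math

from sklearn.datasets import load_iris
from sklearn import tree

iris = load_iris()
X, y = iris.data, iris.target

# Same classifier as above, but splits are scored by entropy (information gain)
# instead of Gini impurity. random_state is fixed only to make reruns repeatable.
clf_entropy = tree.DecisionTreeClassifier(criterion="entropy", random_state=0)
clf_entropy = clf_entropy.fit(X, y)

# Entropy at the root node, in bits. With three equally frequent classes
# this equals log2(3).
root_impurity = clf_entropy.tree_.impurity[0]
print(root_impurity, math.log2(3))
```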

tree.plot_tree(clf)

[Text(167.4, 199.32, 'X[2] <= 2.45\ngini = 0.667\nsamples = 150\nvalue = [50, 50, 50]
Text(141.64615384615385, 163.07999999999998, 'gini = 0.0\nsamples = 50\nvalue = [50,
Text(193.15384615384616, 163.07999999999998, 'X[3] <= 1.75\ngini = 0.5\nsamples = 10
Text(103.01538461538462, 126.83999999999999, 'X[2] <= 4.95\ngini = 0.168\nsamples =
Text(51.50769230769231, 90.6, 'X[3] <= 1.65\ngini = 0.041\nsamples = 48\nvalue = [0,
Text(25.753846153846155, 54.359999999999985, 'gini = 0.0\nsamples = 47\nvalue = [0,
Text(77.26153846153846, 54.359999999999985, 'gini = 0.0\nsamples = 1\nvalue = [0, 0,
Text(154.52307692307693, 90.6, 'X[3] <= 1.55\ngini = 0.444\nsamples = 6\nvalue = [0,
Text(128.76923076923077, 54.359999999999985, 'gini = 0.0\nsamples = 3\nvalue = [0, 0
Text(180.27692307692308, 54.359999999999985, 'X[0] <= 6.95\ngini = 0.444\nsamples =
Text(154.52307692307693, 18.119999999999976, 'gini = 0.0\nsamples = 2\nvalue = [0, 2
Text(206.03076923076924, 18.119999999999976, 'gini = 0.0\nsamples = 1\nvalue = [0, 0
Text(283.2923076923077, 126.83999999999999, 'X[2] <= 4.85\ngini = 0.043\nsamples = 4
Text(257.53846153846155, 90.6, 'X[1] <= 3.1\ngini = 0.444\nsamples = 3\nvalue = [0,
Text(231.7846153846154, 54.359999999999985, 'gini = 0.0\nsamples = 2\nvalue = [0, 0,
Text(283.2923076923077, 54.359999999999985, 'gini = 0.0\nsamples = 1\nvalue = [0, 1,
Text(309.04615384615386, 90.6, 'gini = 0.0\nsamples = 43\nvalue = [0, 0, 43]')]
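
The gini values in the output above can be reproduced by hand from the class counts in each node. The sketch below is an illustration, with `gini` as a helper name invented for this example. It uses the usual definition, one minus the sum of squared class proportions, on counts that appear in the output.

```python
def gini(counts):
    """Gini impurity of a node, given the number of samples per class."""
    total = sum(counts)
    return 1.0 - sum((c / total) ** 2 for c in counts)

root = gini([50, 50, 50])   # root node: all 150 samples, three equal classes
leaf = gini([50, 0, 0])     # pure setosa leaf
node = gini([0, 49, 5])     # the 54-sample node dominated by versicolor

print(round(root, 3), round(leaf, 3), round(node, 3))
```

Rounded to three decimals, these are the 0.667, 0.0, and 0.168 reported for the corresponding nodes.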

import graphviz

dot_data = tree.export_graphviz(clf, out_file=None)

graph = graphviz.Source(dot_data)

graph.render("iris")

'iris.pdf'
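
If Graphviz is not installed, the fitted tree can still be inspected as indented text rules. The sketch below is an alternative view, not part of the original notebook. It refits a tree on scikit-learn's bundled Iris data with a fixed `random_state` (an assumption made for repeatability) and prints it with `tree.export_text`. The variable name `rules` is made up for this example.

```python
from sklearn.datasets import load_iris
from sklearn import tree

iris = load_iris()
clf = tree.DecisionTreeClassifier(random_state=0).fit(iris.data, iris.target)

# export_text returns the tree as a plain string, one line per node,
# with "|---" markers showing the depth of each split or leaf.
rules = tree.export_text(clf, feature_names=list(iris.feature_names))
print(rules)
```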

# Export again, this time with readable feature and class names and colored nodes
dot_data = tree.export_graphviz(clf, out_file=None,
                                feature_names=iris.feature_names,
                                class_names=iris.target_names,
                                filled=True, rounded=True,
                                special_characters=True)

graph = graphviz.Source(dot_data)

graph
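[Figure: Graphviz rendering of the fitted decision tree, cropped in this copy. Visible nodes: the root, which splits on petal length (cm); its True branch, a leaf with gini = 0.0, samples = 50, value = [50, 0, 0], class = setosa; and a further split on petal length (cm) with gini = 0.168, samples = 54, class = versicolor.]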
