Copyright © 2017, edureka and/or its affiliates. All rights reserved.
Copyright © 2017, edureka and/or its affiliates. All rights reserved.
Linear Regression Algorithm
Copyright © 2017, edureka and/or its affiliates. All rights reserved.
▪ What is Regression?
▪ Regression Use-case
▪ Types of Regression – Linear vs Logistic Regression
▪ What is Linear Regression?
▪ Finding best fit regression line using Least Square Method
▪ Checking goodness of fit using R squared Method
▪ Implementation of Linear Regression using Python
▪ Linear Regression Algorithm using Python from scratch
▪ Linear Regression Algorithm using Python (scikit lib)
Agenda for Today’s Session
“Regression analysis is a form of predictive modelling
technique which investigates the relationship between
a dependent and independent variable”What is
Regression?
Uses of
Regression
Three major uses for regression analysis are
▪ Determining the strength of predictors
▪ Forecasting an effect, and
▪ Trend forecasting
Linear vs
Logistic
Regression
Basis Linear Regression Logistic Regression
Core Concept The data is modelled
using a straight line
The probability of some
obtained event is
represented as a linear
function of a combination of
predictor variables.
Used with Continuous Variable Categorical Variable
Output/Prediction Value of the variable Probability of occurrence of
event
Accuracy and
Goodness of fit
measured by loss, R
squared, Adjusted R
squared etc.
Accuracy, Precision, Recall,
F1 score, ROC curve,
Confusion Matrix, etc
“Linear Regression is a method to predict dependent variable (Y)
based on values of independent variables (X). It can be used for the
cases where we want to predict some continuous quantity.”
What is Linear
Regression?
▪ Classification and Regression Capabilities
▪ Data Quality
▪ Computational Complexity
▪ Comprehensible and Transparent
Linear Regression
Selection
Criteria
▪ Evaluating Trends and Sales Estimates
▪ Analyzing the Impact of Price Changes
▪ Assessment of risk in financial services and
insurance domain
Where is
Linear
Regression
used?
Independent Variable
DependentVariable
X
Y
Machine Learning Training with Python www.edureka.co/python
Line
Understanding
Linear
Regression
Algorithm
Understanding
Linear
Regression
Algorithm
Independent Variable
DependentVariable
X
Y
Machine Learning Training with Python www.edureka.co/python
Understanding
Linear
Regression
Algorithm
Independent Variable
DependentVariable
X
Y
Machine Learning Training with Python www.edureka.co/python
+ve
Understanding
Linear
Regression
Algorithm
Independent Variable
DependentVariable
X
Y
Machine Learning Training with Python www.edureka.co/python
-ve
Understanding
Linear
Regression
Algorithm
Independent Variable
DependentVariable
X
Y
Machine Learning Training with Python www.edureka.co/python
Line
Understanding
Linear
Regression
Algorithm
Independent Variable
DependentVariable
X
Y
Machine Learning Training with Python www.edureka.co/python
Linear Regression
Understanding
Linear
Regression
Algorithm
Independent Variable
DependentVariable
X
Y
Machine Learning Training with Python www.edureka.co/python
Observation
Understanding
Linear
Regression
Algorithm
Independent Variable
DependentVariable
X
Y
Machine Learning Training with Python www.edureka.co/python
Regression Line
Least Square Method
Understanding
Linear
Regression
Algorithm
Independent Variable
DependentVariable
X
Y
Machine Learning Training with Python www.edureka.co/python
Estimated Value
Actual Value
error
error
Understanding
Linear
Regression
Algorithm
Independent Variable
DependentVariable
X
Y
Machine Learning Training with Python www.edureka.co/python
Minimize the error
Understanding
Linear
Regression
Algorithm
Speed
Distance
X
Y
Machine Learning Training with Python www.edureka.co/python
+ve Relationship
Understanding
Linear
Regression
Algorithm
Speed
Distance
X
Y
Machine Learning Training with Python www.edureka.co/python
y = mx + c
Distance travelled in a
fixed duration of time
Understanding
Linear
Regression
Algorithm
Speed
Distance
X
Y
Machine Learning Training with Python www.edureka.co/python
y = mx + c
Speed of Vehicle
Understanding
Linear
Regression
Algorithm
Speed
Distance
X
Y
Machine Learning Training with Python www.edureka.co/python
y = mx + c
m = +ve slope of line
Understanding
Linear
Regression
Algorithm
Speed
Distance
X
Y
Machine Learning Training with Python www.edureka.co/python
y = mx + c
c = y - intercept of the line
Understanding
Linear
Regression
Algorithm
Speed
Time
X
Y
Machine Learning Training with Python www.edureka.co/python
-ve Relationship
Understanding
Linear
Regression
Algorithm
Speed
Time
X
Y
Machine Learning Training with Python www.edureka.co/python
y = - mx + c
Time taken to travel a
fixed distance
Understanding
Linear
Regression
Algorithm
Speed
Time
X
Y
Machine Learning Training with Python www.edureka.co/python
y = - mx + c
Speed of Vehicle
Understanding
Linear
Regression
Algorithm
Speed
Time
X
Y
Machine Learning Training with Python www.edureka.co/python
y = - mx + c
m = -ve slope of line
Understanding
Linear
Regression
Algorithm
Speed
Time
X
Y
Machine Learning Training with Python www.edureka.co/python
y = - mx + c
c = y - intercept of the line
Understanding
Linear
Regression
Algorithm
Independent Variable
DependentVariable
X
Y
Machine Learning Training with Python www.edureka.co/python
y = mx + c
Independent Variable
Dependent Variable
Copyright © 2017, edureka and/or its affiliates. All rights reserved.
Independent Variable
Dependent Variable
X
Y
Machine Learning Training with Python www.edureka.co/python
1 2 3 4 5 6
1
2
3
4
5
0
𝑥
1
2
3
4
5
3
4
2
4
5
𝑦
Understanding Linear Regression Algorithm
Copyright © 2017, edureka and/or its affiliates. All rights reserved.
Independent Variable
Dependent Variable
X
Y
Machine Learning Training with Python www.edureka.co/python
1 2 3 4 5 6
1
2
3
4
5
0
𝑥
1
2
3
4
5
3
4
2
4
5
𝑦
Understanding Linear Regression Algorithm
Copyright © 2017, edureka and/or its affiliates. All rights reserved.
Independent Variable
Dependent Variable
X
Y
Machine Learning Training with Python www.edureka.co/python
1 2 3 4 5 6
1
2
3
4
5
0
𝑥
1
2
3
4
5
3
4
2
4
5
𝑦
Understanding Linear Regression Algorithm
Copyright © 2017, edureka and/or its affiliates. All rights reserved.
Independent Variable
Dependent Variable
X
Y
Machine Learning Training with Python www.edureka.co/python
1 2 3 4 5 6
1
2
3
4
5
0
𝑥
1
2
3
4
5
3
4
2
4
5
𝑦
mean: ഥ𝒙 3 18/53.6 mean : ഥ𝒚
Understanding Linear Regression Algorithm
Copyright © 2017, edureka and/or its affiliates. All rights reserved.
Independent Variable
Dependent Variable
X
Y
Machine Learning Training with Python www.edureka.co/python
1 2 3 4 5 6
1
2
3
4
5
0
𝑥
1
2
3
4
5
3
4
2
4
5
𝑦
mean 3 3.6( , )
Understanding Linear Regression Algorithm
Copyright © 2017, edureka and/or its affiliates. All rights reserved.
Independent Variable
Dependent Variable
X
Y
Machine Learning Training with Python www.edureka.co/python
1 2 3 4 5 6
1
2
3
4
5
0
𝑥
1
2
3
4
5
3
4
2
4
5
𝑦
mean3( , )3.6
Understanding Linear Regression Algorithm
Copyright © 2017, edureka and/or its affiliates. All rights reserved.
Independent Variable
Dependent Variable
X
Y
Machine Learning Training with Python www.edureka.co/python
1 2 3 4 5 6
1
2
3
4
5
0
𝑥
1
2
3
4
5
3
4
2
4
5
𝑦
mean 3 3.6
y = mx+c
(𝑥 − ҧ𝑥)(𝑦 − ത𝑦)
𝑥 − ҧ𝑥 2
m =
𝛴
𝛴
Understanding Linear Regression Algorithm
Copyright © 2017, edureka and/or its affiliates. All rights reserved.
Independent Variable
Dependent Variable
X
Y
Machine Learning Training with Python www.edureka.co/python
1 2 3 4 5 6
1
2
3
4
5
0
𝑥
1
2
3
4
5
3
4
2
4
5
𝑦
mean 3 3.6
𝑥 − ҧ𝑥
1
Understanding Linear Regression Algorithm
Copyright © 2017, edureka and/or its affiliates. All rights reserved.
Independent Variable
Dependent Variable
X
Y
Machine Learning Training with Python www.edureka.co/python
1 2 3 4 5 6
1
2
3
4
5
0
𝑥
1
2
3
4
5
3
4
2
4
5
𝑦
mean 3 3.6
𝑥 − ҧ𝑥
1
3
−
Understanding Linear Regression Algorithm
Copyright © 2017, edureka and/or its affiliates. All rights reserved.
Independent Variable
Dependent Variable
X
Y
Machine Learning Training with Python www.edureka.co/python
1 2 3 4 5 6
1
2
3
4
5
0
𝑥
1
2
3
4
5
3
4
2
4
5
𝑦
mean 3 3.6
𝑥 − ҧ𝑥
1
3
−
3
Understanding Linear Regression Algorithm
Copyright © 2017, edureka and/or its affiliates. All rights reserved.
Independent Variable
Dependent Variable
X
Y
Machine Learning Training with Python www.edureka.co/python
1 2 3 4 5 6
1
2
3
4
5
0
𝑥
1
2
3
4
5
3
4
2
4
5
𝑦
mean 3 3.6
𝑥 − ҧ𝑥
1
3
−3
Understanding Linear Regression Algorithm
Copyright © 2017, edureka and/or its affiliates. All rights reserved.
Independent Variable
Dependent Variable
X
Y
Machine Learning Training with Python www.edureka.co/python
1 2 3 4 5 6
1
2
3
4
5
0
𝑥
1
2
3
4
5
3
4
2
4
5
𝑦
mean 3 3.6
𝑥 − ҧ𝑥
3
-2
2
Understanding Linear Regression Algorithm
Copyright © 2017, edureka and/or its affiliates. All rights reserved.
Independent Variable
Dependent Variable
X
Y
Machine Learning Training with Python www.edureka.co/python
1 2 3 4 5 6
1
2
3
4
5
0
𝑥
1
2
3
4
5
3
4
2
4
5
𝑦
mean 3 3.6
𝑥 − ҧ𝑥
3
-2
2
−
Understanding Linear Regression Algorithm
Copyright © 2017, edureka and/or its affiliates. All rights reserved.
Independent Variable
Dependent Variable
X
Y
Machine Learning Training with Python www.edureka.co/python
1 2 3 4 5 6
1
2
3
4
5
0
𝑥
1
2
3
4
5
3
4
2
4
5
𝑦
mean 3 3.6
𝑥 − ҧ𝑥
3
-2
2 −
3
Understanding Linear Regression Algorithm
Copyright © 2017, edureka and/or its affiliates. All rights reserved.
Independent Variable
Dependent Variable
X
Y
Machine Learning Training with Python www.edureka.co/python
1 2 3 4 5 6
1
2
3
4
5
0
𝑥
1
2
3
4
5
3
4
2
4
5
𝑦
mean 3 3.6
𝑥 − ҧ𝑥
3
-2
2 − 3
Understanding Linear Regression Algorithm
Copyright © 2017, edureka and/or its affiliates. All rights reserved.
Independent Variable
Dependent Variable
X
Y
Machine Learning Training with Python www.edureka.co/python
1 2 3 4 5 6
1
2
3
4
5
0
𝑥
1
2
3
4
5
3
4
2
4
5
𝑦
mean 3 3.6
𝑥 − ҧ𝑥
3
-2
-1
Understanding Linear Regression Algorithm
Copyright © 2017, edureka and/or its affiliates. All rights reserved.
Independent Variable
Dependent Variable
X
Y
Machine Learning Training with Python www.edureka.co/python
1 2 3 4 5 6
1
2
3
4
5
0
𝑥
1
2
3
4
5
3
4
2
4
5
𝑦
mean 3 3.6
𝑥 − ҧ𝑥
3
-2
-1
0
Understanding Linear Regression Algorithm
Copyright © 2017, edureka and/or its affiliates. All rights reserved.
Independent Variable
Dependent Variable
X
Y
Machine Learning Training with Python www.edureka.co/python
1 2 3 4 5 6
1
2
3
4
5
0
𝑥
1
2
3
4
5
3
4
2
4
5
𝑦
mean 3 3.6
𝑥 − ҧ𝑥
3
-2
-1
0
1
Understanding Linear Regression Algorithm
Copyright © 2017, edureka and/or its affiliates. All rights reserved.
Independent Variable
Dependent Variable
X
Y
Machine Learning Training with Python www.edureka.co/python
1 2 3 4 5 6
1
2
3
4
5
0
𝑥
1
2
3
4
5
3
4
2
4
5
𝑦
mean 3 3.6
𝑥 − ҧ𝑥
3
-2
-1
0
1
2
y = mx+c
(𝑥 − ҧ𝑥)(𝑦 − ത𝑦)
𝑥 − ҧ𝑥 2
m =
𝛴
𝛴
Understanding Linear Regression Algorithm
Copyright © 2017, edureka and/or its affiliates. All rights reserved.
Independent Variable
Dependent Variable
X
Y
Machine Learning Training with Python www.edureka.co/python
1 2 3 4 5 6
1
2
3
4
5
0
𝑥
1
2
3
4
5
3
4
2
4
5
𝑦
mean 3 3.6
𝑥 − ҧ𝑥
3
-2
-1
0
1
2
𝑦 − ത𝑦
3
Understanding Linear Regression Algorithm
Copyright © 2017, edureka and/or its affiliates. All rights reserved.
Independent Variable
Dependent Variable
X
Y
Machine Learning Training with Python www.edureka.co/python
1 2 3 4 5 6
1
2
3
4
5
0
𝑥
1
2
3
4
5
3
4
2
4
5
𝑦
mean 3 3.6
𝑥 − ҧ𝑥
3
-2
-1
0
1
2
𝑦 − ത𝑦
3.6
3
−
Understanding Linear Regression Algorithm
Copyright © 2017, edureka and/or its affiliates. All rights reserved.
Independent Variable
Dependent Variable
X
Y
Machine Learning Training with Python www.edureka.co/python
1 2 3 4 5 6
1
2
3
4
5
0
𝑥
1
2
3
4
5
3
4
2
4
5
𝑦
mean 3 3.6
𝑥 − ҧ𝑥
3
-2
-1
0
1
2
𝑦 − ത𝑦
3.6
3 −
Understanding Linear Regression Algorithm
Copyright © 2017, edureka and/or its affiliates. All rights reserved.
Independent Variable
Dependent Variable
X
Y
Machine Learning Training with Python www.edureka.co/python
1 2 3 4 5 6
1
2
3
4
5
0
𝑥
1
2
3
4
5
3
4
2
4
5
𝑦
mean 3 3.6
𝑥 − ҧ𝑥
3
-2
-1
0
1
2
𝑦 − ത𝑦
3 −3.6
Understanding Linear Regression Algorithm
Copyright © 2017, edureka and/or its affiliates. All rights reserved.
Independent Variable
Dependent Variable
X
Y
Machine Learning Training with Python www.edureka.co/python
1 2 3 4 5 6
1
2
3
4
5
0
𝑥
1
2
3
4
5
3
4
2
4
5
𝑦
mean 3 3.6
𝑥 − ҧ𝑥
3
-2
-1
0
1
2
𝑦 − ത𝑦
-0.6
Understanding Linear Regression Algorithm
Copyright © 2017, edureka and/or its affiliates. All rights reserved.
Independent Variable
Dependent Variable
X
Y
Machine Learning Training with Python www.edureka.co/python
1 2 3 4 5 6
1
2
3
4
5
0
𝑥
1
2
3
4
5
3
4
2
4
5
𝑦
mean 3 3.6
𝑥 − ҧ𝑥
3
-2
-1
0
1
2
𝑦 − ത𝑦
-0.6
0.4
Understanding Linear Regression Algorithm
Copyright © 2017, edureka and/or its affiliates. All rights reserved.
Independent Variable
Dependent Variable
X
Y
Machine Learning Training with Python www.edureka.co/python
1 2 3 4 5 6
1
2
3
4
5
0
𝑥
1
2
3
4
5
3
4
2
4
5
𝑦
mean 3 3.6
𝑥 − ҧ𝑥
3
-2
-1
0
1
2
𝑦 − ത𝑦
-0.6
0.4
-1.6
Understanding Linear Regression Algorithm
Copyright © 2017, edureka and/or its affiliates. All rights reserved.
Independent Variable
Dependent Variable
X
Y
Machine Learning Training with Python www.edureka.co/python
1 2 3 4 5 6
1
2
3
4
5
0
𝑥
1
2
3
4
5
3
4
2
4
5
𝑦
mean 3 3.6
𝑥 − ҧ𝑥
3
-2
-1
0
1
2
𝑦 − ത𝑦
-0.6
0.4
-1.6
0.4
Understanding Linear Regression Algorithm
Copyright © 2017, edureka and/or its affiliates. All rights reserved.
Independent Variable
Dependent Variable
X
Y
Machine Learning Training with Python www.edureka.co/python
1 2 3 4 5 6
1
2
3
4
5
0
𝑥
1
2
3
4
5
3
4
2
4
5
𝑦
mean 3 3.6
𝑥 − ҧ𝑥
3
-2
-1
0
1
2
𝑦 − ത𝑦
-0.6
0.4
-1.6
0.4
1.4
y = mx+c
(𝑥 − ҧ𝑥)(𝑦 − ത𝑦)
𝑥 − ҧ𝑥 2
m =
𝛴
𝛴
Understanding Linear Regression Algorithm
Copyright © 2017, edureka and/or its affiliates. All rights reserved.
Independent Variable
Dependent Variable
X
Y
Machine Learning Training with Python www.edureka.co/python
1 2 3 4 5 6
1
2
3
4
5
0
𝑥
1
2
3
4
5
3
4
2
4
5
𝑦
mean 3 3.6
𝑥 − ҧ𝑥
3
-2
-1
0
1
2
𝑦 − ത𝑦
-0.6
0.4
-1.6
0.40.4
1.4
𝑥 − ҧ𝑥 2
-2y = mx+c
(𝑥 − ҧ𝑥)(𝑦 − ത𝑦)
𝑥 − ҧ𝑥 2
where m = 𝛴
Understanding Linear Regression Algorithm
Copyright © 2017, edureka and/or its affiliates. All rights reserved.
Independent Variable
Machine Learning Training with Python www.edureka.co/python
𝑥
1
2
3
4
5
3
4
2
4
5
𝑦
mean 3 3.6
𝑥 − ҧ𝑥
-2
-1
0
1
2
𝑦 − ത𝑦
-0.6
0.4
-1.6
0.40.4
1.4
𝑥 − ҧ𝑥 2
4
-1
Independent Variable
Dependent Variable
X
Y
1 2 3 4 5 6
1
2
3
4
5
0 3
y = mx+c
Understanding Linear Regression Algorithm
Copyright © 2017, edureka and/or its affiliates. All rights reserved.
Independent Variable
Machine Learning Training with Python www.edureka.co/python
𝑥
1
2
3
4
5
3
4
2
4
5
𝑦
mean 3 3.6
𝑥 − ҧ𝑥
-2
-1
0
1
2
𝑦 − ത𝑦
-0.6
0.4
-1.6
0.40.4
1.4
𝑥 − ҧ𝑥 2
4
1
0
Independent Variable
Dependent Variable
X
Y
1 2 3 4 5 6
1
2
3
4
5
0 3
y = mx+c
Understanding Linear Regression Algorithm
Copyright © 2017, edureka and/or its affiliates. All rights reserved.Machine Learning Training with Python www.edureka.co/python
𝑥
1
2
3
4
5
3
4
2
4
5
𝑦
mean 3 3.6
𝑥 − ҧ𝑥
-2
-1
0
1
2
𝑦 − ത𝑦
-0.6
0.4
-1.6
0.40.4
1.4
𝑥 − ҧ𝑥 2
4
1
0
1
Independent Variable
Dependent Variable
X
Y
1 2 3 4 5 6
1
2
3
4
5
0 3
y = mx+c
Understanding Linear Regression Algorithm
Copyright © 2017, edureka and/or its affiliates. All rights reserved.Machine Learning Training with Python www.edureka.co/python
𝑥
1
2
3
4
5
3
4
2
4
5
𝑦
mean 3 3.6
𝑥 − ҧ𝑥
-2
-1
0
1
2
𝑦 − ത𝑦
-0.6
0.4
-1.6
0.40.4
1.4
𝑥 − ҧ𝑥 2
4
1
0
1
2
Independent Variable
Dependent Variable
X
Y
1 2 3 4 5 6
1
2
3
4
5
0 3
y = mx+c
Understanding Linear Regression Algorithm
Copyright © 2017, edureka and/or its affiliates. All rights reserved.Machine Learning Training with Python www.edureka.co/python
𝑥
1
2
3
4
5
3
4
2
4
5
𝑦
mean 3 3.6
𝑥 − ҧ𝑥
-2
-1
0
1
2
𝑦 − ത𝑦
-0.6
0.4
-1.6
0.40.4
1.4
𝑥 − ҧ𝑥 2
4
1
0
1
4
Independent Variable
Dependent Variable
X
Y
1 2 3 4 5 6
1
2
3
4
5
0 3
y = mx+cy = mx+c
(𝑥 − ҧ𝑥)(𝑦 − ത𝑦)
𝑥 − ҧ𝑥 2
m =
𝛴
𝛴
Understanding Linear Regression Algorithm
Copyright © 2017, edureka and/or its affiliates. All rights reserved.
Independent Variable
Machine Learning Training with Python www.edureka.co/python
𝑥
1
2
3
4
5
3
4
2
4
5
𝑦
mean 3 3.6
𝑥 − ҧ𝑥
-2
-1
0
1
2
𝑦 − ത𝑦
-0.6
0.4
-1.6
0.40.4
1.4
𝑥 − ҧ𝑥 2
4
1
0
1
4
Independent Variable
Dependent Variable
X
Y
1 2 3 4 5 6
1
2
3
4
5
0 3
y = mx+c
Understanding Linear Regression Algorithm
Copyright © 2017, edureka and/or its affiliates. All rights reserved.Machine Learning Training with Python www.edureka.co/python
𝑥
1
2
3
4
5
3
4
2
4
5
𝑦
mean 3 3.6
𝑥 − ҧ𝑥
-2
-1
0
1
2
𝑦 − ത𝑦
-0.6
0.4
-1.6
0.40.4
1.4
𝑥 − ҧ𝑥 2
4
1
0
1
4
(𝑥 − ҧ𝑥)(𝑦 − ത𝑦)
-2
-1
0
1
2
-0.6
0.4
-1.6
0.4
1.4
Independent Variable
Dependent Variable
X
Y
1 2 3 4 5 6
1
2
3
4
5
0 3
y = mx+c
Understanding Linear Regression Algorithm
Copyright © 2017, edureka and/or its affiliates. All rights reserved.Machine Learning Training with Python www.edureka.co/python
𝑥
1
2
3
4
5
3
4
2
4
5
𝑦
mean 3 3.6
𝑥 − ҧ𝑥
-2
-1
0
1
2
𝑦 − ത𝑦
-0.6
0.4
-1.6
0.40.4
1.4
𝑥 − ҧ𝑥 2
4
1
0
1
4
(𝑥 − ҧ𝑥)(𝑦 − ത𝑦)
-1
0
2
1.2
0.4
-1.6
0.4
1.4
-0.4
2.8
0.4
Independent Variable
Dependent Variable
X
Y
1 2 3 4 5 6
1
2
3
4
5
0 3
y = mx+c
(𝑥 − ҧ𝑥)(𝑦 − ത𝑦)𝑥 − ҧ𝑥 2
y = mx+c
(𝑥 − ҧ𝑥)(𝑦 − ത𝑦)
𝑥 − ҧ𝑥 2
m =
𝛴
𝛴
Understanding Linear Regression Algorithm
Copyright © 2017, edureka and/or its affiliates. All rights reserved.Machine Learning Training with Python www.edureka.co/python
𝑥
1
2
3
4
5
3
4
2
4
5
𝑦
mean 3 3.6
𝑥 − ҧ𝑥
-2
-1
0
1
2
𝑦 − ത𝑦
-0.6
0.4
-1.6
0.40.4
1.4
𝑥 − ҧ𝑥 2
4
1
0
1
4
(𝑥 − ҧ𝑥)(𝑦 − ത𝑦)
-1
0
2
1.2
0.4
-1.6
0.4
1.4
-0.4
2.8
0.4
Independent Variable
Dependent Variable
X
Y
1 2 3 4 5 6
1
2
3
4
5
0 3
y = mx+c
(𝑥 − ҧ𝑥)(𝑦 − ത𝑦)
𝛴 = 10 𝛴 = 4
𝑥 − ҧ𝑥 2
y = mx+c
(𝑥 − ҧ𝑥)(𝑦 − ത𝑦)
𝑥 − ҧ𝑥 2
m =
𝛴
𝛴
Understanding Linear Regression Algorithm
Copyright © 2017, edureka and/or its affiliates. All rights reserved.Machine Learning Training with Python www.edureka.co/python
𝑥
1
2
3
4
5
3
4
2
4
5
𝑦
mean 3 3.6
𝑥 − ҧ𝑥
-2
-1
0
1
2
𝑦 − ത𝑦
-0.6
0.4
-1.6
0.40.4
1.4
𝑥 − ҧ𝑥 2
4
1
0
1
4
(𝑥 − ҧ𝑥)(𝑦 − ത𝑦)
-1
0
2
1.2
0.4
-1.6
0.4
1.4
-0.4
2.8
0.4
Independent Variable
Dependent Variable
X
Y
1 2 3 4 5 6
1
2
3
4
5
0 3
y = mx+c
𝛴 = 10 𝛴 = 4
=
4
10
(𝑥 − ҧ𝑥)(𝑦 − ത𝑦)
𝑥 − ҧ𝑥 2
m =
𝛴
𝛴
Understanding Linear Regression Algorithm
Copyright © 2017, edureka and/or its affiliates. All rights reserved.Machine Learning Training with Python www.edureka.co/python
𝑥
1
2
3
4
5
3
4
2
4
5
𝑦
mean 3 3.6
𝑥 − ҧ𝑥
-2
-1
0
1
2
𝑦 − ത𝑦
-0.6
0.4
-1.6
0.40.4
1.4
𝑥 − ҧ𝑥 2
4
1
0
1
4
(𝑥 − ҧ𝑥)(𝑦 − ത𝑦)
-1
0
2
1.2
0.4
-1.6
0.4
1.4
-0.4
2.8
0.4
Independent Variable
Dependent Variable
X
Y
1 2 3 4 5 6
1
2
3
4
5
0 3
y = mx+c
𝛴 = 10 𝛴 = 4
=
4
10
(𝑥 − ҧ𝑥)(𝑦 − ത𝑦)
𝑥 − ҧ𝑥 2
m =
𝛴
𝛴
Understanding Linear Regression Algorithm
Copyright © 2017, edureka and/or its affiliates. All rights reserved.Machine Learning Training with Python www.edureka.co/python
𝑥
1
2
3
4
5
3
4
2
4
5
𝑦
mean 3 3.6
𝑥 − ҧ𝑥
-2
-1
0
1
2
𝑦 − ത𝑦
-0.6
0.4
-1.6
0.40.4
1.4
𝑥 − ҧ𝑥 2
4
1
0
1
4
(𝑥 − ҧ𝑥)(𝑦 − ത𝑦)
-1
0
2
1.2
0.4
-1.6
0.4
1.4
-0.4
2.8
0.4
Independent Variable
Dependent Variable
X
Y
1 2 3 4 5 6
1
2
3
4
5
0 3
3.6
y = mx+c
𝛴 = 10 𝛴 = 4
=
4
10
(𝑥 − ҧ𝑥)(𝑦 − ത𝑦)
𝑥 − ҧ𝑥 2
m =
𝛴
𝛴
Understanding Linear Regression Algorithm
Copyright © 2017, edureka and/or its affiliates. All rights reserved.Machine Learning Training with Python www.edureka.co/python
𝑥
1
2
3
4
5
3
4
2
4
5
𝑦
mean 3 3.6
𝑥 − ҧ𝑥
-2
-1
0
1
2
𝑦 − ത𝑦
-0.6
0.4
-1.6
0.40.4
1.4
𝑥 − ҧ𝑥 2
4
1
0
1
4
(𝑥 − ҧ𝑥)(𝑦 − ത𝑦)
-1
0
2
1.2
0.4
-1.6
0.4
1.4
-0.4
2.8
0.4
Independent Variable
Dependent Variable
X
Y
1 2 3 4 5 6
1
2
3
4
5
0 3
y = mx+c
(𝑥 − ҧ𝑥)(𝑦 − ത𝑦)
𝛴 = 10 𝛴 = 4
𝑥 − ҧ𝑥 2
m = 𝛴 =
4
10
0.4
Understanding Linear Regression Algorithm
Copyright © 2017, edureka and/or its affiliates. All rights reserved.Machine Learning Training with Python www.edureka.co/python
𝑥
1
2
3
4
5
3
4
2
4
5
𝑦
mean 3 3.6
𝑥 − ҧ𝑥
-2
-1
0
1
2
𝑦 − ത𝑦
-0.6
0.4
-1.6
0.40.4
1.4
𝑥 − ҧ𝑥 2
4
1
0
1
4
(𝑥 − ҧ𝑥)(𝑦 − ത𝑦)
-1
0
2
1.2
0.4
-1.6
0.4
1.4
-0.4
2.8
0.4
Independent Variable
Dependent Variable
X
Y
1 2 3 4 5 6
1
2
3
4
5
0 3
y = mx+c
(𝑥 − ҧ𝑥)(𝑦 − ത𝑦)
𝛴 = 10 𝛴 = 4
𝑥 − ҧ𝑥 2
m = 𝛴 =
4
10
3
Understanding Linear Regression Algorithm
Copyright © 2017, edureka and/or its affiliates. All rights reserved.Machine Learning Training with Python www.edureka.co/python
𝑥
1
2
3
4
5
3
4
2
4
5
𝑦
mean 3 3.6
𝑥 − ҧ𝑥
-2
-1
0
1
2
𝑦 − ത𝑦
-0.6
0.4
-1.6
0.40.4
1.4
𝑥 − ҧ𝑥 2
4
1
0
1
4
(𝑥 − ҧ𝑥)(𝑦 − ത𝑦)
-1
0
2
1.2
0.4
-1.6
0.4
1.4
-0.4
2.8
0.4
Independent Variable
Dependent Variable
X
Y
1 2 3 4 5 6
1
2
3
4
5
0 3
y = mx+c
(𝑥 − ҧ𝑥)(𝑦 − ത𝑦)
𝛴 = 10 𝛴 = 4
𝑥 − ҧ𝑥 2
m = 𝛴 =
4
10
3.6=0.4x3 + c
Understanding Linear Regression Algorithm
Copyright © 2017, edureka and/or its affiliates. All rights reserved.Machine Learning Training with Python www.edureka.co/python
𝑥
1
2
3
4
5
3
4
2
4
5
𝑦
mean 3 3.6
𝑥 − ҧ𝑥
-2
-1
0
1
2
𝑦 − ത𝑦
-0.6
0.4
-1.6
0.40.4
1.4
𝑥 − ҧ𝑥 2
4
1
0
1
4
(𝑥 − ҧ𝑥)(𝑦 − ത𝑦)
-1
0
2
1.2
0.4
-1.6
0.4
1.4
-0.4
2.8
0.4
Independent Variable
Dependent Variable
X
Y
1 2 3 4 5 6
1
2
3
4
5
0 3
y = mx+c
(𝑥 − ҧ𝑥)(𝑦 − ത𝑦)
𝛴 = 10 𝛴 = 4
𝑥 − ҧ𝑥 2
m = 𝛴 =
4
10
3.6 = 1.2+ c
Understanding Linear Regression Algorithm
Copyright © 2017, edureka and/or its affiliates. All rights reserved.Machine Learning Training with Python www.edureka.co/python
𝑥
1
2
3
4
5
3
4
2
4
5
𝑦
mean 3 3.6
𝑥 − ҧ𝑥
-2
-1
0
1
2
𝑦 − ത𝑦
-0.6
0.4
-1.6
0.40.4
1.4
𝑥 − ҧ𝑥 2
4
1
0
1
4
(𝑥 − ҧ𝑥)(𝑦 − ത𝑦)
-1
0
2
1.2
0.4
-1.6
0.4
1.4
-0.4
2.8
0.4
Independent Variable
Dependent Variable
X
Y
1 2 3 4 5 6
1
2
3
4
5
0 3
y = mx+c
(𝑥 − ҧ𝑥)(𝑦 − ത𝑦)
𝛴 = 10 𝛴 = 4
𝑥 − ҧ𝑥 2
m = 𝛴 =
4
10
3.6 - 1.2 = c
Understanding Linear Regression Algorithm
Copyright © 2017, edureka and/or its affiliates. All rights reserved.Machine Learning Training with Python www.edureka.co/python
𝑥
1
2
3
4
5
3
4
2
4
5
𝑦
mean 3 3.6
𝑥 − ҧ𝑥
-2
-1
0
1
2
𝑦 − ത𝑦
-0.6
0.4
-1.6
0.40.4
1.4
𝑥 − ҧ𝑥 2
4
1
0
1
4
(𝑥 − ҧ𝑥)(𝑦 − ത𝑦)
-1
0
2
1.2
0.4
-1.6
0.4
1.4
-0.4
2.8
0.4
Independent Variable
Dependent Variable
X
Y
1 2 3 4 5 6
1
2
3
4
5
0 3
y = mx+c
(𝑥 − ҧ𝑥)(𝑦 − ത𝑦)
𝛴 = 10 𝛴 = 4
𝑥 − ҧ𝑥 2
m = 𝛴 =
4
10
c = 2.4
Understanding Linear Regression Algorithm
Copyright © 2017, edureka and/or its affiliates. All rights reserved.Machine Learning Training with Python www.edureka.co/python
𝑥
1
2
3
4
5
3
4
2
4
5
𝑦
mean 3 3.6
𝑥 − ҧ𝑥
-2
-1
0
1
2
𝑦 − ത𝑦
-0.6
0.4
-1.6
0.40.4
1.4
𝑥 − ҧ𝑥 2
4
1
0
1
4
(𝑥 − ҧ𝑥)(𝑦 − ത𝑦)
-1
0
2
1.2
0.4
-1.6
0.4
1.4
-0.4
2.8
0.4
Independent Variable
Dependent Variable
X
Y
1 2 3 4 5 6
1
2
3
4
5
0 3
y = mx+c
(𝑥 − ҧ𝑥)(𝑦 − ത𝑦)
𝛴 = 10 𝛴 = 4
𝑥 − ҧ𝑥 2
m = 𝛴 =
4
10
c = 2.4
m = 0.4
c = 2.4
y = 0.4x + 2.4
Understanding Linear Regression Algorithm
Copyright © 2017, edureka and/or its affiliates. All rights reserved.Machine Learning Training with Python www.edureka.co/python
Mean Square Error
Independent Variable
Dependent Variable
X
Y
1 2 4 5 60 3
1
2
3
4
5
m = 0.4
c = 2.4
y = 0.4x + 2.4
For given m = 0.4 & c = 2.4, lets
predict values for y for x = {1,2,3,4,5}
y = 0.4 x 1+ 2.4 = 2.8
y = 0.4 x 2+ 2.4= 3.2
y = 0.4 x 3+ 2.4= 3.6
y = 0.4 x 4+ 2.4= 4.0
y = 0.4 x 5+ 2.4= 4.4
Copyright © 2017, edureka and/or its affiliates. All rights reserved.Machine Learning Training with Python www.edureka.co/python
Mean Square Error
Independent Variable
Dependent Variable
X
Y
1 2 4 5 60 3
1
2
3
4
5
Regression Line
m = 0.4
c = 2.4
y = 0.4x + 2.4
For given m = 0.4 & c = 2.4, lets
predict values for y for x = {1,2,3,4,5}
y = 0.4 x 1+ 2.4 = 2.8
y = 0.4 x 2+ 2.4= 3.2
y = 0.4 x 3+ 2.4= 3.6
y = 0.4 x 4+ 2.4= 4.0
y = 0.4 x 5+ 2.4= 4.4
Copyright © 2017, edureka and/or its affiliates. All rights reserved.Machine Learning Training with Python www.edureka.co/python
Mean Square Error
Independent Variable
Dependent Variable
X
Y
1 2 4 5 60 3
1
2
3
4
5
Regression Line
m = 0.4
c = 2.4
y = 0.4x + 2.4
For given m = 0.4 & c = 2.4, lets
predict values for y for x = {1,2,3,4,5}
y = 0.4 x 1+ 2.4 = 2.8
y = 0.4 x 2+ 2.4= 3.2
y = 0.4 x 3+ 2.4= 3.6
y = 0.4 x 4+ 2.4= 4.0
y = 0.4 x 5+ 2.4= 4.4
Copyright © 2017, edureka and/or its affiliates. All rights reserved.Machine Learning Training with Python www.edureka.co/python
Mean Square Error
Independent Variable
Dependent Variable
X
Y
1 2 4 5 60 3
1
2
3
4
5
Regression Line Distance between actual
& predicted value
error
error
error
error
Copyright © 2017, edureka and/or its affiliates. All rights reserved.
Finding the
best fit line
Machine Learning Training with Python www.edureka.co/python
Copyright © 2017, edureka and/or its affiliates. All rights reserved.
Let’s check the Goodness of fit
Copyright © 2017, edureka and/or its affiliates. All rights reserved.
What is R-
Square?
Machine Learning Training with Python www.edureka.co/python
▪ R-squared value is a statistical measure of how close
the data are to the fitted regression line
▪ It is also known as coefficient of determination, or the
coefficient of multiple determination
Copyright © 2017, edureka and/or its affiliates. All rights reserved.
Independent Variable
Dependent Variable
X
Y
Machine Learning Training with Python www.edureka.co/python
Calculation of 𝑹 𝟐
1 2 4 5 60 3
1
2
3
4
5
Actual Value
1
2
3
4
5
3
4
2
4
5
𝑥 𝑦
mean
Copyright © 2017, edureka and/or its affiliates. All rights reserved.
Independent Variable
Dependent Variable
X
Y
Machine Learning Training with Python www.edureka.co/python
1 2 4 5 60 3
1
2
3
4
5
Actual Value
mean
distance = actual - mean
Calculation of 𝑹 𝟐
Copyright © 2017, edureka and/or its affiliates. All rights reserved.
Independent Variable
Dependent Variable
X
Y
Machine Learning Training with Python www.edureka.co/python
1 2 4 5 60 3
1
2
3
4
5
Predicted Value
1
2
3
4
5
2.8
𝑥
3.2
4.0
3.6
4.4
Regression line
Calculation of 𝑹 𝟐
𝑦𝑝
Copyright © 2017, edureka and/or its affiliates. All rights reserved.
Independent Variable
Dependent Variable
X
Y
Machine Learning Training with Python www.edureka.co/python
Distance actual - mean
Distance predicted - mean
1 2 4 5 60 3
1
2
3
4
5
Actual vs Predicted Value
Regression line
vs
This is nothing but 𝑅2
=
Calculation of 𝑹 𝟐
𝑦 − ത𝑦 2
𝑦𝑝 − ത𝑦
2
𝛴
𝛴
Copyright © 2017, edureka and/or its affiliates. All rights reserved.Machine Learning Training with Python www.edureka.co/python
Calculation of 𝑹 𝟐
𝑥
1
2
3
4
5
3
4
2
4
5
𝑦
– 0.6
0.4
–1.6
0.4
1.4
𝑦 − ത𝑦
mean y
3.6
𝑅2
=
𝑦 − ത𝑦 2
𝑦𝑝 − ത𝑦
2
𝛴
𝛴
Copyright © 2017, edureka and/or its affiliates. All rights reserved.Machine Learning Training with Python www.edureka.co/python
Calculation of 𝑹 𝟐
𝑥
1
2
3
4
5
3
4
2
4
5
𝑦
– 0.6
0.4
–1.6
0.4
1.4
𝑦 − ത𝑦
mean y
3.6
𝑦𝑝𝑦 − ത𝑦 2
3.6
1.6
2.56
1.6
1.96
2.8
3.2
3.6
4.0
4.4
(𝑦𝑝 − ത𝑦)
-0.8
-0.4
0
0.4
0.8
𝑅2
=
𝑦 − ത𝑦 2
𝑦𝑝 − ത𝑦
2
𝛴
𝛴
Copyright © 2017, edureka and/or its affiliates. All rights reserved.Machine Learning Training with Python www.edureka.co/python
Calculation of 𝑹 𝟐
𝑥
1
2
3
4
5
3
4
2
4
5
𝑦
– 0.6
0.4
–1.6
0.4
1.4
𝑦 − ത𝑦
mean y 3.6
𝑦𝑝𝑦 − ത𝑦 2
3.6
1.6
2.56
1.6
1.96
2.8
3.2
3.6
4.0
4.4
(𝑦𝑝 − ത𝑦)
-0.8
-0.4
0
0.4
0.8
𝑦𝑝 − ത𝑦
2
6.4
1.6
0
1.6
6.4
𝑅2
=
𝑦 − ത𝑦 2
𝑦𝑝 − ത𝑦
2
𝛴
𝛴
Copyright © 2017, edureka and/or its affiliates. All rights reserved.Machine Learning Training with Python www.edureka.co/python
Calculation of 𝑹 𝟐
𝑥
1
2
3
4
5
3
4
2
4
5
𝑦
– 0.6
0.4
–1.6
0.4
1.4
𝑦 − ത𝑦
mean y 3.6
𝑦𝑝𝑦 − ത𝑦 2
0.36
0.16
2.56
0.16
1.96
2.8
3.2
3.6
4.0
4.4
(𝑦𝑝 − ത𝑦)
-0.8
-0.4
0
0.4
0.8
0.64
0.16
0
0.16
0.64
1.6
𝑅2
=
𝑦 − ത𝑦 2
(𝑦𝑝 −ത𝑦
2
𝛴
𝛴
5.2𝛴 𝛴
1.6
5.2
=
𝑦𝑝 − ത𝑦
2
Copyright © 2017, edureka and/or its affiliates. All rights reserved.Machine Learning Training with Python www.edureka.co/python
Calculation of 𝑹 𝟐
𝑥
1
2
3
4
5
3
4
2
4
5
𝑦
– 0.6
0.4
–1.6
0.4
1.4
𝑦 − ത𝑦
mean y 3.6
𝑦𝑝𝑦 − ത𝑦 2
3.6
1.6
2.56
1.6
1.96
2.8
3.2
3.6
4.0
4.4
(𝑦𝑝 − ത𝑦)
-0.8
-0.4
0
0.4
0.8
(𝑦𝑝 −−ത𝑦
2
6.4
1.6
0
1.6
6.4
11.32 16
𝑅2
≈ 0.3
Copyright © 2017, edureka and/or its affiliates. All rights reserved.Machine Learning Training with Python www.edureka.co/python
Calculation of 𝑹 𝟐
𝑅2
≈ 0.3
Independent Variable
Dependent Variable
X
Y
1 2 4 5 60 3
1
2
3
4
5
Actual vs Predicted Value
Copyright © 2017, edureka and/or its affiliates. All rights reserved.Machine Learning Training with Python www.edureka.co/python
Calculation of 𝑹 𝟐
𝑅2
≈ 0.7
Independent Variable
Dependent Variable
X
Y
1 2 4 5 60 3
1
2
3
4
5
Actual vs Predicted Value
Copyright © 2017, edureka and/or its affiliates. All rights reserved.Machine Learning Training with Python www.edureka.co/python
Calculation of 𝑹 𝟐
𝑅2
≈ 0.9
Independent Variable
Dependent Variable
X
Y
1 2 4 5 60 3
1
2
3
4
5
Actual vs Predicted Value
Copyright © 2017, edureka and/or its affiliates. All rights reserved.Machine Learning Training with Python www.edureka.co/python
Calculation of 𝑹 𝟐
𝑅2
≈ 1
Independent Variable
Dependent Variable
X
Y
1 2 4 5 60 3
1
2
3
4
5
Actual vs Predicted Value
Copyright © 2017, edureka and/or its affiliates. All rights reserved.Machine Learning Training with Python www.edureka.co/python
Calculation of 𝑹 𝟐
𝑅2
≈ 0.02
Independent Variable
Dependent Variable
X
Y
1 2 4 5 60 3
1
2
3
4
5
Actual vs Predicted Value
Copyright © 2017, edureka and/or its affiliates. All rights reserved.
Are
Low R-squared
values always
Machine Learning Training with Python www.edureka.co/python
Copyright © 2017, edureka and/or its affiliates. All rights reserved.
Are
High R-squared
values always
Machine Learning Training with Python www.edureka.co/python
Copyright © 2017, edureka and/or its affiliates. All rights reserved.
DEMO
Let’s learn to
code
© MadeByShape Ltd. Web Design Manchester 2018. All Rights Reserved.
Copyright © 2017, edureka and/or its affiliates. All rights reserved.

More Related Content

PDF
Machine Learning Course | Edureka
PPTX
Linear Regression Analysis | Linear Regression in Python | Machine Learning A...
PPTX
Machine Learning-Linear regression
PPTX
Introduction to Machine Learning
PPTX
Machine Learning
PPTX
Tamilnadu and West Bengal.pptx
PDF
Coding for kids
PDF
Data Science Full Course | Edureka
Machine Learning Course | Edureka
Linear Regression Analysis | Linear Regression in Python | Machine Learning A...
Machine Learning-Linear regression
Introduction to Machine Learning
Machine Learning
Tamilnadu and West Bengal.pptx
Coding for kids
Data Science Full Course | Edureka

What's hot (20)

PDF
An introduction to Machine Learning
ODP
Machine Learning with Decision trees
PPTX
Machine Learning With Python | Machine Learning Algorithms | Machine Learning...
PPTX
Machine Learning
PDF
Classification Based Machine Learning Algorithms
PPTX
Linear regression with gradient descent
PPTX
Machine learning ppt
PPTX
Classification and Regression
PPTX
supervised learning
PDF
Supervised vs Unsupervised vs Reinforcement Learning | Edureka
PPTX
Basics of Soft Computing
PPTX
Presentation on supervised learning
PPTX
Machine Learning
PDF
Linear regression
PDF
Naive Bayes
PPTX
Logistic regression
PPTX
Ensemble learning
PPTX
Introduction to ML (Machine Learning)
ODP
Machine Learning With Logistic Regression
An introduction to Machine Learning
Machine Learning with Decision trees
Machine Learning With Python | Machine Learning Algorithms | Machine Learning...
Machine Learning
Classification Based Machine Learning Algorithms
Linear regression with gradient descent
Machine learning ppt
Classification and Regression
supervised learning
Supervised vs Unsupervised vs Reinforcement Learning | Edureka
Basics of Soft Computing
Presentation on supervised learning
Machine Learning
Linear regression
Naive Bayes
Logistic regression
Ensemble learning
Introduction to ML (Machine Learning)
Machine Learning With Logistic Regression
Ad

Similar to Linear Regression Algorithm | Linear Regression in Python | Machine Learning Algorithm | Edureka (20)

PDF
AI vs Machine Learning vs Deep Learning | Machine Learning Training with Pyth...
PDF
Machine Learning In Python | Python Machine Learning Tutorial | Deep Learning...
PPTX
Deep Learning with MXNet
PDF
Scikit Learn Tutorial | Machine Learning with Python | Python for Data Scienc...
PDF
Python for Data Science | Python Data Science Tutorial | Data Science Certifi...
PDF
The Future of AI on AWS
PDF
Python For Deep Learning - I | Python Basics | Python Tutorial | Python Train...
PPTX
ML for DS.pptx
PPTX
Supervised Machine Learning
PDF
Building Applications with Apache MXNet
PDF
Linear Regression With R
PDF
Time Series In R | Time Series Forecasting | Time Series Analysis | Data Scie...
PDF
IRJET - House Price Prediction using Machine Learning and RPA
PPTX
Sagemaker Automatic model tuning
PDF
RoboMaker로 DeepRacer 자율 주행차 만들기 :: 유정열 - AWS Community Day 2019
PPTX
An Introduction to Reinforcement Learning (December 2018)
PPTX
updated stock market ppt.pptx stock market presentation
DOC
shailesh_resume
PPTX
World models v0.14
PDF
Text Mining Using R
AI vs Machine Learning vs Deep Learning | Machine Learning Training with Pyth...
Machine Learning In Python | Python Machine Learning Tutorial | Deep Learning...
Deep Learning with MXNet
Scikit Learn Tutorial | Machine Learning with Python | Python for Data Scienc...
Python for Data Science | Python Data Science Tutorial | Data Science Certifi...
The Future of AI on AWS
Python For Deep Learning - I | Python Basics | Python Tutorial | Python Train...
ML for DS.pptx
Supervised Machine Learning
Building Applications with Apache MXNet
Linear Regression With R
Time Series In R | Time Series Forecasting | Time Series Analysis | Data Scie...
IRJET - House Price Prediction using Machine Learning and RPA
Sagemaker Automatic model tuning
RoboMaker로 DeepRacer 자율 주행차 만들기 :: 유정열 - AWS Community Day 2019
An Introduction to Reinforcement Learning (December 2018)
updated stock market ppt.pptx stock market presentation
shailesh_resume
World models v0.14
Text Mining Using R
Ad

More from Edureka! (20)

PDF
What to learn during the 21 days Lockdown | Edureka
PDF
Top 10 Dying Programming Languages in 2020 | Edureka
PDF
Top 5 Trending Business Intelligence Tools | Edureka
PDF
Tableau Tutorial for Data Science | Edureka
PDF
Python Programming Tutorial | Edureka
PDF
Top 5 PMP Certifications | Edureka
PDF
Top Maven Interview Questions in 2020 | Edureka
PDF
Linux Mint Tutorial | Edureka
PDF
How to Deploy Java Web App in AWS| Edureka
PDF
Importance of Digital Marketing | Edureka
PDF
RPA in 2020 | Edureka
PDF
Email Notifications in Jenkins | Edureka
PDF
EA Algorithm in Machine Learning | Edureka
PDF
Cognitive AI Tutorial | Edureka
PDF
AWS Cloud Practitioner Tutorial | Edureka
PDF
Blue Prism Top Interview Questions | Edureka
PDF
Big Data on AWS Tutorial | Edureka
PDF
A star algorithm | A* Algorithm in Artificial Intelligence | Edureka
PDF
Kubernetes Installation on Ubuntu | Edureka
PDF
Introduction to DevOps | Edureka
What to learn during the 21 days Lockdown | Edureka
Top 10 Dying Programming Languages in 2020 | Edureka
Top 5 Trending Business Intelligence Tools | Edureka
Tableau Tutorial for Data Science | Edureka
Python Programming Tutorial | Edureka
Top 5 PMP Certifications | Edureka
Top Maven Interview Questions in 2020 | Edureka
Linux Mint Tutorial | Edureka
How to Deploy Java Web App in AWS| Edureka
Importance of Digital Marketing | Edureka
RPA in 2020 | Edureka
Email Notifications in Jenkins | Edureka
EA Algorithm in Machine Learning | Edureka
Cognitive AI Tutorial | Edureka
AWS Cloud Practitioner Tutorial | Edureka
Blue Prism Top Interview Questions | Edureka
Big Data on AWS Tutorial | Edureka
A star algorithm | A* Algorithm in Artificial Intelligence | Edureka
Kubernetes Installation on Ubuntu | Edureka
Introduction to DevOps | Edureka

Recently uploaded (20)

PDF
DASA ADMISSION 2024_FirstRound_FirstRank_LastRank.pdf
PPT
What is a Computer? Input Devices /output devices
PDF
Assigned Numbers - 2025 - Bluetooth® Document
PDF
Hindi spoken digit analysis for native and non-native speakers
PDF
Taming the Chaos: How to Turn Unstructured Data into Decisions
PDF
A review of recent deep learning applications in wood surface defect identifi...
PDF
NewMind AI Weekly Chronicles – August ’25 Week III
PDF
Zenith AI: Advanced Artificial Intelligence
PDF
How ambidextrous entrepreneurial leaders react to the artificial intelligence...
PDF
Microsoft Solutions Partner Drive Digital Transformation with D365.pdf
PPTX
observCloud-Native Containerability and monitoring.pptx
PPTX
Benefits of Physical activity for teenagers.pptx
PDF
Architecture types and enterprise applications.pdf
PPTX
Modernising the Digital Integration Hub
PPTX
Chapter 5: Probability Theory and Statistics
PDF
Developing a website for English-speaking practice to English as a foreign la...
PDF
August Patch Tuesday
PDF
Univ-Connecticut-ChatGPT-Presentaion.pdf
PDF
sustainability-14-14877-v2.pddhzftheheeeee
PDF
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
DASA ADMISSION 2024_FirstRound_FirstRank_LastRank.pdf
What is a Computer? Input Devices /output devices
Assigned Numbers - 2025 - Bluetooth® Document
Hindi spoken digit analysis for native and non-native speakers
Taming the Chaos: How to Turn Unstructured Data into Decisions
A review of recent deep learning applications in wood surface defect identifi...
NewMind AI Weekly Chronicles – August ’25 Week III
Zenith AI: Advanced Artificial Intelligence
How ambidextrous entrepreneurial leaders react to the artificial intelligence...
Microsoft Solutions Partner Drive Digital Transformation with D365.pdf
observCloud-Native Containerability and monitoring.pptx
Benefits of Physical activity for teenagers.pptx
Architecture types and enterprise applications.pdf
Modernising the Digital Integration Hub
Chapter 5: Probability Theory and Statistics
Developing a website for English-speaking practice to English as a foreign la...
August Patch Tuesday
Univ-Connecticut-ChatGPT-Presentaion.pdf
sustainability-14-14877-v2.pddhzftheheeeee
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...

Linear Regression Algorithm | Linear Regression in Python | Machine Learning Algorithm | Edureka

  • 1. Copyright © 2017, edureka and/or its affiliates. All rights reserved.
  • 2. Copyright © 2017, edureka and/or its affiliates. All rights reserved. Linear Regression Algorithm
  • 3. Copyright © 2017, edureka and/or its affiliates. All rights reserved. ▪ What is Regression? ▪ Regression Use-case ▪ Types of Regression – Linear vs Logistic Regression ▪ What is Linear Regression? ▪ Finding best fit regression line using Least Square Method ▪ Checking goodness of fit using R squared Method ▪ Implementation of Linear Regression using Python ▪ Linear Regression Algorithm using Python from scratch ▪ Linear Regression Algorithm using Python (scikit lib) Agenda for Today’s Session
  • 4. “Regression analysis is a form of predictive modelling technique which investigates the relationship between a dependent and independent variable”What is Regression?
  • 5. Uses of Regression Three major uses for regression analysis are ▪ Determining the strength of predictors ▪ Forecasting an effect, and ▪ Trend forecasting
  • 6. Linear vs Logistic Regression Basis Linear Regression Logistic Regression Core Concept The data is modelled using a straight line The probability of some obtained event is represented as a linear function of a combination of predictor variables. Used with Continuous Variable Categorical Variable Output/Prediction Value of the variable Probability of occurrence of event Accuracy and Goodness of fit measured by loss, R squared, Adjusted R squared etc. Accuracy, Precision, Recall, F1 score, ROC curve, Confusion Matrix, etc
  • 7. “Linear Regression is a method to predict dependent variable (Y) based on values of independent variables (X). It can be used for the cases where we want to predict some continuous quantity.” What is Linear Regression?
  • 8. ▪ Classification and Regression Capabilities ▪ Data Quality ▪ Computational Complexity ▪ Comprehensible and Transparent Linear Regression Selection Criteria
  • 9. ▪ Evaluating Trends and Sales Estimates ▪ Analyzing the Impact of Price Changes ▪ Assessment of risk in financial services and insurance domain Where is Linear Regression used?
  • 10. Independent Variable DependentVariable X Y Machine Learning Training with Python www.edureka.co/python Line Understanding Linear Regression Algorithm
  • 17. Understanding Linear Regression Algorithm Independent Variable DependentVariable X Y Machine Learning Training with Python www.edureka.co/python Regression Line Least Square Method
  • 18. Understanding Linear Regression Algorithm Independent Variable DependentVariable X Y Machine Learning Training with Python www.edureka.co/python Estimated Value Actual Value error error
  • 21. Understanding Linear Regression Algorithm Speed Distance X Y Machine Learning Training with Python www.edureka.co/python y = mx + c Distance travelled in a fixed duration of time
  • 22. Understanding Linear Regression Algorithm Speed Distance X Y Machine Learning Training with Python www.edureka.co/python y = mx + c Speed of Vehicle
  • 23. Understanding Linear Regression Algorithm Speed Distance X Y Machine Learning Training with Python www.edureka.co/python y = mx + c m = +ve slope of line
  • 24. Understanding Linear Regression Algorithm Speed Distance X Y Machine Learning Training with Python www.edureka.co/python y = mx + c c = y - intercept of the line
  • 26. Understanding Linear Regression Algorithm Speed Time X Y Machine Learning Training with Python www.edureka.co/python y = - mx + c Time taken to travel a fixed distance
  • 27. Understanding Linear Regression Algorithm Speed Time X Y Machine Learning Training with Python www.edureka.co/python y = - mx + c Speed of Vehicle
  • 28. Understanding Linear Regression Algorithm Speed Time X Y Machine Learning Training with Python www.edureka.co/python y = - mx + c m = -ve slope of line
  • 29. Understanding Linear Regression Algorithm Speed Time X Y Machine Learning Training with Python www.edureka.co/python y = - mx + c c = y - intercept of the line
  • 30. Understanding Linear Regression Algorithm Independent Variable DependentVariable X Y Machine Learning Training with Python www.edureka.co/python y = mx + c Independent Variable Dependent Variable
  • 31. Copyright © 2017, edureka and/or its affiliates. All rights reserved. Independent Variable Dependent Variable X Y Machine Learning Training with Python www.edureka.co/python 1 2 3 4 5 6 1 2 3 4 5 0 𝑥 1 2 3 4 5 3 4 2 4 5 𝑦 Understanding Linear Regression Algorithm
  • 32. Copyright © 2017, edureka and/or its affiliates. All rights reserved. Independent Variable Dependent Variable X Y Machine Learning Training with Python www.edureka.co/python 1 2 3 4 5 6 1 2 3 4 5 0 𝑥 1 2 3 4 5 3 4 2 4 5 𝑦 Understanding Linear Regression Algorithm
  • 33. Copyright © 2017, edureka and/or its affiliates. All rights reserved. Independent Variable Dependent Variable X Y Machine Learning Training with Python www.edureka.co/python 1 2 3 4 5 6 1 2 3 4 5 0 𝑥 1 2 3 4 5 3 4 2 4 5 𝑦 Understanding Linear Regression Algorithm
  • 34. Copyright © 2017, edureka and/or its affiliates. All rights reserved. Independent Variable Dependent Variable X Y Machine Learning Training with Python www.edureka.co/python 1 2 3 4 5 6 1 2 3 4 5 0 𝑥 1 2 3 4 5 3 4 2 4 5 𝑦 mean: ഥ𝒙 3 18/53.6 mean : ഥ𝒚 Understanding Linear Regression Algorithm
  • 35. Copyright © 2017, edureka and/or its affiliates. All rights reserved. Independent Variable Dependent Variable X Y Machine Learning Training with Python www.edureka.co/python 1 2 3 4 5 6 1 2 3 4 5 0 𝑥 1 2 3 4 5 3 4 2 4 5 𝑦 mean 3 3.6( , ) Understanding Linear Regression Algorithm
  • 36. Copyright © 2017, edureka and/or its affiliates. All rights reserved. Independent Variable Dependent Variable X Y Machine Learning Training with Python www.edureka.co/python 1 2 3 4 5 6 1 2 3 4 5 0 𝑥 1 2 3 4 5 3 4 2 4 5 𝑦 mean3( , )3.6 Understanding Linear Regression Algorithm
  • 37. Copyright © 2017, edureka and/or its affiliates. All rights reserved. Independent Variable Dependent Variable X Y Machine Learning Training with Python www.edureka.co/python 1 2 3 4 5 6 1 2 3 4 5 0 𝑥 1 2 3 4 5 3 4 2 4 5 𝑦 mean 3 3.6 y = mx+c (𝑥 − ҧ𝑥)(𝑦 − ത𝑦) 𝑥 − ҧ𝑥 2 m = 𝛴 𝛴 Understanding Linear Regression Algorithm
  • 38. Copyright © 2017, edureka and/or its affiliates. All rights reserved. Independent Variable Dependent Variable X Y Machine Learning Training with Python www.edureka.co/python 1 2 3 4 5 6 1 2 3 4 5 0 𝑥 1 2 3 4 5 3 4 2 4 5 𝑦 mean 3 3.6 𝑥 − ҧ𝑥 1 Understanding Linear Regression Algorithm
  • 39. Copyright © 2017, edureka and/or its affiliates. All rights reserved. Independent Variable Dependent Variable X Y Machine Learning Training with Python www.edureka.co/python 1 2 3 4 5 6 1 2 3 4 5 0 𝑥 1 2 3 4 5 3 4 2 4 5 𝑦 mean 3 3.6 𝑥 − ҧ𝑥 1 3 − Understanding Linear Regression Algorithm
  • 40. Copyright © 2017, edureka and/or its affiliates. All rights reserved. Independent Variable Dependent Variable X Y Machine Learning Training with Python www.edureka.co/python 1 2 3 4 5 6 1 2 3 4 5 0 𝑥 1 2 3 4 5 3 4 2 4 5 𝑦 mean 3 3.6 𝑥 − ҧ𝑥 1 3 − 3 Understanding Linear Regression Algorithm
  • 41. Copyright © 2017, edureka and/or its affiliates. All rights reserved. Independent Variable Dependent Variable X Y Machine Learning Training with Python www.edureka.co/python 1 2 3 4 5 6 1 2 3 4 5 0 𝑥 1 2 3 4 5 3 4 2 4 5 𝑦 mean 3 3.6 𝑥 − ҧ𝑥 1 3 −3 Understanding Linear Regression Algorithm
  • 42. Copyright © 2017, edureka and/or its affiliates. All rights reserved. Independent Variable Dependent Variable X Y Machine Learning Training with Python www.edureka.co/python 1 2 3 4 5 6 1 2 3 4 5 0 𝑥 1 2 3 4 5 3 4 2 4 5 𝑦 mean 3 3.6 𝑥 − ҧ𝑥 3 -2 2 Understanding Linear Regression Algorithm
  • 43. Copyright © 2017, edureka and/or its affiliates. All rights reserved. Independent Variable Dependent Variable X Y Machine Learning Training with Python www.edureka.co/python 1 2 3 4 5 6 1 2 3 4 5 0 𝑥 1 2 3 4 5 3 4 2 4 5 𝑦 mean 3 3.6 𝑥 − ҧ𝑥 3 -2 2 − Understanding Linear Regression Algorithm
  • 44. Copyright © 2017, edureka and/or its affiliates. All rights reserved. Independent Variable Dependent Variable X Y Machine Learning Training with Python www.edureka.co/python 1 2 3 4 5 6 1 2 3 4 5 0 𝑥 1 2 3 4 5 3 4 2 4 5 𝑦 mean 3 3.6 𝑥 − ҧ𝑥 3 -2 2 − 3 Understanding Linear Regression Algorithm
  • 45. Copyright © 2017, edureka and/or its affiliates. All rights reserved. Independent Variable Dependent Variable X Y Machine Learning Training with Python www.edureka.co/python 1 2 3 4 5 6 1 2 3 4 5 0 𝑥 1 2 3 4 5 3 4 2 4 5 𝑦 mean 3 3.6 𝑥 − ҧ𝑥 3 -2 2 − 3 Understanding Linear Regression Algorithm
  • 46. Copyright © 2017, edureka and/or its affiliates. All rights reserved. Independent Variable Dependent Variable X Y Machine Learning Training with Python www.edureka.co/python 1 2 3 4 5 6 1 2 3 4 5 0 𝑥 1 2 3 4 5 3 4 2 4 5 𝑦 mean 3 3.6 𝑥 − ҧ𝑥 3 -2 -1 Understanding Linear Regression Algorithm
  • 47. Copyright © 2017, edureka and/or its affiliates. All rights reserved. Independent Variable Dependent Variable X Y Machine Learning Training with Python www.edureka.co/python 1 2 3 4 5 6 1 2 3 4 5 0 𝑥 1 2 3 4 5 3 4 2 4 5 𝑦 mean 3 3.6 𝑥 − ҧ𝑥 3 -2 -1 0 Understanding Linear Regression Algorithm
  • 48. Copyright © 2017, edureka and/or its affiliates. All rights reserved. Independent Variable Dependent Variable X Y Machine Learning Training with Python www.edureka.co/python 1 2 3 4 5 6 1 2 3 4 5 0 𝑥 1 2 3 4 5 3 4 2 4 5 𝑦 mean 3 3.6 𝑥 − ҧ𝑥 3 -2 -1 0 1 Understanding Linear Regression Algorithm
  • 49. Copyright © 2017, edureka and/or its affiliates. All rights reserved. Independent Variable Dependent Variable X Y Machine Learning Training with Python www.edureka.co/python 1 2 3 4 5 6 1 2 3 4 5 0 𝑥 1 2 3 4 5 3 4 2 4 5 𝑦 mean 3 3.6 𝑥 − ҧ𝑥 3 -2 -1 0 1 2 y = mx+c (𝑥 − ҧ𝑥)(𝑦 − ത𝑦) 𝑥 − ҧ𝑥 2 m = 𝛴 𝛴 Understanding Linear Regression Algorithm
  • 50. Copyright © 2017, edureka and/or its affiliates. All rights reserved. Independent Variable Dependent Variable X Y Machine Learning Training with Python www.edureka.co/python 1 2 3 4 5 6 1 2 3 4 5 0 𝑥 1 2 3 4 5 3 4 2 4 5 𝑦 mean 3 3.6 𝑥 − ҧ𝑥 3 -2 -1 0 1 2 𝑦 − ത𝑦 3 Understanding Linear Regression Algorithm
  • 51. Copyright © 2017, edureka and/or its affiliates. All rights reserved. Independent Variable Dependent Variable X Y Machine Learning Training with Python www.edureka.co/python 1 2 3 4 5 6 1 2 3 4 5 0 𝑥 1 2 3 4 5 3 4 2 4 5 𝑦 mean 3 3.6 𝑥 − ҧ𝑥 3 -2 -1 0 1 2 𝑦 − ത𝑦 3.6 3 − Understanding Linear Regression Algorithm
  • 52. Copyright © 2017, edureka and/or its affiliates. All rights reserved. Independent Variable Dependent Variable X Y Machine Learning Training with Python www.edureka.co/python 1 2 3 4 5 6 1 2 3 4 5 0 𝑥 1 2 3 4 5 3 4 2 4 5 𝑦 mean 3 3.6 𝑥 − ҧ𝑥 3 -2 -1 0 1 2 𝑦 − ത𝑦 3.6 3 − Understanding Linear Regression Algorithm
  • 53. Copyright © 2017, edureka and/or its affiliates. All rights reserved. Independent Variable Dependent Variable X Y Machine Learning Training with Python www.edureka.co/python 1 2 3 4 5 6 1 2 3 4 5 0 𝑥 1 2 3 4 5 3 4 2 4 5 𝑦 mean 3 3.6 𝑥 − ҧ𝑥 3 -2 -1 0 1 2 𝑦 − ത𝑦 3 −3.6 Understanding Linear Regression Algorithm
  • 54. Copyright © 2017, edureka and/or its affiliates. All rights reserved. Independent Variable Dependent Variable X Y Machine Learning Training with Python www.edureka.co/python 1 2 3 4 5 6 1 2 3 4 5 0 𝑥 1 2 3 4 5 3 4 2 4 5 𝑦 mean 3 3.6 𝑥 − ҧ𝑥 3 -2 -1 0 1 2 𝑦 − ത𝑦 -0.6 Understanding Linear Regression Algorithm
  • 55. Copyright © 2017, edureka and/or its affiliates. All rights reserved. Independent Variable Dependent Variable X Y Machine Learning Training with Python www.edureka.co/python 1 2 3 4 5 6 1 2 3 4 5 0 𝑥 1 2 3 4 5 3 4 2 4 5 𝑦 mean 3 3.6 𝑥 − ҧ𝑥 3 -2 -1 0 1 2 𝑦 − ത𝑦 -0.6 0.4 Understanding Linear Regression Algorithm
  • 56. Copyright © 2017, edureka and/or its affiliates. All rights reserved. Independent Variable Dependent Variable X Y Machine Learning Training with Python www.edureka.co/python 1 2 3 4 5 6 1 2 3 4 5 0 𝑥 1 2 3 4 5 3 4 2 4 5 𝑦 mean 3 3.6 𝑥 − ҧ𝑥 3 -2 -1 0 1 2 𝑦 − ത𝑦 -0.6 0.4 -1.6 Understanding Linear Regression Algorithm
  • 57. Copyright © 2017, edureka and/or its affiliates. All rights reserved. Independent Variable Dependent Variable X Y Machine Learning Training with Python www.edureka.co/python 1 2 3 4 5 6 1 2 3 4 5 0 𝑥 1 2 3 4 5 3 4 2 4 5 𝑦 mean 3 3.6 𝑥 − ҧ𝑥 3 -2 -1 0 1 2 𝑦 − ത𝑦 -0.6 0.4 -1.6 0.4 Understanding Linear Regression Algorithm
  • 58. Copyright © 2017, edureka and/or its affiliates. All rights reserved. Independent Variable Dependent Variable X Y Machine Learning Training with Python www.edureka.co/python 1 2 3 4 5 6 1 2 3 4 5 0 𝑥 1 2 3 4 5 3 4 2 4 5 𝑦 mean 3 3.6 𝑥 − ҧ𝑥 3 -2 -1 0 1 2 𝑦 − ത𝑦 -0.6 0.4 -1.6 0.4 1.4 y = mx+c (𝑥 − ҧ𝑥)(𝑦 − ത𝑦) 𝑥 − ҧ𝑥 2 m = 𝛴 𝛴 Understanding Linear Regression Algorithm
  • 59. Copyright © 2017, edureka and/or its affiliates. All rights reserved. Independent Variable Dependent Variable X Y Machine Learning Training with Python www.edureka.co/python 1 2 3 4 5 6 1 2 3 4 5 0 𝑥 1 2 3 4 5 3 4 2 4 5 𝑦 mean 3 3.6 𝑥 − ҧ𝑥 3 -2 -1 0 1 2 𝑦 − ത𝑦 -0.6 0.4 -1.6 0.40.4 1.4 𝑥 − ҧ𝑥 2 -2y = mx+c (𝑥 − ҧ𝑥)(𝑦 − ത𝑦) 𝑥 − ҧ𝑥 2 where m = 𝛴 Understanding Linear Regression Algorithm
  • 60. Copyright © 2017, edureka and/or its affiliates. All rights reserved. Independent Variable Machine Learning Training with Python www.edureka.co/python 𝑥 1 2 3 4 5 3 4 2 4 5 𝑦 mean 3 3.6 𝑥 − ҧ𝑥 -2 -1 0 1 2 𝑦 − ത𝑦 -0.6 0.4 -1.6 0.40.4 1.4 𝑥 − ҧ𝑥 2 4 -1 Independent Variable Dependent Variable X Y 1 2 3 4 5 6 1 2 3 4 5 0 3 y = mx+c Understanding Linear Regression Algorithm
  • 61. Copyright © 2017, edureka and/or its affiliates. All rights reserved. Independent Variable Machine Learning Training with Python www.edureka.co/python 𝑥 1 2 3 4 5 3 4 2 4 5 𝑦 mean 3 3.6 𝑥 − ҧ𝑥 -2 -1 0 1 2 𝑦 − ത𝑦 -0.6 0.4 -1.6 0.40.4 1.4 𝑥 − ҧ𝑥 2 4 1 0 Independent Variable Dependent Variable X Y 1 2 3 4 5 6 1 2 3 4 5 0 3 y = mx+c Understanding Linear Regression Algorithm
  • 62. Copyright © 2017, edureka and/or its affiliates. All rights reserved.Machine Learning Training with Python www.edureka.co/python 𝑥 1 2 3 4 5 3 4 2 4 5 𝑦 mean 3 3.6 𝑥 − ҧ𝑥 -2 -1 0 1 2 𝑦 − ത𝑦 -0.6 0.4 -1.6 0.40.4 1.4 𝑥 − ҧ𝑥 2 4 1 0 1 Independent Variable Dependent Variable X Y 1 2 3 4 5 6 1 2 3 4 5 0 3 y = mx+c Understanding Linear Regression Algorithm
  • 63. Copyright © 2017, edureka and/or its affiliates. All rights reserved.Machine Learning Training with Python www.edureka.co/python 𝑥 1 2 3 4 5 3 4 2 4 5 𝑦 mean 3 3.6 𝑥 − ҧ𝑥 -2 -1 0 1 2 𝑦 − ത𝑦 -0.6 0.4 -1.6 0.40.4 1.4 𝑥 − ҧ𝑥 2 4 1 0 1 2 Independent Variable Dependent Variable X Y 1 2 3 4 5 6 1 2 3 4 5 0 3 y = mx+c Understanding Linear Regression Algorithm
  • 64. Copyright © 2017, edureka and/or its affiliates. All rights reserved.Machine Learning Training with Python www.edureka.co/python 𝑥 1 2 3 4 5 3 4 2 4 5 𝑦 mean 3 3.6 𝑥 − ҧ𝑥 -2 -1 0 1 2 𝑦 − ത𝑦 -0.6 0.4 -1.6 0.40.4 1.4 𝑥 − ҧ𝑥 2 4 1 0 1 4 Independent Variable Dependent Variable X Y 1 2 3 4 5 6 1 2 3 4 5 0 3 y = mx+cy = mx+c (𝑥 − ҧ𝑥)(𝑦 − ത𝑦) 𝑥 − ҧ𝑥 2 m = 𝛴 𝛴 Understanding Linear Regression Algorithm
  • 65. Copyright © 2017, edureka and/or its affiliates. All rights reserved. Independent Variable Machine Learning Training with Python www.edureka.co/python 𝑥 1 2 3 4 5 3 4 2 4 5 𝑦 mean 3 3.6 𝑥 − ҧ𝑥 -2 -1 0 1 2 𝑦 − ത𝑦 -0.6 0.4 -1.6 0.40.4 1.4 𝑥 − ҧ𝑥 2 4 1 0 1 4 Independent Variable Dependent Variable X Y 1 2 3 4 5 6 1 2 3 4 5 0 3 y = mx+c Understanding Linear Regression Algorithm
  • 66. Copyright © 2017, edureka and/or its affiliates. All rights reserved.Machine Learning Training with Python www.edureka.co/python 𝑥 1 2 3 4 5 3 4 2 4 5 𝑦 mean 3 3.6 𝑥 − ҧ𝑥 -2 -1 0 1 2 𝑦 − ത𝑦 -0.6 0.4 -1.6 0.40.4 1.4 𝑥 − ҧ𝑥 2 4 1 0 1 4 (𝑥 − ҧ𝑥)(𝑦 − ത𝑦) -2 -1 0 1 2 -0.6 0.4 -1.6 0.4 1.4 Independent Variable Dependent Variable X Y 1 2 3 4 5 6 1 2 3 4 5 0 3 y = mx+c Understanding Linear Regression Algorithm
  • 67. Copyright © 2017, edureka and/or its affiliates. All rights reserved.Machine Learning Training with Python www.edureka.co/python 𝑥 1 2 3 4 5 3 4 2 4 5 𝑦 mean 3 3.6 𝑥 − ҧ𝑥 -2 -1 0 1 2 𝑦 − ത𝑦 -0.6 0.4 -1.6 0.40.4 1.4 𝑥 − ҧ𝑥 2 4 1 0 1 4 (𝑥 − ҧ𝑥)(𝑦 − ത𝑦) -1 0 2 1.2 0.4 -1.6 0.4 1.4 -0.4 2.8 0.4 Independent Variable Dependent Variable X Y 1 2 3 4 5 6 1 2 3 4 5 0 3 y = mx+c (𝑥 − ҧ𝑥)(𝑦 − ത𝑦)𝑥 − ҧ𝑥 2 y = mx+c (𝑥 − ҧ𝑥)(𝑦 − ത𝑦) 𝑥 − ҧ𝑥 2 m = 𝛴 𝛴 Understanding Linear Regression Algorithm
  • 68. Copyright © 2017, edureka and/or its affiliates. All rights reserved.Machine Learning Training with Python www.edureka.co/python 𝑥 1 2 3 4 5 3 4 2 4 5 𝑦 mean 3 3.6 𝑥 − ҧ𝑥 -2 -1 0 1 2 𝑦 − ത𝑦 -0.6 0.4 -1.6 0.40.4 1.4 𝑥 − ҧ𝑥 2 4 1 0 1 4 (𝑥 − ҧ𝑥)(𝑦 − ത𝑦) -1 0 2 1.2 0.4 -1.6 0.4 1.4 -0.4 2.8 0.4 Independent Variable Dependent Variable X Y 1 2 3 4 5 6 1 2 3 4 5 0 3 y = mx+c (𝑥 − ҧ𝑥)(𝑦 − ത𝑦) 𝛴 = 10 𝛴 = 4 𝑥 − ҧ𝑥 2 y = mx+c (𝑥 − ҧ𝑥)(𝑦 − ത𝑦) 𝑥 − ҧ𝑥 2 m = 𝛴 𝛴 Understanding Linear Regression Algorithm
  • 69. Copyright © 2017, edureka and/or its affiliates. All rights reserved.Machine Learning Training with Python www.edureka.co/python 𝑥 1 2 3 4 5 3 4 2 4 5 𝑦 mean 3 3.6 𝑥 − ҧ𝑥 -2 -1 0 1 2 𝑦 − ത𝑦 -0.6 0.4 -1.6 0.40.4 1.4 𝑥 − ҧ𝑥 2 4 1 0 1 4 (𝑥 − ҧ𝑥)(𝑦 − ത𝑦) -1 0 2 1.2 0.4 -1.6 0.4 1.4 -0.4 2.8 0.4 Independent Variable Dependent Variable X Y 1 2 3 4 5 6 1 2 3 4 5 0 3 y = mx+c 𝛴 = 10 𝛴 = 4 = 4 10 (𝑥 − ҧ𝑥)(𝑦 − ത𝑦) 𝑥 − ҧ𝑥 2 m = 𝛴 𝛴 Understanding Linear Regression Algorithm
  • 70. Copyright © 2017, edureka and/or its affiliates. All rights reserved.Machine Learning Training with Python www.edureka.co/python 𝑥 1 2 3 4 5 3 4 2 4 5 𝑦 mean 3 3.6 𝑥 − ҧ𝑥 -2 -1 0 1 2 𝑦 − ത𝑦 -0.6 0.4 -1.6 0.40.4 1.4 𝑥 − ҧ𝑥 2 4 1 0 1 4 (𝑥 − ҧ𝑥)(𝑦 − ത𝑦) -1 0 2 1.2 0.4 -1.6 0.4 1.4 -0.4 2.8 0.4 Independent Variable Dependent Variable X Y 1 2 3 4 5 6 1 2 3 4 5 0 3 y = mx+c 𝛴 = 10 𝛴 = 4 = 4 10 (𝑥 − ҧ𝑥)(𝑦 − ത𝑦) 𝑥 − ҧ𝑥 2 m = 𝛴 𝛴 Understanding Linear Regression Algorithm
  • 71. Copyright © 2017, edureka and/or its affiliates. All rights reserved.Machine Learning Training with Python www.edureka.co/python 𝑥 1 2 3 4 5 3 4 2 4 5 𝑦 mean 3 3.6 𝑥 − ҧ𝑥 -2 -1 0 1 2 𝑦 − ത𝑦 -0.6 0.4 -1.6 0.40.4 1.4 𝑥 − ҧ𝑥 2 4 1 0 1 4 (𝑥 − ҧ𝑥)(𝑦 − ത𝑦) -1 0 2 1.2 0.4 -1.6 0.4 1.4 -0.4 2.8 0.4 Independent Variable Dependent Variable X Y 1 2 3 4 5 6 1 2 3 4 5 0 3 3.6 y = mx+c 𝛴 = 10 𝛴 = 4 = 4 10 (𝑥 − ҧ𝑥)(𝑦 − ത𝑦) 𝑥 − ҧ𝑥 2 m = 𝛴 𝛴 Understanding Linear Regression Algorithm
  • 72. Copyright © 2017, edureka and/or its affiliates. All rights reserved.Machine Learning Training with Python www.edureka.co/python 𝑥 1 2 3 4 5 3 4 2 4 5 𝑦 mean 3 3.6 𝑥 − ҧ𝑥 -2 -1 0 1 2 𝑦 − ത𝑦 -0.6 0.4 -1.6 0.40.4 1.4 𝑥 − ҧ𝑥 2 4 1 0 1 4 (𝑥 − ҧ𝑥)(𝑦 − ത𝑦) -1 0 2 1.2 0.4 -1.6 0.4 1.4 -0.4 2.8 0.4 Independent Variable Dependent Variable X Y 1 2 3 4 5 6 1 2 3 4 5 0 3 y = mx+c (𝑥 − ҧ𝑥)(𝑦 − ത𝑦) 𝛴 = 10 𝛴 = 4 𝑥 − ҧ𝑥 2 m = 𝛴 = 4 10 0.4 Understanding Linear Regression Algorithm
  • 73. Copyright © 2017, edureka and/or its affiliates. All rights reserved.Machine Learning Training with Python www.edureka.co/python 𝑥 1 2 3 4 5 3 4 2 4 5 𝑦 mean 3 3.6 𝑥 − ҧ𝑥 -2 -1 0 1 2 𝑦 − ത𝑦 -0.6 0.4 -1.6 0.40.4 1.4 𝑥 − ҧ𝑥 2 4 1 0 1 4 (𝑥 − ҧ𝑥)(𝑦 − ത𝑦) -1 0 2 1.2 0.4 -1.6 0.4 1.4 -0.4 2.8 0.4 Independent Variable Dependent Variable X Y 1 2 3 4 5 6 1 2 3 4 5 0 3 y = mx+c (𝑥 − ҧ𝑥)(𝑦 − ത𝑦) 𝛴 = 10 𝛴 = 4 𝑥 − ҧ𝑥 2 m = 𝛴 = 4 10 3 Understanding Linear Regression Algorithm
  • 74. Copyright © 2017, edureka and/or its affiliates. All rights reserved.Machine Learning Training with Python www.edureka.co/python 𝑥 1 2 3 4 5 3 4 2 4 5 𝑦 mean 3 3.6 𝑥 − ҧ𝑥 -2 -1 0 1 2 𝑦 − ത𝑦 -0.6 0.4 -1.6 0.40.4 1.4 𝑥 − ҧ𝑥 2 4 1 0 1 4 (𝑥 − ҧ𝑥)(𝑦 − ത𝑦) -1 0 2 1.2 0.4 -1.6 0.4 1.4 -0.4 2.8 0.4 Independent Variable Dependent Variable X Y 1 2 3 4 5 6 1 2 3 4 5 0 3 y = mx+c (𝑥 − ҧ𝑥)(𝑦 − ത𝑦) 𝛴 = 10 𝛴 = 4 𝑥 − ҧ𝑥 2 m = 𝛴 = 4 10 3.6=0.4x3 + c Understanding Linear Regression Algorithm
  • 75. Copyright © 2017, edureka and/or its affiliates. All rights reserved.Machine Learning Training with Python www.edureka.co/python 𝑥 1 2 3 4 5 3 4 2 4 5 𝑦 mean 3 3.6 𝑥 − ҧ𝑥 -2 -1 0 1 2 𝑦 − ത𝑦 -0.6 0.4 -1.6 0.40.4 1.4 𝑥 − ҧ𝑥 2 4 1 0 1 4 (𝑥 − ҧ𝑥)(𝑦 − ത𝑦) -1 0 2 1.2 0.4 -1.6 0.4 1.4 -0.4 2.8 0.4 Independent Variable Dependent Variable X Y 1 2 3 4 5 6 1 2 3 4 5 0 3 y = mx+c (𝑥 − ҧ𝑥)(𝑦 − ത𝑦) 𝛴 = 10 𝛴 = 4 𝑥 − ҧ𝑥 2 m = 𝛴 = 4 10 3.6 = 1.2+ c Understanding Linear Regression Algorithm
  • 76. Copyright © 2017, edureka and/or its affiliates. All rights reserved.Machine Learning Training with Python www.edureka.co/python 𝑥 1 2 3 4 5 3 4 2 4 5 𝑦 mean 3 3.6 𝑥 − ҧ𝑥 -2 -1 0 1 2 𝑦 − ത𝑦 -0.6 0.4 -1.6 0.40.4 1.4 𝑥 − ҧ𝑥 2 4 1 0 1 4 (𝑥 − ҧ𝑥)(𝑦 − ത𝑦) -1 0 2 1.2 0.4 -1.6 0.4 1.4 -0.4 2.8 0.4 Independent Variable Dependent Variable X Y 1 2 3 4 5 6 1 2 3 4 5 0 3 y = mx+c (𝑥 − ҧ𝑥)(𝑦 − ത𝑦) 𝛴 = 10 𝛴 = 4 𝑥 − ҧ𝑥 2 m = 𝛴 = 4 10 3.6 - 1.2 = c Understanding Linear Regression Algorithm
  • 77. Copyright © 2017, edureka and/or its affiliates. All rights reserved.Machine Learning Training with Python www.edureka.co/python 𝑥 1 2 3 4 5 3 4 2 4 5 𝑦 mean 3 3.6 𝑥 − ҧ𝑥 -2 -1 0 1 2 𝑦 − ത𝑦 -0.6 0.4 -1.6 0.40.4 1.4 𝑥 − ҧ𝑥 2 4 1 0 1 4 (𝑥 − ҧ𝑥)(𝑦 − ത𝑦) -1 0 2 1.2 0.4 -1.6 0.4 1.4 -0.4 2.8 0.4 Independent Variable Dependent Variable X Y 1 2 3 4 5 6 1 2 3 4 5 0 3 y = mx+c (𝑥 − ҧ𝑥)(𝑦 − ത𝑦) 𝛴 = 10 𝛴 = 4 𝑥 − ҧ𝑥 2 m = 𝛴 = 4 10 c = 2.4 Understanding Linear Regression Algorithm
  • 78. Copyright © 2017, edureka and/or its affiliates. All rights reserved.Machine Learning Training with Python www.edureka.co/python 𝑥 1 2 3 4 5 3 4 2 4 5 𝑦 mean 3 3.6 𝑥 − ҧ𝑥 -2 -1 0 1 2 𝑦 − ത𝑦 -0.6 0.4 -1.6 0.40.4 1.4 𝑥 − ҧ𝑥 2 4 1 0 1 4 (𝑥 − ҧ𝑥)(𝑦 − ത𝑦) -1 0 2 1.2 0.4 -1.6 0.4 1.4 -0.4 2.8 0.4 Independent Variable Dependent Variable X Y 1 2 3 4 5 6 1 2 3 4 5 0 3 y = mx+c (𝑥 − ҧ𝑥)(𝑦 − ത𝑦) 𝛴 = 10 𝛴 = 4 𝑥 − ҧ𝑥 2 m = 𝛴 = 4 10 c = 2.4 m = 0.4 c = 2.4 y = 0.4x + 2.4 Understanding Linear Regression Algorithm
  • 79. Copyright © 2017, edureka and/or its affiliates. All rights reserved.Machine Learning Training with Python www.edureka.co/python Mean Square Error Independent Variable Dependent Variable X Y 1 2 4 5 60 3 1 2 3 4 5 m = 0.4 c = 2.4 y = 0.4x + 2.4 For given m = 0.4 & c = 2.4, lets predict values for y for x = {1,2,3,4,5} y = 0.4 x 1+ 2.4 = 2.8 y = 0.4 x 2+ 2.4= 3.2 y = 0.4 x 3+ 2.4= 3.6 y = 0.4 x 4+ 2.4= 4.0 y = 0.4 x 5+ 2.4= 4.4
  • 80. Copyright © 2017, edureka and/or its affiliates. All rights reserved.Machine Learning Training with Python www.edureka.co/python Mean Square Error Independent Variable Dependent Variable X Y 1 2 4 5 60 3 1 2 3 4 5 Regression Line m = 0.4 c = 2.4 y = 0.4x + 2.4 For given m = 0.4 & c = 2.4, lets predict values for y for x = {1,2,3,4,5} y = 0.4 x 1+ 2.4 = 2.8 y = 0.4 x 2+ 2.4= 3.2 y = 0.4 x 3+ 2.4= 3.6 y = 0.4 x 4+ 2.4= 4.0 y = 0.4 x 5+ 2.4= 4.4
  • 81. Copyright © 2017, edureka and/or its affiliates. All rights reserved.Machine Learning Training with Python www.edureka.co/python Mean Square Error Independent Variable Dependent Variable X Y 1 2 4 5 60 3 1 2 3 4 5 Regression Line m = 0.4 c = 2.4 y = 0.4x + 2.4 For given m = 0.4 & c = 2.4, lets predict values for y for x = {1,2,3,4,5} y = 0.4 x 1+ 2.4 = 2.8 y = 0.4 x 2+ 2.4= 3.2 y = 0.4 x 3+ 2.4= 3.6 y = 0.4 x 4+ 2.4= 4.0 y = 0.4 x 5+ 2.4= 4.4
  • 82. Copyright © 2017, edureka and/or its affiliates. All rights reserved.Machine Learning Training with Python www.edureka.co/python Mean Square Error Independent Variable Dependent Variable X Y 1 2 4 5 60 3 1 2 3 4 5 Regression Line Distance between actual & predicted value error error error error
  • 83. Copyright © 2017, edureka and/or its affiliates. All rights reserved. Finding the best fit line Machine Learning Training with Python www.edureka.co/python
  • 84. Copyright © 2017, edureka and/or its affiliates. All rights reserved. Let’s check the Goodness of fit
  • 85. Copyright © 2017, edureka and/or its affiliates. All rights reserved. What is R- Square? Machine Learning Training with Python www.edureka.co/python ▪ R-squared value is a statistical measure of how close the data are to the fitted regression line ▪ It is also known as coefficient of determination, or the coefficient of multiple determination
  • 86. Copyright © 2017, edureka and/or its affiliates. All rights reserved. Independent Variable Dependent Variable X Y Machine Learning Training with Python www.edureka.co/python Calculation of 𝑹 𝟐 1 2 4 5 60 3 1 2 3 4 5 Actual Value 1 2 3 4 5 3 4 2 4 5 𝑥 𝑦 mean
  • 87. Copyright © 2017, edureka and/or its affiliates. All rights reserved. Independent Variable Dependent Variable X Y Machine Learning Training with Python www.edureka.co/python 1 2 4 5 60 3 1 2 3 4 5 Actual Value mean distance = actual - mean Calculation of 𝑹 𝟐
  • 88. Copyright © 2017, edureka and/or its affiliates. All rights reserved. Independent Variable Dependent Variable X Y Machine Learning Training with Python www.edureka.co/python 1 2 4 5 60 3 1 2 3 4 5 Predicted Value 1 2 3 4 5 2.8 𝑥 3.2 4.0 3.6 4.4 Regression line Calculation of 𝑹 𝟐 𝑦𝑝
  • 89. Copyright © 2017, edureka and/or its affiliates. All rights reserved. Independent Variable Dependent Variable X Y Machine Learning Training with Python www.edureka.co/python Distance actual - mean Distance predicted - mean 1 2 4 5 60 3 1 2 3 4 5 Actual vs Predicted Value Regression line vs This is nothing but 𝑅2 = Calculation of 𝑹 𝟐 𝑦 − ത𝑦 2 𝑦𝑝 − ത𝑦 2 𝛴 𝛴
  • 90. Copyright © 2017, edureka and/or its affiliates. All rights reserved.Machine Learning Training with Python www.edureka.co/python Calculation of 𝑹 𝟐 𝑥 1 2 3 4 5 3 4 2 4 5 𝑦 – 0.6 0.4 –1.6 0.4 1.4 𝑦 − ത𝑦 mean y 3.6 𝑅2 = 𝑦 − ത𝑦 2 𝑦𝑝 − ത𝑦 2 𝛴 𝛴
  • 91. Copyright © 2017, edureka and/or its affiliates. All rights reserved.Machine Learning Training with Python www.edureka.co/python Calculation of 𝑹 𝟐 𝑥 1 2 3 4 5 3 4 2 4 5 𝑦 – 0.6 0.4 –1.6 0.4 1.4 𝑦 − ത𝑦 mean y 3.6 𝑦𝑝𝑦 − ത𝑦 2 3.6 1.6 2.56 1.6 1.96 2.8 3.2 3.6 4.0 4.4 (𝑦𝑝 − ത𝑦) -0.8 -0.4 0 0.4 0.8 𝑅2 = 𝑦 − ത𝑦 2 𝑦𝑝 − ത𝑦 2 𝛴 𝛴
  • 92. Copyright © 2017, edureka and/or its affiliates. All rights reserved.Machine Learning Training with Python www.edureka.co/python Calculation of 𝑹 𝟐 𝑥 1 2 3 4 5 3 4 2 4 5 𝑦 – 0.6 0.4 –1.6 0.4 1.4 𝑦 − ത𝑦 mean y 3.6 𝑦𝑝𝑦 − ത𝑦 2 3.6 1.6 2.56 1.6 1.96 2.8 3.2 3.6 4.0 4.4 (𝑦𝑝 − ത𝑦) -0.8 -0.4 0 0.4 0.8 𝑦𝑝 − ത𝑦 2 6.4 1.6 0 1.6 6.4 𝑅2 = 𝑦 − ത𝑦 2 𝑦𝑝 − ത𝑦 2 𝛴 𝛴
  • 93. Copyright © 2017, edureka and/or its affiliates. All rights reserved.Machine Learning Training with Python www.edureka.co/python Calculation of 𝑹 𝟐 𝑥 1 2 3 4 5 3 4 2 4 5 𝑦 – 0.6 0.4 –1.6 0.4 1.4 𝑦 − ത𝑦 mean y 3.6 𝑦𝑝𝑦 − ത𝑦 2 0.36 0.16 2.56 0.16 1.96 2.8 3.2 3.6 4.0 4.4 (𝑦𝑝 − ത𝑦) -0.8 -0.4 0 0.4 0.8 0.64 0.16 0 0.16 0.64 1.6 𝑅2 = 𝑦 − ത𝑦 2 (𝑦𝑝 −ത𝑦 2 𝛴 𝛴 5.2𝛴 𝛴 1.6 5.2 = 𝑦𝑝 − ത𝑦 2
  • 94. Copyright © 2017, edureka and/or its affiliates. All rights reserved.Machine Learning Training with Python www.edureka.co/python Calculation of 𝑹 𝟐 𝑥 1 2 3 4 5 3 4 2 4 5 𝑦 – 0.6 0.4 –1.6 0.4 1.4 𝑦 − ത𝑦 mean y 3.6 𝑦𝑝𝑦 − ത𝑦 2 3.6 1.6 2.56 1.6 1.96 2.8 3.2 3.6 4.0 4.4 (𝑦𝑝 − ത𝑦) -0.8 -0.4 0 0.4 0.8 (𝑦𝑝 −−ത𝑦 2 6.4 1.6 0 1.6 6.4 11.32 16 𝑅2 ≈ 0.3
  • 95. Copyright © 2017, edureka and/or its affiliates. All rights reserved.Machine Learning Training with Python www.edureka.co/python Calculation of 𝑹 𝟐 𝑅2 ≈ 0.3 Independent Variable Dependent Variable X Y 1 2 4 5 60 3 1 2 3 4 5 Actual vs Predicted Value
  • 96. Copyright © 2017, edureka and/or its affiliates. All rights reserved.Machine Learning Training with Python www.edureka.co/python Calculation of 𝑹 𝟐 𝑅2 ≈ 0.7 Independent Variable Dependent Variable X Y 1 2 4 5 60 3 1 2 3 4 5 Actual vs Predicted Value
  • 97. Copyright © 2017, edureka and/or its affiliates. All rights reserved.Machine Learning Training with Python www.edureka.co/python Calculation of 𝑹 𝟐 𝑅2 ≈ 0.9 Independent Variable Dependent Variable X Y 1 2 4 5 60 3 1 2 3 4 5 Actual vs Predicted Value
  • 98. Copyright © 2017, edureka and/or its affiliates. All rights reserved.Machine Learning Training with Python www.edureka.co/python Calculation of 𝑹 𝟐 𝑅2 ≈ 1 Independent Variable Dependent Variable X Y 1 2 4 5 60 3 1 2 3 4 5 Actual vs Predicted Value
  • 99. Copyright © 2017, edureka and/or its affiliates. All rights reserved.Machine Learning Training with Python www.edureka.co/python Calculation of 𝑹 𝟐 𝑅2 ≈ 0.02 Independent Variable Dependent Variable X Y 1 2 4 5 60 3 1 2 3 4 5 Actual vs Predicted Value
  • 100. Copyright © 2017, edureka and/or its affiliates. All rights reserved. Are Low R-squared values always Machine Learning Training with Python www.edureka.co/python
  • 101. Copyright © 2017, edureka and/or its affiliates. All rights reserved. Are High R-squared values always Machine Learning Training with Python www.edureka.co/python
  • 102. Copyright © 2017, edureka and/or its affiliates. All rights reserved. DEMO Let’s learn to code © MadeByShape Ltd. Web Design Manchester 2018. All Rights Reserved.
  • 103. Copyright © 2017, edureka and/or its affiliates. All rights reserved.