Machine Learning types Dr. Mostafa A. Elhosseini
YouTube Channel
ML Jargon
≡ Training set: the examples that the system uses to learn
≡ Training instance (or training sample): each individual training example
≡ Label: the desired output for an instance, e.g., spam or not spam
≡ Feature: an attribute together with its value
≡ Predictors: the features used to predict the target
≡ Ex. Price of a car
▪ The set of features (mileage, age, brand) are called predictors
▪ Feature: Mileage = 15,000
≡ Performance measure P: how well the system does its task
▪ Ex. Ratio of correctly classified emails (accuracy)
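The jargon above can be made concrete with a toy spam dataset. This is only a minimal sketch: the feature names (`num_links`, `has_offer`), the values, the `accuracy` helper, and the rule used to produce predictions are all invented for illustration.

```python
# Toy training set: each training instance is a dict of features plus a label.
# All feature names and values are made up for illustration.
training_set = [
    {"features": {"num_links": 7, "has_offer": 1}, "label": "spam"},
    {"features": {"num_links": 0, "has_offer": 0}, "label": "ham"},
    {"features": {"num_links": 5, "has_offer": 1}, "label": "spam"},
    {"features": {"num_links": 1, "has_offer": 0}, "label": "ham"},
]


def accuracy(predictions, labels):
    """Performance measure P: ratio of correctly classified emails."""
    correct = sum(p == y for p, y in zip(predictions, labels))
    return correct / len(labels)


labels = [inst["label"] for inst in training_set]

# A hypothetical hand-written rule, only here so there is something to score.
predictions = [
    "spam" if inst["features"]["num_links"] > 3 else "ham"
    for inst in training_set
]

print(accuracy(predictions, labels))  # 1.0 on this toy data
```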
Machine learning definition
≡ Machine Learning is the science (and art) of programming
computers so they can learn from data
≡ The field of study that gives computers the ability to learn without
being explicitly programmed – Arthur Samuel, 1959
≡ A computer program is said to learn from experience E with respect
to some task T and some performance measure P, if its performance
on T, as measured by P, improves with experience E – Tom Mitchell, 1997
ML types
≡ ML systems can be categorized by whether or not they are trained with human supervision
▪ supervised,
▪ unsupervised,
▪ semisupervised, and
▪ reinforcement learning
≡ Whether or not they can learn incrementally on the fly
▪ online versus batch learning
≡ Whether they work by simply comparing new data points to known
data points, or instead detect patterns in the training data and build
a predictive model, much like scientists do
▪ instance-based versus model-based learning
Supervised / Unsupervised
≡ In supervised learning, the training data you feed to the algorithm
includes the desired solutions, called labels
▪ Classification
▪ Regression
≡ Classification
▪ The spam filter is a good example of this: it is trained with many example emails along with their
class (spam or ham), and it must learn how to classify new emails
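As a rough illustration of learning from labeled emails, the sketch below counts how often each word occurs per class and assigns a new email to the class whose words it shares more of. It is a deliberately simple word-count classifier, not a real spam filter; the emails and the `fit` / `classify` helpers are invented for this example.

```python
from collections import Counter

# Labeled training emails (invented examples): text plus its class.
train = [
    ("win free money now", "spam"),
    ("free prize claim now", "spam"),
    ("meeting agenda for monday", "ham"),
    ("lunch on monday", "ham"),
]


def fit(examples):
    """Count how often each word appears in each class."""
    counts = {"spam": Counter(), "ham": Counter()}
    for text, label in examples:
        counts[label].update(text.split())
    return counts


def classify(counts, text):
    """Predict the class whose training emails used these words more often."""
    words = text.split()
    spam_score = sum(counts["spam"][w] for w in words)
    ham_score = sum(counts["ham"][w] for w in words)
    return "spam" if spam_score > ham_score else "ham"


model = fit(train)
print(classify(model, "claim your free money"))  # spam
print(classify(model, "agenda for lunch"))       # ham
```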
Supervised / Unsupervised
≡ Regression
▪ Predict a target numeric value, such as the price of a car, given a set of
features (mileage, age, brand, etc.) called predictors.
≡ Some regression algorithms can be used for classification as well
▪ Logistic Regression is commonly used for classification, as it can output a
value that corresponds to the probability of belonging to a given class
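Both ideas on this slide can be sketched in a few lines. The first part fits a straight line to invented (mileage, price) pairs using ordinary least squares; the second shows how Logistic Regression turns a linear score into a probability with the sigmoid function. The data, the units, and the weight and bias values are assumptions chosen for illustration, not taken from the lecture.

```python
import math


def fit_line(xs, ys):
    """Least-squares fit of y = a*x + b for a single feature."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov_xy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    a = cov_xy / var_x
    b = mean_y - a * mean_x
    return a, b


# Invented data: mileage in thousands of km, price in thousands of dollars.
mileage = [10, 20, 30, 40]
price = [18, 16, 14, 12]
a, b = fit_line(mileage, price)
predicted_price = a * 25 + b  # regression: predict a numeric target
print(predicted_price)


def sigmoid(z):
    """Map any real-valued score to a value between 0 and 1."""
    return 1.0 / (1.0 + math.exp(-z))


# Hypothetical, hand-picked parameters for a one-feature spam model.
w, bias = 1.5, -4.0
p_spam = sigmoid(w * 3 + bias)  # probability of the "spam" class
print(p_spam)
```

A classifier is obtained from the probability by applying a cutoff, for example predicting "spam" when `p_spam` exceeds 0.5.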
Supervised / Unsupervised
≡ In unsupervised learning, the training data is unlabeled.
≡ The system tries to learn without a teacher.
≡ Most important unsupervised learning tasks
▪ Clustering (e.g., detecting groups of similar blog visitors)
▪ Visualization and dimensionality reduction
▪ Association rule learning
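Clustering can be sketched with k-means, one common clustering algorithm (the slides do not name a specific one). The visitor data, the two features, and the starting centroids below are invented; note that no labels are used anywhere.

```python
def kmeans(points, centroids, n_iter=10):
    """Plain k-means on 2-D points, starting from the given centroids."""
    for _ in range(n_iter):
        # Assignment step: attach each point to its nearest centroid.
        clusters = [[] for _ in centroids]
        for p in points:
            dists = [(p[0] - c[0]) ** 2 + (p[1] - c[1]) ** 2 for c in centroids]
            clusters[dists.index(min(dists))].append(p)
        # Update step: move each centroid to the mean of its cluster
        # (an empty cluster keeps its previous centroid).
        centroids = [
            (sum(p[0] for p in cl) / len(cl), sum(p[1] for p in cl) / len(cl))
            if cl else c
            for cl, c in zip(clusters, centroids)
        ]
    return centroids, clusters


# Invented visitor data: (visits per week, average minutes per visit).
visitors = [(1, 2), (2, 1), (1, 1), (9, 10), (10, 9), (10, 10)]
centroids, clusters = kmeans(visitors, centroids=[(1, 2), (9, 10)])
print(clusters[0])  # the occasional visitors
print(clusters[1])  # the frequent visitors
```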
Unsupervised // Dimensionality reduction
≡ Simplify the data without losing too much information
≡ One way to do this is to merge several correlated features into one
▪ A car’s mileage may be strongly correlated with its age, so the dimensionality
reduction algorithm will merge them into one feature that represents the
car’s wear and tear. This is called feature extraction
≡ It is often a good idea to reduce the dimension of your training data
before feeding it to another learning algorithm
▪ That algorithm will run much faster,
▪ The data will take up less disk and memory space, and
▪ In some cases the algorithm may also perform better
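The mileage and age example can be sketched with principal component analysis (PCA), one common dimensionality reduction technique; the slides do not say which algorithm is meant, so PCA is an assumption here. The car data is invented. Both features are standardized first, then projected onto the direction along which they vary together, which yields a single "wear" value per car.

```python
import math


def standardize(values):
    """Rescale a feature to zero mean and unit variance."""
    mean = sum(values) / len(values)
    std = math.sqrt(sum((v - mean) ** 2 for v in values) / len(values))
    return [(v - mean) / std for v in values]


def first_principal_component(xs, ys):
    """Project two centered features onto their main axis of variation."""
    n = len(xs)
    cxx = sum(x * x for x in xs) / n
    cyy = sum(y * y for y in ys) / n
    cxy = sum(x * y for x, y in zip(xs, ys)) / n
    # Angle of the leading eigenvector of the 2x2 covariance matrix.
    theta = 0.5 * math.atan2(2 * cxy, cxx - cyy)
    ux, uy = math.cos(theta), math.sin(theta)
    return [x * ux + y * uy for x, y in zip(xs, ys)]


# Invented cars: mileage in km and age in years, strongly correlated.
mileage = [15000, 40000, 62000, 90000, 120000]
age = [1, 3, 4, 6, 8]

# Two features in, one extracted feature out.
wear = first_principal_component(standardize(mileage), standardize(age))
print(wear)
```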
Unsupervised // Anomaly detection
≡ Examples: detecting unusual credit card transactions to prevent fraud,
catching manufacturing defects, or automatically removing outliers
from a dataset before feeding it to another learning algorithm.
≡ The system is trained with normal instances, and when it sees a new
instance it can tell whether it looks like a normal one or whether it is
likely an anomaly
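A minimal sketch of this idea, assuming a single numeric feature and a simple z-score rule: the detector learns the mean and standard deviation of normal transaction amounts, then flags new amounts that lie too many standard deviations away. The amounts and the threshold of 3 are invented; real fraud detectors use far richer models.

```python
import statistics


def fit_normal(amounts):
    """Learn what 'normal' looks like from normal instances only."""
    return statistics.mean(amounts), statistics.stdev(amounts)


def is_anomaly(amount, mean, std, threshold=3.0):
    """Flag an instance that lies more than `threshold` std devs from the mean."""
    return abs(amount - mean) / std > threshold


# Invented card transactions, all considered normal.
normal_amounts = [20, 25, 22, 30, 27, 24, 26, 23]
mean, std = fit_normal(normal_amounts)

print(is_anomaly(28, mean, std))   # False: close to the usual amounts
print(is_anomaly(500, mean, std))  # True: far outside the usual range
```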
Unsupervised // Association rule learning
≡ The goal is to dig into large amounts of data and discover interesting
relations between attributes.
≡ For example, suppose you own a supermarket. Running an association rule
learning algorithm on your sales logs may reveal that people who purchase
barbecue sauce and potato chips also tend to buy steak.
Thus, you may want to place these items close to each other.
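Association rules are usually scored with support (how often the items occur together) and confidence (how often the consequent appears when the antecedent does). The sketch below computes both for the rule {barbecue sauce, potato chips} → {steak} on a handful of invented baskets; it evaluates one given rule and does not search for rules as a full algorithm would.

```python
def support(transactions, items):
    """Fraction of transactions that contain all the given items."""
    items = set(items)
    return sum(items <= t for t in transactions) / len(transactions)


def confidence(transactions, antecedent, consequent):
    """Estimated P(consequent | antecedent) over the transactions."""
    both = set(antecedent) | set(consequent)
    return support(transactions, both) / support(transactions, antecedent)


# Invented sales log: each basket is the set of items in one purchase.
baskets = [
    {"barbecue sauce", "potato chips", "steak"},
    {"barbecue sauce", "potato chips", "steak", "soda"},
    {"barbecue sauce", "potato chips"},
    {"bread", "milk"},
    {"steak", "bread"},
]

rule_from = {"barbecue sauce", "potato chips"}
rule_to = {"steak"}
print(support(baskets, rule_from | rule_to))
print(confidence(baskets, rule_from, rule_to))
```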
Batch/ Online learning
≡ In batch learning, the system is incapable of learning incrementally
≡ It must be trained using all the available data
≡ This takes a lot of time and computing resources
≡ Training is typically done offline
≡ Ex. If a new type of spam appears, you need to train a new version of the
system from scratch on the full dataset
Batch/ Online learning
≡ In online learning, you train the system incrementally by feeding it data instances
sequentially, either individually or in small groups called mini-batches.
≡ Each learning step is fast and cheap, so the system can learn about
new data on the fly, as it arrives
≡ Online learning algorithms can also train on datasets too large to fit in
memory (out-of-core learning). That whole process is usually done offline
(i.e., not on the live system), so online learning can be a confusing name.
Think of it as incremental learning.
≡ Learning rate
▪ How fast the system should adapt to changing data
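Incremental training and the role of the learning rate can be sketched with a linear model updated one instance at a time by stochastic gradient descent. This is an illustrative assumption: the slides do not prescribe an algorithm, and the data stream and the learning rate of 0.05 are invented.

```python
def online_update(w, b, x, y, learning_rate):
    """One gradient step on the squared error of y ≈ w*x + b for one instance."""
    error = (w * x + b) - y
    w -= learning_rate * error * x
    b -= learning_rate * error
    return w, b


# Invented stream of instances generated from y = 2x + 1.
stream = [(x, 2 * x + 1) for x in [0.0, 1.0, 2.0, 3.0]] * 500

w, b = 0.0, 0.0
for x, y in stream:
    # Each instance is used once as it arrives, then can be discarded.
    w, b = online_update(w, b, x, y, learning_rate=0.05)

print(w, b)  # approaches 2 and 1
```

A higher learning rate makes the model adapt faster to new data but also forget old data sooner; a lower one makes it slower to adapt and less sensitive to noisy instances.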
Instance-based // Model-based
≡ One more way to categorize Machine Learning systems is by how they
generalize
▪ A system needs to be able to generalize to examples it has never seen before
≡ Having a good performance measure on the training data is good,
but insufficient; the true goal is to perform well on new instances
≡ There are two main approaches to generalization: instance-based
learning and model-based learning
Instance-based // Model-based
≡ The most trivial form of learning is simply to learn by heart
▪ For a spam filter: just flag all emails that are identical to emails that have
already been flagged by users
▪ Not the worst solution, but certainly not the best.
≡ Instead of just flagging emails that are identical to known spam
emails, your spam filter could be programmed to also flag emails that
are very similar to known spam emails.
≡ This requires a measure of similarity between two emails
≡ The system would flag an email as spam if it has many words in
common with a known spam email
≡ This is called instance-based learning: the system learns the examples by
heart, then generalizes to new cases using a similarity measure
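The similarity idea on this slide can be sketched directly: count the words two emails share and flag a new email when it overlaps enough with any stored spam email. The stored emails and the cutoff of three common words are invented for illustration.

```python
def similarity(email_a, email_b):
    """Number of distinct words the two emails have in common."""
    return len(set(email_a.lower().split()) & set(email_b.lower().split()))


def is_spam(email, known_spam, min_common_words=3):
    """Flag the email if it is similar enough to any known spam email."""
    return any(similarity(email, s) >= min_common_words for s in known_spam)


# The "model" is just the stored examples themselves.
known_spam = ["win a free prize now", "cheap loans approved today"]

print(is_spam("claim your free prize now", known_spam))       # True
print(is_spam("project meeting moved to today", known_spam))  # False
```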
Instance-based // Model-based
≡ Another way to generalize from a set of examples is to build a model
of these examples, then use that model to make predictions.
This is called model-based learning
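In contrast to storing examples, a model-based learner summarizes the training data in a few parameters and predicts from those alone. The sketch below assumes a hypothetical one-parameter model: a cutoff on the number of suspicious words, set to the midpoint between the two class means. The feature, the data, and the `fit_threshold` / `predict` helpers are invented for illustration.

```python
def fit_threshold(values, labels):
    """Learn one parameter: the midpoint between the two class means."""
    spam = [v for v, y in zip(values, labels) if y == "spam"]
    ham = [v for v, y in zip(values, labels) if y == "ham"]
    return (sum(spam) / len(spam) + sum(ham) / len(ham)) / 2


def predict(threshold, value):
    """Use only the learned parameter; the training data is no longer needed."""
    return "spam" if value > threshold else "ham"


# Invented training data: suspicious-word count per email, with its label.
word_counts = [0, 1, 2, 6, 8, 7]
labels = ["ham", "ham", "ham", "spam", "spam", "spam"]

threshold = fit_threshold(word_counts, labels)
print(threshold)              # 4.0
print(predict(threshold, 5))  # spam
print(predict(threshold, 3))  # ham
```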

Lecture 02 ml supervised and unsupervised
