SlideShare a Scribd company logo
Machine Learning, Data Mining, Genetic Algorithms, Neural Networks   ISYS370 Dr. R. Weber
Concept Learning is a Form of Inductive Learning Learner uses: positive examples (instances ARE examples of a concept) and  negative examples (instances ARE NOT examples of a concept)
Concept Learning Needs empirical validation Dense or sparse data determine quality of different methods
Validation of Concept Learning i The learned concept should be able to correctly classify new instances of the concept When it succeeds in a real instance of the concept it finds true positives  When it fails in a real instance of the concept it finds false negatives
Validation of Concept Learning ii The learned concept should be able to correctly classify new instances of the concept When it succeeds in a counterexample it finds true negatives When it fails in a counterexample it finds false positives
Rule Learning Learning widely used in data mining Version Space Learning is a search method to learn rules Decision Trees
Decision trees Knowledge representation formalism Represent mutually exclusive rules (disjunction) A way of breaking up a data set into classes or categories Classification rules that determine, for each instance with attribute values, whether it belongs to one or another class Not incremental
Decision  trees - leaf nodes (classes) -  decision nodes  (tests on attribute values) - from decision nodes branches grow for each possible outcome of the test From Cawsey, 1997
Decision tree induction Goal is to correctly classify all example data Several algorithms to induce decision trees:  ID3 (Quinlan 1979) , CLS, ACLS, ASSISTANT, IND, C4.5 Constructs decision tree from past data Attempts to find the simplest tree (not guaranteed because it is based on heuristics)
From: a set of target classes Training data containing objects of more than one class ID3 uses test to refine the training data set into subsets that contain objects of only one class each Choosing the right test is the key ID3 algorithm
Information gain or ‘minimum entropy’ Maximizing information gain corresponds to minimizing entropy Predictive features (good indicators of the outcome) How does ID3 chooses tests
Information gain or ‘minimum entropy’ Maximizing information gain corresponds to minimizing entropy Predictive features (good indicators of the outcome) Choosing tests
Bad 400 salaried 1,500 4 Very good 300 Waged 3,000 3 Very bad 600 Salaried 4,000 2 Good 200 Salaried 2,000 1 Loan status Repayment Job status Monthy income
Link analysis Deviation detection Data mining tasks ii Rules:  Association generation Relationships between entities How things change over time, trends
KDD applications Fraud detection Telecom (calling cards, cell phones) Credit cards Health insurance Loan approval Investment analysis Marketing and sales data analysis Identify potential customers Effectiveness of sales campaign Store layout
Text mining The problem starts with a query and the solution is a set of information (e.g., patterns, connections, profiles, trends) contained in several different texts that are potentially relevant to the initial query.
Text mining applications IBM Text Navigator Cluster documents by content; Each document is annotated by the 2 most frequently used words in the cluster; Concept Extraction (Los Alamos) Text analysis of medical records; Uses a clustering approach based on trigram representation; Documents in vectors, cosine for comparison;
rule-based ES case-based reasoning inductive ML, NN algorithms deductive reasoning analogical reasoning inductive reasoning search Problem solving  method Reasoning  type
Genetic Algorithms (GA)
Genetic algorithms (i) learn by experimentation based on human genetics, it originates new solutions  representational restrictions good to improve quality of other methods e.g., search algorithms, CBR evolutionary algorithms (broader)
Genetic algorithms (ii) requires an evaluation function to guide the process population of genomes represent possible solutions operations are applied over these genomes operations can be mutation, crossover operations produce new offspring an evaluation function tests how fit an offspring is  the fittest will survive to mate again
Genetic Algorithms ii http://ai.bpa.arizona.edu/~mramsey/ga.html   You can change parameters http://www.rennard.org/alife/english/gavgb.html  Steven Thompson presented
Neural Networks (NN)
~= 2 nd -5 th  week training vision the evidence
the evidence ~= 2 nd -5 th  week training vision 10
the evidence ~= 2 nd -5 th  week training vision 10
the evidence ~= 2 nd -5 th  week training vision
NN: model of brains input output neurons synapses electric transmissions :
Elements input nodes output nodes links weights
terminology input  and  output nodes  (or units) connected by  links each  link  has a numeric  weight weights  store information networks are trained on  training sets  (examples) and after are tested on  test sets  to assess networks’  accuracy learning/training  takes place as weights are updated to reflect the input/output behavior
The concept => mammal => bird 0 1 1 4 legs fly lay eggs 1 0 0 1    Yes, 0    No => mammal 1 1 0
The concept => mammal => bird 0 1 1 4 legs fly lay eggs 1 0 0 => mammal 1 1 0 1    Yes, 0    No
The concept => mammal => bird 0 1 1 4 legs fly lay eggs 1 0 0 => mammal 1 1 0 0.5  0.5  0.5 1    Yes, 0    No
=> mammal => bird 0 1 1 4 legs fly lay eggs 1 0 0 => mammal 1 1 0 0*0.5+1*0.5+1*0.5= 1 1*0.5+0*0.5+0*0.5= 0.5 1*0.5+1*0.5+0*0.5= 1 Goal is to have  weights that recognize different  representations of mammals and birds as such  0.5  0.5  0.5
=> mammal => bird 0 1 1 4 legs fly lay eggs 1 0 0 => mammal 1 1 0 0*0.5+1*0.5+1*0.5= 1 1*0.5+0*0.5+0*0.5= 0.5 1*0.5+1*0.5+0*0.5= 1 Suppose we want bird to be greater 0.5  and mammal to be equal or less than 0.5 0.5  0.5  0.5
=> mammal => bird 0 1 1 4 legs fly lay eggs 1 0 0 => mammal 1 1 0 0*0.25+1*0.25+1*0.5= 0.75 1*0.25+0*0.25+0*0.5= 0.25 1*0.25+1*0.25+0*0.5= 0.5 Suppose we want bird to be greater 0.5  and mammal to be equal or less than 0.5 0.25  0.25  0.5
The training Output=Step(  w  f  ) learning  takes place as weights are updated to reflect the input/output behavior => mammal (1) => bird  (0) 0 1 1 4 legs flies eggs i=1 i=2 i=3 j=1 j=2 j=3 ij Goal minimize error between representation of the expected and actual outcome 20   ij 0  0  0 0  0  0 0  0  0 1  0  0 1  0  0 1  0  0 1  0  0 1  0  0 1  1  1 1  0  0 1  1  1 1  1  1
NN demo…..
Characteristics NN implement inductive learning algorithms (through generalization) therefore, it requires several training examples to learn NN do not provide an explanation why the task performed the way it was no explicit knowledge; uses data Classification (pattern recognition),  clustering, diagnosis, optimization, forecasting (prediction), modeling, reconstruction, routing
Where are NN applicable? Where they can form a model from training data alone; When there may be an algorithm, but it is not known, or has too many variables; There are enough examples available It is easier to let the network learn from examples Other inductive learning methods may not be as accurate
Applications (i) predict movement of stocks, currencies, etc., from previous data; to recognize signatures made (e.g. in a bank) with those stored; to monitor the state of aircraft engines (by monitoring vibration levels and sound, early warning of engine problems can be given; British Rail have been testing an application to monitor diesel engines;
Applications (ii) Pronunciation (rules with many exceptions) Handwritten character recognition (network w/ 200,000 is impossible to train, final 9,760 weights, used 7300 examples to train and 2,000 to test, 99% accuracy) Learn brain patterns to control and activate limbs as in the “Rats control a robot by thought alone” article Credit assignment
CMU Driving ALVINN learns from human drivers how to steer a vehicle along a single lane on a highway ALVINN is implemented in two vehicles equipped with computer-controlled steering, acceleration, and braking  cars can reach 70 m/h with ALVINN programs that consider all the problem environment reach 4 m/h only
Why using NN for the driving task?  there is no good theory of driving, but it is easy to collect training samples training data is obtained with a human* driving the vehicle 5min training, 10 min algorithm runs driving is continuous and noisy almost all features contribute with useful information *humans are not very good generators of training instances when they behave too regularly without making mistakes
INPUT: video camera generates array of 30x32 grid of input nodes OUTPUT: 30 nodes layer corresponding to steering direction vehicle steers to the direction of the layer with highest activation the neural network
Resources http://www.cs.stir.ac.uk/~lss/NNIntro/InvSlides.html#what http://www.ri.cmu.edu/projects/project_160.html http://www.txtwriter.com/Onscience/Articles/ratrobot.html

More Related Content

What's hot (18)

PPTX
Machine Learning Algorithms | Machine Learning Tutorial | Data Science Algori...
Simplilearn
 
PDF
Prediction of Euro 50 Using Back Propagation Neural Network (BPNN) and Geneti...
AI Publications
 
PPTX
Implementing Item Response Theory
Nathan Thompson
 
PDF
Introduction to the Genetic Algorithm
Qiang Hao
 
PPT
Introduction to Item Response Theory
OpenThink Labs
 
PPTX
Machine learning basics using trees algorithm (Random forest, Gradient Boosting)
Parth Khare
 
PDF
Genetic Algorithms
Alaa Khamis, PhD, SMIEEE
 
PPT
Topic_6
butest
 
PPTX
Analysis of Parameter using Fuzzy Genetic Algorithm in E-learning System
Harshal Jain
 
PDF
Handling Imbalanced Data: SMOTE vs. Random Undersampling
IRJET Journal
 
PDF
Module 6: Ensemble Algorithms
Sara Hooker
 
PPTX
Machine Learning Unit 2 Semester 3 MSc IT Part 2 Mumbai University
Madhav Mishra
 
PDF
Machine Learning Interview Questions
Rock Interview
 
PPTX
Introduction to unidimensional item response model
Sumit Das
 
PPTX
Classical Test Theory (CTT)- By Dr. Jai Singh
Academy for Higher Education and Social Science Research
 
PPTX
Survival of the Fittest: Using Genetic Algorithm for Data Mining Optimization
Or Levi
 
PPTX
Genetic Algorithm
SEKHARREDDYAMBATI
 
PPTX
Introduction to Item Response Theory
Nathan Thompson
 
Machine Learning Algorithms | Machine Learning Tutorial | Data Science Algori...
Simplilearn
 
Prediction of Euro 50 Using Back Propagation Neural Network (BPNN) and Geneti...
AI Publications
 
Implementing Item Response Theory
Nathan Thompson
 
Introduction to the Genetic Algorithm
Qiang Hao
 
Introduction to Item Response Theory
OpenThink Labs
 
Machine learning basics using trees algorithm (Random forest, Gradient Boosting)
Parth Khare
 
Genetic Algorithms
Alaa Khamis, PhD, SMIEEE
 
Topic_6
butest
 
Analysis of Parameter using Fuzzy Genetic Algorithm in E-learning System
Harshal Jain
 
Handling Imbalanced Data: SMOTE vs. Random Undersampling
IRJET Journal
 
Module 6: Ensemble Algorithms
Sara Hooker
 
Machine Learning Unit 2 Semester 3 MSc IT Part 2 Mumbai University
Madhav Mishra
 
Machine Learning Interview Questions
Rock Interview
 
Introduction to unidimensional item response model
Sumit Das
 
Classical Test Theory (CTT)- By Dr. Jai Singh
Academy for Higher Education and Social Science Research
 
Survival of the Fittest: Using Genetic Algorithm for Data Mining Optimization
Or Levi
 
Genetic Algorithm
SEKHARREDDYAMBATI
 
Introduction to Item Response Theory
Nathan Thompson
 

Viewers also liked (20)

PPTX
10-15 511 genetic algorithms and machine learning (alan nochenson)
Alan Nochenson
 
KEY
Github in a Graph
akollegger
 
PDF
Finding the insights hidden in your graph data
DataStax
 
PDF
Fouille de données issues d’un grand graphe par carte de Kohonen à noyau
tuxette
 
PPTX
Graph Database Prototyping made easy with Graphgen
Christophe Willemsen
 
PDF
20141015 how graphs revolutionize access management
Rik Van Bruggen
 
PDF
The Impact of Algorithmic Trading
Lov Loothra
 
PDF
Bringing graph technologies to data analysis : the case of Azerbaijan in th...
Linkurious
 
PPTX
NoSQL Graph Databases - Why, When and Where
Eugene Hanikblum
 
PDF
Link Analysis
Carlos Castillo (ChaTo)
 
PDF
An Introduction to Neural Networks and Machine Learning
Chris Nicholls
 
PDF
Reinforcing AML systems with graph technologies.
Linkurious
 
PDF
Introduction to the graph technologies landscape
Linkurious
 
PPTX
Building a Scalable Data Science Platform with R
DataWorks Summit/Hadoop Summit
 
PDF
GraphGen: Conducting Graph Analytics over Relational Databases
PyData
 
PDF
Solving Problems with Graphs
Marko Rodriguez
 
PDF
Using graph technologies to fight fraud
Linkurious
 
PDF
Intro to Graphs and Neo4j
Neo4j
 
DOCX
Link analysis .. Data Mining
Mustafa Salam
 
PPT
Business Intelligence For Anti-Money Laundering
Kartik Mehta
 
10-15 511 genetic algorithms and machine learning (alan nochenson)
Alan Nochenson
 
Github in a Graph
akollegger
 
Finding the insights hidden in your graph data
DataStax
 
Fouille de données issues d’un grand graphe par carte de Kohonen à noyau
tuxette
 
Graph Database Prototyping made easy with Graphgen
Christophe Willemsen
 
20141015 how graphs revolutionize access management
Rik Van Bruggen
 
The Impact of Algorithmic Trading
Lov Loothra
 
Bringing graph technologies to data analysis : the case of Azerbaijan in th...
Linkurious
 
NoSQL Graph Databases - Why, When and Where
Eugene Hanikblum
 
An Introduction to Neural Networks and Machine Learning
Chris Nicholls
 
Reinforcing AML systems with graph technologies.
Linkurious
 
Introduction to the graph technologies landscape
Linkurious
 
Building a Scalable Data Science Platform with R
DataWorks Summit/Hadoop Summit
 
GraphGen: Conducting Graph Analytics over Relational Databases
PyData
 
Solving Problems with Graphs
Marko Rodriguez
 
Using graph technologies to fight fraud
Linkurious
 
Intro to Graphs and Neo4j
Neo4j
 
Link analysis .. Data Mining
Mustafa Salam
 
Business Intelligence For Anti-Money Laundering
Kartik Mehta
 
Ad

Similar to Machine Learning, Data Mining, Genetic Algorithms, Neural ... (20)

PDF
EssentialsOfMachineLearning.pdf
Ankita Tiwari
 
PPT
coppin chapter 10e.ppt
butest
 
PPT
l15.ppt
butest
 
PPT
LECTURE8.PPT
butest
 
PPT
ML_Overview.ppt
ParveshKumar17303
 
PPTX
ML_Overview.pptx
ssuserb0b8ed1
 
PPT
ML_Overview.ppt
vijay251387
 
PPT
ML overview
NoopurRathore1
 
PPT
MLlecture1.ppt
butest
 
PPT
MLlecture1.ppt
butest
 
PPT
Different learning Techniques in Artificial Intelligence
vipsitaswati
 
PPTX
AI: Learning in AI
DataminingTools Inc
 
PPTX
AI: Learning in AI
Datamining Tools
 
PPT
Machine Learning
butest
 
PPT
c23_ml1.ppt
Faiz430036
 
PPT
Machine Learning and Inductive Inference
butest
 
PDF
Machine Learning- Perceptron_Backpropogation_Module 3.pdf
Dr. Shivashankar
 
PPT
Machine Learning
butest
 
PPT
Machine Learning
butest
 
PPTX
machine leraning : main principles and techniques
johngeorgakis99
 
EssentialsOfMachineLearning.pdf
Ankita Tiwari
 
coppin chapter 10e.ppt
butest
 
l15.ppt
butest
 
LECTURE8.PPT
butest
 
ML_Overview.ppt
ParveshKumar17303
 
ML_Overview.pptx
ssuserb0b8ed1
 
ML_Overview.ppt
vijay251387
 
ML overview
NoopurRathore1
 
MLlecture1.ppt
butest
 
MLlecture1.ppt
butest
 
Different learning Techniques in Artificial Intelligence
vipsitaswati
 
AI: Learning in AI
DataminingTools Inc
 
AI: Learning in AI
Datamining Tools
 
Machine Learning
butest
 
c23_ml1.ppt
Faiz430036
 
Machine Learning and Inductive Inference
butest
 
Machine Learning- Perceptron_Backpropogation_Module 3.pdf
Dr. Shivashankar
 
Machine Learning
butest
 
Machine Learning
butest
 
machine leraning : main principles and techniques
johngeorgakis99
 
Ad

More from butest (20)

PDF
EL MODELO DE NEGOCIO DE YOUTUBE
butest
 
DOC
1. MPEG I.B.P frame之不同
butest
 
PDF
LESSONS FROM THE MICHAEL JACKSON TRIAL
butest
 
PPT
Timeline: The Life of Michael Jackson
butest
 
DOCX
Popular Reading Last Updated April 1, 2010 Adams, Lorraine The ...
butest
 
PDF
LESSONS FROM THE MICHAEL JACKSON TRIAL
butest
 
PPTX
Com 380, Summer II
butest
 
PPT
PPT
butest
 
DOCX
The MYnstrel Free Press Volume 2: Economic Struggles, Meet Jazz
butest
 
DOC
MICHAEL JACKSON.doc
butest
 
PPTX
Social Networks: Twitter Facebook SL - Slide 1
butest
 
PPT
Facebook
butest
 
DOCX
Executive Summary Hare Chevrolet is a General Motors dealership ...
butest
 
DOC
Welcome to the Dougherty County Public Library's Facebook and ...
butest
 
DOC
NEWS ANNOUNCEMENT
butest
 
DOC
C-2100 Ultra Zoom.doc
butest
 
DOC
MAC Printing on ITS Printers.doc.doc
butest
 
DOC
Mac OS X Guide.doc
butest
 
DOC
hier
butest
 
DOC
WEB DESIGN!
butest
 
EL MODELO DE NEGOCIO DE YOUTUBE
butest
 
1. MPEG I.B.P frame之不同
butest
 
LESSONS FROM THE MICHAEL JACKSON TRIAL
butest
 
Timeline: The Life of Michael Jackson
butest
 
Popular Reading Last Updated April 1, 2010 Adams, Lorraine The ...
butest
 
LESSONS FROM THE MICHAEL JACKSON TRIAL
butest
 
Com 380, Summer II
butest
 
PPT
butest
 
The MYnstrel Free Press Volume 2: Economic Struggles, Meet Jazz
butest
 
MICHAEL JACKSON.doc
butest
 
Social Networks: Twitter Facebook SL - Slide 1
butest
 
Facebook
butest
 
Executive Summary Hare Chevrolet is a General Motors dealership ...
butest
 
Welcome to the Dougherty County Public Library's Facebook and ...
butest
 
NEWS ANNOUNCEMENT
butest
 
C-2100 Ultra Zoom.doc
butest
 
MAC Printing on ITS Printers.doc.doc
butest
 
Mac OS X Guide.doc
butest
 
hier
butest
 
WEB DESIGN!
butest
 

Machine Learning, Data Mining, Genetic Algorithms, Neural ...

  • 1. Machine Learning, Data Mining, Genetic Algorithms, Neural Networks ISYS370 Dr. R. Weber
  • 2. Concept Learning is a Form of Inductive Learning Learner uses: positive examples (instances ARE examples of a concept) and negative examples (instances ARE NOT examples of a concept)
  • 3. Concept Learning Needs empirical validation Dense or sparse data determine quality of different methods
  • 4. Validation of Concept Learning i The learned concept should be able to correctly classify new instances of the concept When it succeeds in a real instance of the concept it finds true positives When it fails in a real instance of the concept it finds false negatives
  • 5. Validation of Concept Learning ii The learned concept should be able to correctly classify new instances of the concept When it succeeds in a counterexample it finds true negatives When it fails in a counterexample it finds false positives
  • 6. Rule Learning Learning widely used in data mining Version Space Learning is a search method to learn rules Decision Trees
  • 7. Decision trees Knowledge representation formalism Represent mutually exclusive rules (disjunction) A way of breaking up a data set into classes or categories Classification rules that determine, for each instance with attribute values, whether it belongs to one or another class Not incremental
  • 8. Decision trees - leaf nodes (classes) - decision nodes (tests on attribute values) - from decision nodes branches grow for each possible outcome of the test From Cawsey, 1997
  • 9. Decision tree induction Goal is to correctly classify all example data Several algorithms to induce decision trees: ID3 (Quinlan 1979) , CLS, ACLS, ASSISTANT, IND, C4.5 Constructs decision tree from past data Attempts to find the simplest tree (not guaranteed because it is based on heuristics)
  • 10. From: a set of target classes Training data containing objects of more than one class ID3 uses test to refine the training data set into subsets that contain objects of only one class each Choosing the right test is the key ID3 algorithm
  • 11. Information gain or ‘minimum entropy’ Maximizing information gain corresponds to minimizing entropy Predictive features (good indicators of the outcome) How does ID3 chooses tests
  • 12. Information gain or ‘minimum entropy’ Maximizing information gain corresponds to minimizing entropy Predictive features (good indicators of the outcome) Choosing tests
  • 13. Bad 400 salaried 1,500 4 Very good 300 Waged 3,000 3 Very bad 600 Salaried 4,000 2 Good 200 Salaried 2,000 1 Loan status Repayment Job status Monthy income
  • 14. Link analysis Deviation detection Data mining tasks ii Rules: Association generation Relationships between entities How things change over time, trends
  • 15. KDD applications Fraud detection Telecom (calling cards, cell phones) Credit cards Health insurance Loan approval Investment analysis Marketing and sales data analysis Identify potential customers Effectiveness of sales campaign Store layout
  • 16. Text mining The problem starts with a query and the solution is a set of information (e.g., patterns, connections, profiles, trends) contained in several different texts that are potentially relevant to the initial query.
  • 17. Text mining applications IBM Text Navigator Cluster documents by content; Each document is annotated by the 2 most frequently used words in the cluster; Concept Extraction (Los Alamos) Text analysis of medical records; Uses a clustering approach based on trigram representation; Documents in vectors, cosine for comparison;
  • 18. rule-based ES case-based reasoning inductive ML, NN algorithms deductive reasoning analogical reasoning inductive reasoning search Problem solving method Reasoning type
  • 20. Genetic algorithms (i) learn by experimentation based on human genetics, it originates new solutions representational restrictions good to improve quality of other methods e.g., search algorithms, CBR evolutionary algorithms (broader)
  • 21. Genetic algorithms (ii) requires an evaluation function to guide the process population of genomes represent possible solutions operations are applied over these genomes operations can be mutation, crossover operations produce new offspring an evaluation function tests how fit an offspring is the fittest will survive to mate again
  • 22. Genetic Algorithms ii http://ai.bpa.arizona.edu/~mramsey/ga.html You can change parameters http://www.rennard.org/alife/english/gavgb.html Steven Thompson presented
  • 24. ~= 2 nd -5 th week training vision the evidence
  • 25. the evidence ~= 2 nd -5 th week training vision 10
  • 26. the evidence ~= 2 nd -5 th week training vision 10
  • 27. the evidence ~= 2 nd -5 th week training vision
  • 28. NN: model of brains input output neurons synapses electric transmissions :
  • 29. Elements input nodes output nodes links weights
  • 30. terminology input and output nodes (or units) connected by links each link has a numeric weight weights store information networks are trained on training sets (examples) and after are tested on test sets to assess networks’ accuracy learning/training takes place as weights are updated to reflect the input/output behavior
  • 31. The concept => mammal => bird 0 1 1 4 legs fly lay eggs 1 0 0 1  Yes, 0  No => mammal 1 1 0
  • 32. The concept => mammal => bird 0 1 1 4 legs fly lay eggs 1 0 0 => mammal 1 1 0 1  Yes, 0  No
  • 33. The concept => mammal => bird 0 1 1 4 legs fly lay eggs 1 0 0 => mammal 1 1 0 0.5 0.5 0.5 1  Yes, 0  No
  • 34. => mammal => bird 0 1 1 4 legs fly lay eggs 1 0 0 => mammal 1 1 0 0*0.5+1*0.5+1*0.5= 1 1*0.5+0*0.5+0*0.5= 0.5 1*0.5+1*0.5+0*0.5= 1 Goal is to have weights that recognize different representations of mammals and birds as such 0.5 0.5 0.5
  • 35. => mammal => bird 0 1 1 4 legs fly lay eggs 1 0 0 => mammal 1 1 0 0*0.5+1*0.5+1*0.5= 1 1*0.5+0*0.5+0*0.5= 0.5 1*0.5+1*0.5+0*0.5= 1 Suppose we want bird to be greater 0.5 and mammal to be equal or less than 0.5 0.5 0.5 0.5
  • 36. => mammal => bird 0 1 1 4 legs fly lay eggs 1 0 0 => mammal 1 1 0 0*0.25+1*0.25+1*0.5= 0.75 1*0.25+0*0.25+0*0.5= 0.25 1*0.25+1*0.25+0*0.5= 0.5 Suppose we want bird to be greater 0.5 and mammal to be equal or less than 0.5 0.25 0.25 0.5
  • 37. The training Output=Step(  w f ) learning takes place as weights are updated to reflect the input/output behavior => mammal (1) => bird (0) 0 1 1 4 legs flies eggs i=1 i=2 i=3 j=1 j=2 j=3 ij Goal minimize error between representation of the expected and actual outcome 20 ij 0 0 0 0 0 0 0 0 0 1 0 0 1 0 0 1 0 0 1 0 0 1 0 0 1 1 1 1 0 0 1 1 1 1 1 1
  • 39. Characteristics NN implement inductive learning algorithms (through generalization) therefore, it requires several training examples to learn NN do not provide an explanation why the task performed the way it was no explicit knowledge; uses data Classification (pattern recognition), clustering, diagnosis, optimization, forecasting (prediction), modeling, reconstruction, routing
  • 40. Where are NN applicable? Where they can form a model from training data alone; When there may be an algorithm, but it is not known, or has too many variables; There are enough examples available It is easier to let the network learn from examples Other inductive learning methods may not be as accurate
  • 41. Applications (i) predict movement of stocks, currencies, etc., from previous data; to recognize signatures made (e.g. in a bank) with those stored; to monitor the state of aircraft engines (by monitoring vibration levels and sound, early warning of engine problems can be given; British Rail have been testing an application to monitor diesel engines;
  • 42. Applications (ii) Pronunciation (rules with many exceptions) Handwritten character recognition (network w/ 200,000 is impossible to train, final 9,760 weights, used 7300 examples to train and 2,000 to test, 99% accuracy) Learn brain patterns to control and activate limbs as in the “Rats control a robot by thought alone” article Credit assignment
  • 43. CMU Driving ALVINN learns from human drivers how to steer a vehicle along a single lane on a highway ALVINN is implemented in two vehicles equipped with computer-controlled steering, acceleration, and braking cars can reach 70 m/h with ALVINN programs that consider all the problem environment reach 4 m/h only
  • 44. Why using NN for the driving task? there is no good theory of driving, but it is easy to collect training samples training data is obtained with a human* driving the vehicle 5min training, 10 min algorithm runs driving is continuous and noisy almost all features contribute with useful information *humans are not very good generators of training instances when they behave too regularly without making mistakes
  • 45. INPUT: video camera generates array of 30x32 grid of input nodes OUTPUT: 30 nodes layer corresponding to steering direction vehicle steers to the direction of the layer with highest activation the neural network

Editor's Notes

  • #15: What is predictive modeling? Predictive modeling uses demographic, medical and pharmacy claims information to determine the range and intensity of medical problems for a given population of insured persons. This assessment of risk allows health plans, payers and provider groups to plan, evaluate and fund health care management programs more effectively. From: http://www.dxcgrisksmart.com/faq.html
  • #20: TELL THE CAT STORY
  • #24: TELL THE CAT STORY