SlideShare a Scribd company logo
1Ā© 2017 The MathWorks, Inc.
Developing and Deploying Deep Learning Based
Computer Vision Systems
Alka Nair
Application Engineer
2
Alexnet Vehicle Detection
People detection
Lane detection
~30 Fps
(Tegra X1)
~66 Fps
(Tegra X1)
~20 Fps
(K40c)
~130 Fps
(K40c)
3
Deep Learning Applications in Computer Vision
HIGHWAY_SCENE
Classification
Semantic SegmentationRain Detection and Removal
Human Aware Navigation for Robots
4
Lane Detection on a Tesla K40 GPU
5
End-to-End Application: Lane Detection
Transfer Learning
AlexNet – 1000 class classification
Lane detection
CNN
Post-processing
(find left/right lane
points)
Image
Parabolic lane
coefficients in
world coordinates
Left lane co-efficients
Right lane co-efficients
Output of CNN is lane parabola coefficients according to: y = ax^2 + bx + c
MATLAB : A SINGLE PLATFORM FOR DEEP LEARNING TRAINING & DEPLOYMENT
6
Deep Learning Challenges
Big Data
ā–Ŗ Handling large amounts of data
ā–Ŗ Labeling thousands of images & videos
Training and Testing Deep Neural Networks
ā–Ŗ Accessing reference models from research
ā–Ŗ Understanding network behavior
ā–Ŗ Tuning hyperparameters and refining architectures
ā–Ŗ Training takes hours-days
Seamless Deployment onto embedded hardware
Real world systems use more than deep
learning
Deep learning frameworks do not
include ā€œclassicalā€ computer vision
Not a deep learning expert
7
Access Large Sets of Images
Handle Large Sets of Images
Easily manage large sets of images
- Single line of code to access images
- Operates on disk, database, big-data file system
imageData =
imageDataStore(ā€˜vehicles’)
Easily manage large sets of images
- Single line of code to access images
- Operates on disk, database, big-data file
system
Organize Images in Folders
(~ 10,000 images , 5 folders)
8
Handle big image collection without big changes
Images in local directory
Images on HDFS
9
Accelerating Ground Truth Labeling
10
Generate Training Data from Labeled Images
Labeled Lane Boundaries in
Image Coordinates
Correspond to coefficients of parabola representing left and right lane (a,b,c).
Ground Truth Exported from
Ground Truth Labeler App
Parabolic Lane Boundary
Modeling
>> findparabolicLaneBoundaries
Lane Boundary Models
in World Coordinates
11
End-to-End Application: Lane Detection
Transfer Learning
AlexNet – 1000 class classification
Lane detection
CNN
Post-processing
(find left/right lane
points)
Image
Parabolic lane
coefficients in
vehicle coordinates
Left lane co-efficients
Right lane co-efficients
Output of CNN is lane parabola coefficients according to: y = ax^2 + bx + c
MATLAB : A SINGLE PLATFORM FOR DEEP LEARNING TRAINING & DEPLOYMENT
12
Deep Learning Challenges
Big Data
ā–Ŗ Handling large amounts of data
ā–Ŗ Labeling thousands of images & videos
Training and Testing Deep Neural Networks
ā–Ŗ Accessing reference models from research
ā–Ŗ Understanding network behavior
ā–Ŗ Tuning hyperparameters and refining architectures
ā–Ŗ Training takes hours-days
Seamless Deployment onto embedded hardware
Real world systems use more than deep
learning
Deep learning frameworks do not
include ā€œclassicalā€ computer vision
Not a deep learning expert
13
Transfer Learning Workflow
Early layers Last layers
1 million images
1000s classes
Load pretrained network
Fewer classes
Learn faster
New layers
Replace final layers
100s images
10s classes
Training options
Train network
Trained
Network
Predict and assess
network accuracy
14
Import Pre-Trained Models and Network Architectures
Pretrained Models
ā–Ŗ AlexNet
ā–Ŗ VGG-16
ā–Ŗ VGG-19
ā–Ŗ GoogLeNet
ā–Ŗ Resnet50
ā–Ŗ InceptionV3
ā–Ŗ ResNet - 101
Import Models from Frameworks
ā–Ŗ Caffe Model Importer
(including Caffe Model Zoo)
– importCaffeLayers
– importCaffeNetwork
ā–Ŗ TensorFlow-Keras Model Importer
– importKerasLayers
– importKerasNetwork
Download from within MATLAB
net = alexnet;
net = vgg16;
net = vgg19;
net = googlenet;
net = resnet50;
net = inceptionv3;
net = resnet101;
15
Visualizations for Understanding Network Behavior
ā–Ŗ Custom visualizations
– Example: Class Activation Maps
Filters
…
Activations
Deep Dream
http://cnnlocalization.csail.mit.edu/Zhou_Learning_Deep_Features_CVPR_2016_paper.pdf
16
Augment Training Images
imageAugmenter = imageDataAugmenter('RandRotation',[-180 180])
Rotation
Reflection
Scaling
Shearing
Translation
Colour pre-processing
Resize / Random crop / Centre crop
17
Transfer Learning Workflow
Early layers Last layers
1 million images
1000s classes
Load pretrained network
Fewer classes
Learn faster
New layers
Replace final layers
100s images
10s classes
Training options
Train network
Trained
Network
Predict and assess
network accuracy
18
Transfer Learning Workflow
Early layers Last layers
1 million images
1000s classes
Load pretrained network
Fewer classes
Learn faster
New layers
Replace final layers
100s images
10s classes
Training options
Train network
Trained
Network
Predict and assess
network accuracy
19
Transfer Learning Workflow
Early layers Last layers
1 million images
1000s classes
Load pretrained network
Fewer classes
Learn faster
New layers
Replace final layers
100s images
10s classes
Training options
Train network
Trained
Network
Predict and assess
network accuracy
20
Training Deep Neural Networks
trainingOptions
ā–Ŗ Plot training metrics
– Training accuracy, smoothed training
accuracy, validation accuracy
– Training loss, smoothed training loss,
and validation loss
ā–Ŗ Debug training
– Stop and check current state
– Save / load checkpoint networks
– Custom output function (stopping
condition, visualization, etc.)
ā–Ŗ Bayesian optimization for
hyperparameter tuning Learn More
21
Transfer Learning Workflow
Early layers Last layers
1 million images
1000s classes
Load pretrained network
Fewer classes
Learn faster
New layers
Replace final layers
100s images
10s classes
Training options
Train network
Trained
Network
Predict and assess
network accuracy
MATLAB Provides Evaluation Frameworks for Different Classes of Deep Learning Problems
22
Lane Detection
23
Deep learning on CPU, GPU, multi-GPU and clusters
Single CPU Single CPU
Single GPU
Single CPU
Multiple GPUs
On-prem server with
GPUs
Cloud GPUs
(AWS, Azure, etc.)
Deep Learning on
Cloud Whitepaper
24
Training in MATLAB is fast
MATLAB is more than 4x
faster than TensorFlow
AlexNet CNN architecture trained on the ImageNet dataset, using batch size of 32, on a Windows 10 desktop with single
NVIDIA GPU (Titan Xp). TensorFlow version 1.2.0.
25
Deep Learning Challenges
Big Data
ā–Ŗ Handling large amounts of data
ā–Ŗ Labeling thousands of images & videos
Training and Testing Deep Neural Networks
ā–Ŗ Accessing reference models from research
ā–Ŗ Understanding network behavior
ā–Ŗ Tuning hyperparameters and refining architectures
ā–Ŗ Training takes hours-days
Seamless Deployment onto embedded hardware
Real world systems use more than deep
learning
Deep learning frameworks do not
include ā€œclassicalā€ computer vision
Not a deep learning expert
26
Algorithm Design to Embedded Deployment Workflow
Conventional Approach
Desktop GPU
High-level language
Deep learning framework
Large, complex software stack
1
Desktop GPU
C++
C/C++
Low-level APIs
Application-specific libraries
2
C++
Embedded GPU
C/C++
Target-optimized libraries
Optimize for memory & speed
3
Challenges
• Integrating multiple libraries and
packages
• Verifying and maintaining multiple
implementations
• Algorithm & vendor lock-in
27
GPU Coder for Deployment: New Product in R2017b
Neural Networks
Deep Learning, machine learning
Image Processing and
Computer Vision
Image filtering, feature detection/extraction
Signal Processing and
Communications
FFT, filtering, cross correlation,
7x faster than state-of-art 700x faster than CPUs
for feature extraction
20x faster than
CPUs for FFTs
GPU Coder
Accelerated implementation of
parallel algorithms on GPUs
28
Algorithm Design to Embedded Deployment Workflow
with GPU Coder
MATLAB algorithm
(functional reference)
Functional test1 Deployment
unit-test
2
Desktop
GPU
C++
Deployment
integration-test
3
Desktop
GPU
C++
Real-time test4
Embedded GPU
.mex .lib Cross-compiled
.lib
Build type
Call CUDA
from MATLAB
directly
Call CUDA from
(C++) hand-
coded main()
Call CUDA from (C++)
hand-coded main().
29
CUDA Code Generation from GPU Coder
30
31
End-to-End Application: Lane Detection
Transfer Learning
AlexNet – 1000 class classification
Lane detection
CNN
Post-processing
(find left/right lane
points)
Image
Parabolic lane
coefficients in
world coordinates
Left lane co-efficients
Right lane co-efficients
Output of CNN is lane parabola coefficients according to: y = ax^2 + bx + c
MATLAB : A SINGLE PLATFORM FOR DEEP LEARNING TRAINING & DEPLOYMENT
https://tinyurl.com/ybaxnxjg
32
Alexnet Inference on Intel CPUs
MATLAB
(R2017b Release 2)
TensorFlow
MXNet
Caffe2
33
Alexnet Inference on NVIDIA Titan Xp
MATLAB GPU Coder
(R2017b)
TensorFlow (1.2.0)
Caffe2 (0.8.1)
Framespersecond
Batch Size
CPU Intel(R) Xeon(R) CPU E5-1650 v3 @ 3.50GHz
GPU Pascal Titan Xp
cuDNN v5
Testing platform
MXNet (0.10)
MATLAB (R2017b)
2x 7x5x
34
Alexnet inference on NVIDIA GPUs
0
1
2
3
4
5
6
7
8
9 CPU resident memory
GPU peak memory (nvidia-smi)
Memoryusage(GB)
Batch Size1 16 32 64
CPU Intel(R) Xeon(R) CPU E5-1650 v3 @ 3.50 GHz
GPU Tesla K40c
Py-Caffe
GPUCoder
TensorFlow
MATLABw/PCT
C++-Caffe
35
Design Deep Neural Networks in MATLAB and Deploy with GPU
Coder
Design Deep Learning &
Vision Algorithms
Highlights
ā–Ŗ Manage large image sets
ā–Ŗ Easy access to models like AlexNet, GoogleNet
ā–Ŗ Pre-built training frameworks
ā–Ŗ Automate ground truth labeling apps
Highlights
ā–Ŗ Automate optimized CUDA code
generation with GPU Coder
ā–Ŗ Deployed models upto 4.5x faster
than Caffe2 and 7x faster than
Tensor
High Performance Deployment
36
Deep Learning Onramp
Free Introductory Course
Available Here
37
Visit MathWorks Booth to Learn More
HIGHWAY_SCENE
Classification
Car Car CarDetection
Lane Lane
Regression
Semantic
Segmentation

Recommended

Serverless Summit - Quiz
Serverless Summit - Quiz
CodeOps Technologies LLP
Ā 
Build a Chatbot in Ten Minutes - Dave Kerr - Serverless Summit
Build a Chatbot in Ten Minutes - Dave Kerr - Serverless Summit
CodeOps Technologies LLP
Ā 
Serverless Architecture Patterns - Manoj Ganapathi - Serverless Summit
Serverless Architecture Patterns - Manoj Ganapathi - Serverless Summit
CodeOps Technologies LLP
Ā 
Serverless in azure
Serverless in azure
Veresh Jain
Ā 
Performance Monitoring with AOP and Amazon CloudWatch
Performance Monitoring with AOP and Amazon CloudWatch
Yan Cui
Ā 
How to double .net code value
How to double .net code value
javOnet
Ā 
Democratizing Serverless—The Open Source Fn Project - Serverless Summit
Democratizing Serverless—The Open Source Fn Project - Serverless Summit
CodeOps Technologies LLP
Ā 
Clovać‚’ę”Æćˆć‚‹ęŠ€č”“ ę©Ÿę¢°å­¦ēæ’é…äæ”åŸŗē›¤ć®ć”ē“¹ä»‹
Clovać‚’ę”Æćˆć‚‹ęŠ€č”“ ę©Ÿę¢°å­¦ēæ’é…äæ”åŸŗē›¤ć®ć”ē“¹ä»‹
LINE Corporation
Ā 
Javantura v4 - Java and lambdas and streams - are they better than for loops ...
Javantura v4 - Java and lambdas and streams - are they better than for loops ...
HUJAK - Hrvatska udruga Java korisnika / Croatian Java User Association
Ā 
ONAP on Vagrant
ONAP on Vagrant
Victor Morales
Ā 
Architecture Sustaining LINE Sticker services
Architecture Sustaining LINE Sticker services
LINE Corporation
Ā 
SOA with PHP and Symfony
SOA with PHP and Symfony
MichalSchroeder
Ā 
Javantura v4 - FreeMarker in Spring web - Marin Kalapać
Javantura v4 - FreeMarker in Spring web - Marin Kalapać
HUJAK - Hrvatska udruga Java korisnika / Croatian Java User Association
Ā 
Reactive programming intro
Reactive programming intro
Ahmed Ehab AbdulAziz
Ā 
Continuous integration
Continuous integration
Lior Tal
Ā 
Sigmoid akka stream
Sigmoid akka stream
Sigmoid
Ā 
Flink Forward San Francisco 2019: Managing Flink on Kubernetes - FlinkK8sOper...
Flink Forward San Francisco 2019: Managing Flink on Kubernetes - FlinkK8sOper...
Flink Forward
Ā 
Will ServerLess kill containers and Operations
Will ServerLess kill containers and Operations
Stephane Woillez
Ā 
How to deploy your Apps in serverless-way using App Engine.pptx
How to deploy your Apps in serverless-way using App Engine.pptx
Institut Teknologi Sepuluh Nopember Surabaya
Ā 
Neuro4j Workflow Overview
Neuro4j Workflow Overview
Dmytro Pavlikovskiy
Ā 
Reactive programming
Reactive programming
Nick Hodge
Ā 
Foundational Design Patterns for Multi-Purpose Applications
Foundational Design Patterns for Multi-Purpose Applications
Ching-Hwa Yu
Ā 
APIdays Paris 2018 - Secure & Manage APIs with GraphQL, Ozair Sheikh, Directo...
APIdays Paris 2018 - Secure & Manage APIs with GraphQL, Ozair Sheikh, Directo...
apidays
Ā 
Automatic Forecasting using Prophet, Databricks, Delta Lake and MLflow
Automatic Forecasting using Prophet, Databricks, Delta Lake and MLflow
Databricks
Ā 
4. introduction to Asp.Net MVC - Part II
4. introduction to Asp.Net MVC - Part II
Rohit Rao
Ā 
Async CTP 3 Presentation for MUGH 2012
Async CTP 3 Presentation for MUGH 2012
Sri Kanth
Ā 
A SOA approximation on symfony
A SOA approximation on symfony
Joseluis Laso
Ā 
Reactive programming with rx java
Reactive programming with rx java
CongTrung Vnit
Ā 
"Deploying Deep Learning Models on Embedded Processors for Autonomous Systems...
"Deploying Deep Learning Models on Embedded Processors for Autonomous Systems...
Edge AI and Vision Alliance
Ā 
"Deep Learning and Vision Algorithm Development in MATLAB Targeting Embedded ...
"Deep Learning and Vision Algorithm Development in MATLAB Targeting Embedded ...
Edge AI and Vision Alliance
Ā 

More Related Content

What's hot (20)

Javantura v4 - Java and lambdas and streams - are they better than for loops ...
Javantura v4 - Java and lambdas and streams - are they better than for loops ...
HUJAK - Hrvatska udruga Java korisnika / Croatian Java User Association
Ā 
ONAP on Vagrant
ONAP on Vagrant
Victor Morales
Ā 
Architecture Sustaining LINE Sticker services
Architecture Sustaining LINE Sticker services
LINE Corporation
Ā 
SOA with PHP and Symfony
SOA with PHP and Symfony
MichalSchroeder
Ā 
Javantura v4 - FreeMarker in Spring web - Marin Kalapać
Javantura v4 - FreeMarker in Spring web - Marin Kalapać
HUJAK - Hrvatska udruga Java korisnika / Croatian Java User Association
Ā 
Reactive programming intro
Reactive programming intro
Ahmed Ehab AbdulAziz
Ā 
Continuous integration
Continuous integration
Lior Tal
Ā 
Sigmoid akka stream
Sigmoid akka stream
Sigmoid
Ā 
Flink Forward San Francisco 2019: Managing Flink on Kubernetes - FlinkK8sOper...
Flink Forward San Francisco 2019: Managing Flink on Kubernetes - FlinkK8sOper...
Flink Forward
Ā 
Will ServerLess kill containers and Operations
Will ServerLess kill containers and Operations
Stephane Woillez
Ā 
How to deploy your Apps in serverless-way using App Engine.pptx
How to deploy your Apps in serverless-way using App Engine.pptx
Institut Teknologi Sepuluh Nopember Surabaya
Ā 
Neuro4j Workflow Overview
Neuro4j Workflow Overview
Dmytro Pavlikovskiy
Ā 
Reactive programming
Reactive programming
Nick Hodge
Ā 
Foundational Design Patterns for Multi-Purpose Applications
Foundational Design Patterns for Multi-Purpose Applications
Ching-Hwa Yu
Ā 
APIdays Paris 2018 - Secure & Manage APIs with GraphQL, Ozair Sheikh, Directo...
APIdays Paris 2018 - Secure & Manage APIs with GraphQL, Ozair Sheikh, Directo...
apidays
Ā 
Automatic Forecasting using Prophet, Databricks, Delta Lake and MLflow
Automatic Forecasting using Prophet, Databricks, Delta Lake and MLflow
Databricks
Ā 
4. introduction to Asp.Net MVC - Part II
4. introduction to Asp.Net MVC - Part II
Rohit Rao
Ā 
Async CTP 3 Presentation for MUGH 2012
Async CTP 3 Presentation for MUGH 2012
Sri Kanth
Ā 
A SOA approximation on symfony
A SOA approximation on symfony
Joseluis Laso
Ā 
Reactive programming with rx java
Reactive programming with rx java
CongTrung Vnit
Ā 
Architecture Sustaining LINE Sticker services
Architecture Sustaining LINE Sticker services
LINE Corporation
Ā 
SOA with PHP and Symfony
SOA with PHP and Symfony
MichalSchroeder
Ā 
Continuous integration
Continuous integration
Lior Tal
Ā 
Sigmoid akka stream
Sigmoid akka stream
Sigmoid
Ā 
Flink Forward San Francisco 2019: Managing Flink on Kubernetes - FlinkK8sOper...
Flink Forward San Francisco 2019: Managing Flink on Kubernetes - FlinkK8sOper...
Flink Forward
Ā 
Will ServerLess kill containers and Operations
Will ServerLess kill containers and Operations
Stephane Woillez
Ā 
Reactive programming
Reactive programming
Nick Hodge
Ā 
Foundational Design Patterns for Multi-Purpose Applications
Foundational Design Patterns for Multi-Purpose Applications
Ching-Hwa Yu
Ā 
APIdays Paris 2018 - Secure & Manage APIs with GraphQL, Ozair Sheikh, Directo...
APIdays Paris 2018 - Secure & Manage APIs with GraphQL, Ozair Sheikh, Directo...
apidays
Ā 
Automatic Forecasting using Prophet, Databricks, Delta Lake and MLflow
Automatic Forecasting using Prophet, Databricks, Delta Lake and MLflow
Databricks
Ā 
4. introduction to Asp.Net MVC - Part II
4. introduction to Asp.Net MVC - Part II
Rohit Rao
Ā 
Async CTP 3 Presentation for MUGH 2012
Async CTP 3 Presentation for MUGH 2012
Sri Kanth
Ā 
A SOA approximation on symfony
A SOA approximation on symfony
Joseluis Laso
Ā 
Reactive programming with rx java
Reactive programming with rx java
CongTrung Vnit
Ā 

Similar to Developing and Deploying Deep Learning Based Computer Vision Systems - Alka Nair - MathWorks (20)

"Deploying Deep Learning Models on Embedded Processors for Autonomous Systems...
"Deploying Deep Learning Models on Embedded Processors for Autonomous Systems...
Edge AI and Vision Alliance
Ā 
"Deep Learning and Vision Algorithm Development in MATLAB Targeting Embedded ...
"Deep Learning and Vision Algorithm Development in MATLAB Targeting Embedded ...
Edge AI and Vision Alliance
Ā 
Introducing Deep learning with Matlab
Introducing Deep learning with Matlab
Massimo Talia
Ā 
"Large-Scale Deep Learning for Building Intelligent Computer Systems," a Keyn...
"Large-Scale Deep Learning for Building Intelligent Computer Systems," a Keyn...
Edge AI and Vision Alliance
Ā 
NVIDIA ę·±åŗ¦å­øēæ’ę•™č‚²ę©Ÿę§‹ (DLI): Neural network deployment
NVIDIA ę·±åŗ¦å­øēæ’ę•™č‚²ę©Ÿę§‹ (DLI): Neural network deployment
NVIDIA Taiwan
Ā 
Deep-learning-for-computer-vision-applications-using-matlab.pdf
Deep-learning-for-computer-vision-applications-using-matlab.pdf
AubainYro1
Ā 
Large Scale Deep Learning with TensorFlow
Large Scale Deep Learning with TensorFlow
Jen Aman
Ā 
Introduction to Deep Learning (NVIDIA)
Introduction to Deep Learning (NVIDIA)
Rakuten Group, Inc.
Ā 
Enabling Artificial Intelligence - Alison B. Lowndes
Enabling Artificial Intelligence - Alison B. Lowndes
WithTheBest
Ā 
Deep learning-practical
Deep learning-practical
Hitesh Mohapatra
Ā 
Introducing TensorFlow: The game changer in building "intelligent" applications
Introducing TensorFlow: The game changer in building "intelligent" applications
Rokesh Jankie
Ā 
ā€œDeploying Deep Learning Applications on FPGAs with MATLAB,ā€ a Presentation f...
ā€œDeploying Deep Learning Applications on FPGAs with MATLAB,ā€ a Presentation f...
Edge AI and Vision Alliance
Ā 
A Platform for Accelerating Machine Learning Applications
A Platform for Accelerating Machine Learning Applications
NVIDIA Taiwan
Ā 
abelbrownnvidiarakuten2016-170208065814 (1).pptx
abelbrownnvidiarakuten2016-170208065814 (1).pptx
gopikahari7
Ā 
Introduction to deep learning
Introduction to deep learning
Abhishek Bhandwaldar
Ā 
Learn to Build an App to Find Similar Images using Deep Learning- Piotr Teterwak
Learn to Build an App to Find Similar Images using Deep Learning- Piotr Teterwak
PyData
Ā 
Xilinx Inference solution for DL using OpenPOWER systems
Xilinx Inference solution for DL using OpenPOWER systems
Ganesan Narayanasamy
Ā 
DLD meetup 2017, Efficient Deep Learning
DLD meetup 2017, Efficient Deep Learning
Brodmann17
Ā 
BMW:Application and Deployment of Deep Learning in Autonomous Driving.pdf
BMW:Application and Deployment of Deep Learning in Autonomous Driving.pdf
keithzhangding
Ā 
Neural Networks from Scratch - TensorFlow 101
Neural Networks from Scratch - TensorFlow 101
Gerold Bausch
Ā 
"Deploying Deep Learning Models on Embedded Processors for Autonomous Systems...
"Deploying Deep Learning Models on Embedded Processors for Autonomous Systems...
Edge AI and Vision Alliance
Ā 
"Deep Learning and Vision Algorithm Development in MATLAB Targeting Embedded ...
"Deep Learning and Vision Algorithm Development in MATLAB Targeting Embedded ...
Edge AI and Vision Alliance
Ā 
Introducing Deep learning with Matlab
Introducing Deep learning with Matlab
Massimo Talia
Ā 
"Large-Scale Deep Learning for Building Intelligent Computer Systems," a Keyn...
"Large-Scale Deep Learning for Building Intelligent Computer Systems," a Keyn...
Edge AI and Vision Alliance
Ā 
NVIDIA ę·±åŗ¦å­øēæ’ę•™č‚²ę©Ÿę§‹ (DLI): Neural network deployment
NVIDIA ę·±åŗ¦å­øēæ’ę•™č‚²ę©Ÿę§‹ (DLI): Neural network deployment
NVIDIA Taiwan
Ā 
Deep-learning-for-computer-vision-applications-using-matlab.pdf
Deep-learning-for-computer-vision-applications-using-matlab.pdf
AubainYro1
Ā 
Large Scale Deep Learning with TensorFlow
Large Scale Deep Learning with TensorFlow
Jen Aman
Ā 
Introduction to Deep Learning (NVIDIA)
Introduction to Deep Learning (NVIDIA)
Rakuten Group, Inc.
Ā 
Enabling Artificial Intelligence - Alison B. Lowndes
Enabling Artificial Intelligence - Alison B. Lowndes
WithTheBest
Ā 
Deep learning-practical
Deep learning-practical
Hitesh Mohapatra
Ā 
Introducing TensorFlow: The game changer in building "intelligent" applications
Introducing TensorFlow: The game changer in building "intelligent" applications
Rokesh Jankie
Ā 
ā€œDeploying Deep Learning Applications on FPGAs with MATLAB,ā€ a Presentation f...
ā€œDeploying Deep Learning Applications on FPGAs with MATLAB,ā€ a Presentation f...
Edge AI and Vision Alliance
Ā 
A Platform for Accelerating Machine Learning Applications
A Platform for Accelerating Machine Learning Applications
NVIDIA Taiwan
Ā 
abelbrownnvidiarakuten2016-170208065814 (1).pptx
abelbrownnvidiarakuten2016-170208065814 (1).pptx
gopikahari7
Ā 
Introduction to deep learning
Introduction to deep learning
Abhishek Bhandwaldar
Ā 
Learn to Build an App to Find Similar Images using Deep Learning- Piotr Teterwak
Learn to Build an App to Find Similar Images using Deep Learning- Piotr Teterwak
PyData
Ā 
Xilinx Inference solution for DL using OpenPOWER systems
Xilinx Inference solution for DL using OpenPOWER systems
Ganesan Narayanasamy
Ā 
DLD meetup 2017, Efficient Deep Learning
DLD meetup 2017, Efficient Deep Learning
Brodmann17
Ā 
BMW:Application and Deployment of Deep Learning in Autonomous Driving.pdf
BMW:Application and Deployment of Deep Learning in Autonomous Driving.pdf
keithzhangding
Ā 
Neural Networks from Scratch - TensorFlow 101
Neural Networks from Scratch - TensorFlow 101
Gerold Bausch
Ā 

More from CodeOps Technologies LLP (20)

AWS Serverless Event-driven Architecture - in lastminute.com meetup
AWS Serverless Event-driven Architecture - in lastminute.com meetup
CodeOps Technologies LLP
Ā 
Understanding azure batch service
Understanding azure batch service
CodeOps Technologies LLP
Ā 
DEVOPS AND MACHINE LEARNING
DEVOPS AND MACHINE LEARNING
CodeOps Technologies LLP
Ā 
SERVERLESS MIDDLEWARE IN AZURE FUNCTIONS
SERVERLESS MIDDLEWARE IN AZURE FUNCTIONS
CodeOps Technologies LLP
Ā 
BUILDING SERVERLESS SOLUTIONS WITH AZURE FUNCTIONS
BUILDING SERVERLESS SOLUTIONS WITH AZURE FUNCTIONS
CodeOps Technologies LLP
Ā 
APPLYING DEVOPS STRATEGIES ON SCALE USING AZURE DEVOPS SERVICES
APPLYING DEVOPS STRATEGIES ON SCALE USING AZURE DEVOPS SERVICES
CodeOps Technologies LLP
Ā 
BUILD, TEST & DEPLOY .NET CORE APPS IN AZURE DEVOPS
BUILD, TEST & DEPLOY .NET CORE APPS IN AZURE DEVOPS
CodeOps Technologies LLP
Ā 
CREATE RELIABLE AND LOW-CODE APPLICATION IN SERVERLESS MANNER
CREATE RELIABLE AND LOW-CODE APPLICATION IN SERVERLESS MANNER
CodeOps Technologies LLP
Ā 
CREATING REAL TIME DASHBOARD WITH BLAZOR, AZURE FUNCTION COSMOS DB AN AZURE S...
CREATING REAL TIME DASHBOARD WITH BLAZOR, AZURE FUNCTION COSMOS DB AN AZURE S...
CodeOps Technologies LLP
Ā 
WRITE SCALABLE COMMUNICATION APPLICATION WITH POWER OF SERVERLESS
WRITE SCALABLE COMMUNICATION APPLICATION WITH POWER OF SERVERLESS
CodeOps Technologies LLP
Ā 
Training And Serving ML Model Using Kubeflow by Jayesh Sharma
Training And Serving ML Model Using Kubeflow by Jayesh Sharma
CodeOps Technologies LLP
Ā 
Deploy Microservices To Kubernetes Without Secrets by Reenu Saluja
Deploy Microservices To Kubernetes Without Secrets by Reenu Saluja
CodeOps Technologies LLP
Ā 
Leverage Azure Tech stack for any Kubernetes cluster via Azure Arc by Saiyam ...
Leverage Azure Tech stack for any Kubernetes cluster via Azure Arc by Saiyam ...
CodeOps Technologies LLP
Ā 
YAML Tips For Kubernetes by Neependra Khare
YAML Tips For Kubernetes by Neependra Khare
CodeOps Technologies LLP
Ā 
Must Know Azure Kubernetes Best Practices And Features For Better Resiliency ...
Must Know Azure Kubernetes Best Practices And Features For Better Resiliency ...
CodeOps Technologies LLP
Ā 
Monitor Azure Kubernetes Cluster With Prometheus by Mamta Jha
Monitor Azure Kubernetes Cluster With Prometheus by Mamta Jha
CodeOps Technologies LLP
Ā 
Jet brains space intro presentation
Jet brains space intro presentation
CodeOps Technologies LLP
Ā 
Functional Programming in Java 8 - Lambdas and Streams
Functional Programming in Java 8 - Lambdas and Streams
CodeOps Technologies LLP
Ā 
Distributed Tracing: New DevOps Foundation
Distributed Tracing: New DevOps Foundation
CodeOps Technologies LLP
Ā 
"Distributed Tracing: New DevOps Foundation" by Jayesh Ahire
"Distributed Tracing: New DevOps Foundation" by Jayesh Ahire
CodeOps Technologies LLP
Ā 
AWS Serverless Event-driven Architecture - in lastminute.com meetup
AWS Serverless Event-driven Architecture - in lastminute.com meetup
CodeOps Technologies LLP
Ā 
SERVERLESS MIDDLEWARE IN AZURE FUNCTIONS
SERVERLESS MIDDLEWARE IN AZURE FUNCTIONS
CodeOps Technologies LLP
Ā 
BUILDING SERVERLESS SOLUTIONS WITH AZURE FUNCTIONS
BUILDING SERVERLESS SOLUTIONS WITH AZURE FUNCTIONS
CodeOps Technologies LLP
Ā 
APPLYING DEVOPS STRATEGIES ON SCALE USING AZURE DEVOPS SERVICES
APPLYING DEVOPS STRATEGIES ON SCALE USING AZURE DEVOPS SERVICES
CodeOps Technologies LLP
Ā 
BUILD, TEST & DEPLOY .NET CORE APPS IN AZURE DEVOPS
BUILD, TEST & DEPLOY .NET CORE APPS IN AZURE DEVOPS
CodeOps Technologies LLP
Ā 
CREATE RELIABLE AND LOW-CODE APPLICATION IN SERVERLESS MANNER
CREATE RELIABLE AND LOW-CODE APPLICATION IN SERVERLESS MANNER
CodeOps Technologies LLP
Ā 
CREATING REAL TIME DASHBOARD WITH BLAZOR, AZURE FUNCTION COSMOS DB AN AZURE S...
CREATING REAL TIME DASHBOARD WITH BLAZOR, AZURE FUNCTION COSMOS DB AN AZURE S...
CodeOps Technologies LLP
Ā 
WRITE SCALABLE COMMUNICATION APPLICATION WITH POWER OF SERVERLESS
WRITE SCALABLE COMMUNICATION APPLICATION WITH POWER OF SERVERLESS
CodeOps Technologies LLP
Ā 
Training And Serving ML Model Using Kubeflow by Jayesh Sharma
Training And Serving ML Model Using Kubeflow by Jayesh Sharma
CodeOps Technologies LLP
Ā 
Deploy Microservices To Kubernetes Without Secrets by Reenu Saluja
Deploy Microservices To Kubernetes Without Secrets by Reenu Saluja
CodeOps Technologies LLP
Ā 
Leverage Azure Tech stack for any Kubernetes cluster via Azure Arc by Saiyam ...
Leverage Azure Tech stack for any Kubernetes cluster via Azure Arc by Saiyam ...
CodeOps Technologies LLP
Ā 
YAML Tips For Kubernetes by Neependra Khare
YAML Tips For Kubernetes by Neependra Khare
CodeOps Technologies LLP
Ā 
Must Know Azure Kubernetes Best Practices And Features For Better Resiliency ...
Must Know Azure Kubernetes Best Practices And Features For Better Resiliency ...
CodeOps Technologies LLP
Ā 
Monitor Azure Kubernetes Cluster With Prometheus by Mamta Jha
Monitor Azure Kubernetes Cluster With Prometheus by Mamta Jha
CodeOps Technologies LLP
Ā 
Functional Programming in Java 8 - Lambdas and Streams
Functional Programming in Java 8 - Lambdas and Streams
CodeOps Technologies LLP
Ā 
Distributed Tracing: New DevOps Foundation
Distributed Tracing: New DevOps Foundation
CodeOps Technologies LLP
Ā 
"Distributed Tracing: New DevOps Foundation" by Jayesh Ahire
"Distributed Tracing: New DevOps Foundation" by Jayesh Ahire
CodeOps Technologies LLP
Ā 

Recently uploaded (20)

You are not excused! How to avoid security blind spots on the way to production
You are not excused! How to avoid security blind spots on the way to production
Michele Leroux Bustamante
Ā 
ReSTIR [DI]: Spatiotemporal reservoir resampling for real-time ray tracing ...
ReSTIR [DI]: Spatiotemporal reservoir resampling for real-time ray tracing ...
revolcs10
Ā 
CapCut Pro Crack For PC Latest Version {Fully Unlocked} 2025
CapCut Pro Crack For PC Latest Version {Fully Unlocked} 2025
pcprocore
Ā 
A Constitutional Quagmire - Ethical Minefields of AI, Cyber, and Privacy.pdf
A Constitutional Quagmire - Ethical Minefields of AI, Cyber, and Privacy.pdf
Priyanka Aash
Ā 
9-1-1 Addressing: End-to-End Automation Using FME
9-1-1 Addressing: End-to-End Automation Using FME
Safe Software
Ā 
Curietech AI in action - Accelerate MuleSoft development
Curietech AI in action - Accelerate MuleSoft development
shyamraj55
Ā 
UserCon Belgium: Honey, VMware increased my bill
UserCon Belgium: Honey, VMware increased my bill
stijn40
Ā 
Wenn alles versagt - IBM Tape schützt, was zählt! Und besonders mit dem neust...
Wenn alles versagt - IBM Tape schützt, was zählt! Und besonders mit dem neust...
Josef Weingand
Ā 
"Database isolation: how we deal with hundreds of direct connections to the d...
"Database isolation: how we deal with hundreds of direct connections to the d...
Fwdays
Ā 
AI vs Human Writing: Can You Tell the Difference?
AI vs Human Writing: Can You Tell the Difference?
Shashi Sathyanarayana, Ph.D
Ā 
"Scaling in space and time with Temporal", Andriy Lupa.pdf
"Scaling in space and time with Temporal", Andriy Lupa.pdf
Fwdays
Ā 
From Manual to Auto Searching- FME in the Driver's Seat
From Manual to Auto Searching- FME in the Driver's Seat
Safe Software
Ā 
Connecting Data and Intelligence: The Role of FME in Machine Learning
Connecting Data and Intelligence: The Role of FME in Machine Learning
Safe Software
Ā 
Coordinated Disclosure for ML - What's Different and What's the Same.pdf
Coordinated Disclosure for ML - What's Different and What's the Same.pdf
Priyanka Aash
Ā 
PyCon SG 25 - Firecracker Made Easy with Python.pdf
PyCon SG 25 - Firecracker Made Easy with Python.pdf
Muhammad Yuga Nugraha
Ā 
Daily Lesson Log MATATAG ICT TEchnology 8
Daily Lesson Log MATATAG ICT TEchnology 8
LOIDAALMAZAN3
Ā 
Oh, the Possibilities - Balancing Innovation and Risk with Generative AI.pdf
Oh, the Possibilities - Balancing Innovation and Risk with Generative AI.pdf
Priyanka Aash
Ā 
Security Tips for Enterprise Azure Solutions
Security Tips for Enterprise Azure Solutions
Michele Leroux Bustamante
Ā 
EIS-Webinar-Engineering-Retail-Infrastructure-06-16-2025.pdf
EIS-Webinar-Engineering-Retail-Infrastructure-06-16-2025.pdf
Earley Information Science
Ā 
Tech-ASan: Two-stage check for Address Sanitizer - Yixuan Cao.pdf
Tech-ASan: Two-stage check for Address Sanitizer - Yixuan Cao.pdf
caoyixuan2019
Ā 
You are not excused! How to avoid security blind spots on the way to production
You are not excused! How to avoid security blind spots on the way to production
Michele Leroux Bustamante
Ā 
ReSTIR [DI]: Spatiotemporal reservoir resampling for real-time ray tracing ...
ReSTIR [DI]: Spatiotemporal reservoir resampling for real-time ray tracing ...
revolcs10
Ā 
CapCut Pro Crack For PC Latest Version {Fully Unlocked} 2025
CapCut Pro Crack For PC Latest Version {Fully Unlocked} 2025
pcprocore
Ā 
A Constitutional Quagmire - Ethical Minefields of AI, Cyber, and Privacy.pdf
A Constitutional Quagmire - Ethical Minefields of AI, Cyber, and Privacy.pdf
Priyanka Aash
Ā 
9-1-1 Addressing: End-to-End Automation Using FME
9-1-1 Addressing: End-to-End Automation Using FME
Safe Software
Ā 
Curietech AI in action - Accelerate MuleSoft development
Curietech AI in action - Accelerate MuleSoft development
shyamraj55
Ā 
UserCon Belgium: Honey, VMware increased my bill
UserCon Belgium: Honey, VMware increased my bill
stijn40
Ā 
Wenn alles versagt - IBM Tape schützt, was zählt! Und besonders mit dem neust...
Wenn alles versagt - IBM Tape schützt, was zählt! Und besonders mit dem neust...
Josef Weingand
Ā 
"Database isolation: how we deal with hundreds of direct connections to the d...
"Database isolation: how we deal with hundreds of direct connections to the d...
Fwdays
Ā 
AI vs Human Writing: Can You Tell the Difference?
AI vs Human Writing: Can You Tell the Difference?
Shashi Sathyanarayana, Ph.D
Ā 
"Scaling in space and time with Temporal", Andriy Lupa.pdf
"Scaling in space and time with Temporal", Andriy Lupa.pdf
Fwdays
Ā 
From Manual to Auto Searching- FME in the Driver's Seat
From Manual to Auto Searching- FME in the Driver's Seat
Safe Software
Ā 
Connecting Data and Intelligence: The Role of FME in Machine Learning
Connecting Data and Intelligence: The Role of FME in Machine Learning
Safe Software
Ā 
Coordinated Disclosure for ML - What's Different and What's the Same.pdf
Coordinated Disclosure for ML - What's Different and What's the Same.pdf
Priyanka Aash
Ā 
PyCon SG 25 - Firecracker Made Easy with Python.pdf
PyCon SG 25 - Firecracker Made Easy with Python.pdf
Muhammad Yuga Nugraha
Ā 
Daily Lesson Log MATATAG ICT TEchnology 8
Daily Lesson Log MATATAG ICT TEchnology 8
LOIDAALMAZAN3
Ā 
Oh, the Possibilities - Balancing Innovation and Risk with Generative AI.pdf
Oh, the Possibilities - Balancing Innovation and Risk with Generative AI.pdf
Priyanka Aash
Ā 
Security Tips for Enterprise Azure Solutions
Security Tips for Enterprise Azure Solutions
Michele Leroux Bustamante
Ā 
EIS-Webinar-Engineering-Retail-Infrastructure-06-16-2025.pdf
EIS-Webinar-Engineering-Retail-Infrastructure-06-16-2025.pdf
Earley Information Science
Ā 
Tech-ASan: Two-stage check for Address Sanitizer - Yixuan Cao.pdf
Tech-ASan: Two-stage check for Address Sanitizer - Yixuan Cao.pdf
caoyixuan2019
Ā 

Developing and Deploying Deep Learning Based Computer Vision Systems - Alka Nair - MathWorks

  • 1. 1Ā© 2017 The MathWorks, Inc. Developing and Deploying Deep Learning Based Computer Vision Systems Alka Nair Application Engineer
  • 2. 2 Alexnet Vehicle Detection People detection Lane detection ~30 Fps (Tegra X1) ~66 Fps (Tegra X1) ~20 Fps (K40c) ~130 Fps (K40c)
  • 3. 3 Deep Learning Applications in Computer Vision HIGHWAY_SCENE Classification Semantic SegmentationRain Detection and Removal Human Aware Navigation for Robots
  • 4. 4 Lane Detection on a Tesla K40 GPU
  • 5. 5 End-to-End Application: Lane Detection Transfer Learning AlexNet – 1000 class classification Lane detection CNN Post-processing (find left/right lane points) Image Parabolic lane coefficients in world coordinates Left lane co-efficients Right lane co-efficients Output of CNN is lane parabola coefficients according to: y = ax^2 + bx + c MATLAB : A SINGLE PLATFORM FOR DEEP LEARNING TRAINING & DEPLOYMENT
  • 6. 6 Deep Learning Challenges Big Data ā–Ŗ Handling large amounts of data ā–Ŗ Labeling thousands of images & videos Training and Testing Deep Neural Networks ā–Ŗ Accessing reference models from research ā–Ŗ Understanding network behavior ā–Ŗ Tuning hyperparameters and refining architectures ā–Ŗ Training takes hours-days Seamless Deployment onto embedded hardware Real world systems use more than deep learning Deep learning frameworks do not include ā€œclassicalā€ computer vision Not a deep learning expert
  • 7. 7 Access Large Sets of Images Handle Large Sets of Images Easily manage large sets of images - Single line of code to access images - Operates on disk, database, big-data file system imageData = imageDataStore(ā€˜vehicles’) Easily manage large sets of images - Single line of code to access images - Operates on disk, database, big-data file system Organize Images in Folders (~ 10,000 images , 5 folders)
  • 8. 8 Handle big image collection without big changes Images in local directory Images on HDFS
  • 10. 10 Generate Training Data from Labeled Images Labeled Lane Boundaries in Image Coordinates Correspond to coefficients of parabola representing left and right lane (a,b,c). Ground Truth Exported from Ground Truth Labeler App Parabolic Lane Boundary Modeling >> findparabolicLaneBoundaries Lane Boundary Models in World Coordinates
  • 11. 11 End-to-End Application: Lane Detection Transfer Learning AlexNet – 1000 class classification Lane detection CNN Post-processing (find left/right lane points) Image Parabolic lane coefficients in vehicle coordinates Left lane co-efficients Right lane co-efficients Output of CNN is lane parabola coefficients according to: y = ax^2 + bx + c MATLAB : A SINGLE PLATFORM FOR DEEP LEARNING TRAINING & DEPLOYMENT
  • 12. 12 Deep Learning Challenges Big Data ā–Ŗ Handling large amounts of data ā–Ŗ Labeling thousands of images & videos Training and Testing Deep Neural Networks ā–Ŗ Accessing reference models from research ā–Ŗ Understanding network behavior ā–Ŗ Tuning hyperparameters and refining architectures ā–Ŗ Training takes hours-days Seamless Deployment onto embedded hardware Real world systems use more than deep learning Deep learning frameworks do not include ā€œclassicalā€ computer vision Not a deep learning expert
  • 13. 13 Transfer Learning Workflow Early layers Last layers 1 million images 1000s classes Load pretrained network Fewer classes Learn faster New layers Replace final layers 100s images 10s classes Training options Train network Trained Network Predict and assess network accuracy
  • 14. 14 Import Pre-Trained Models and Network Architectures Pretrained Models ā–Ŗ AlexNet ā–Ŗ VGG-16 ā–Ŗ VGG-19 ā–Ŗ GoogLeNet ā–Ŗ Resnet50 ā–Ŗ InceptionV3 ā–Ŗ ResNet - 101 Import Models from Frameworks ā–Ŗ Caffe Model Importer (including Caffe Model Zoo) – importCaffeLayers – importCaffeNetwork ā–Ŗ TensorFlow-Keras Model Importer – importKerasLayers – importKerasNetwork Download from within MATLAB net = alexnet; net = vgg16; net = vgg19; net = googlenet; net = resnet50; net = inceptionv3; net = resnet101;
  • 15. 15 Visualizations for Understanding Network Behavior ā–Ŗ Custom visualizations – Example: Class Activation Maps Filters … Activations Deep Dream http://cnnlocalization.csail.mit.edu/Zhou_Learning_Deep_Features_CVPR_2016_paper.pdf
  • 16. 16 Augment Training Images imageAugmenter = imageDataAugmenter('RandRotation',[-180 180]) Rotation Reflection Scaling Shearing Translation Colour pre-processing Resize / Random crop / Centre crop
  • 17. 17 Transfer Learning Workflow Early layers Last layers 1 million images 1000s classes Load pretrained network Fewer classes Learn faster New layers Replace final layers 100s images 10s classes Training options Train network Trained Network Predict and assess network accuracy
  • 18. 18 Transfer Learning Workflow Early layers Last layers 1 million images 1000s classes Load pretrained network Fewer classes Learn faster New layers Replace final layers 100s images 10s classes Training options Train network Trained Network Predict and assess network accuracy
  • 19. 19 Transfer Learning Workflow Early layers Last layers 1 million images 1000s classes Load pretrained network Fewer classes Learn faster New layers Replace final layers 100s images 10s classes Training options Train network Trained Network Predict and assess network accuracy
  • 20. 20 Training Deep Neural Networks trainingOptions ā–Ŗ Plot training metrics – Training accuracy, smoothed training accuracy, validation accuracy – Training loss, smoothed training loss, and validation loss ā–Ŗ Debug training – Stop and check current state – Save / load checkpoint networks – Custom output function (stopping condition, visualization, etc.) ā–Ŗ Bayesian optimization for hyperparameter tuning Learn More
  • 21. 21 Transfer Learning Workflow Early layers Last layers 1 million images 1000s classes Load pretrained network Fewer classes Learn faster New layers Replace final layers 100s images 10s classes Training options Train network Trained Network Predict and assess network accuracy MATLAB Provides Evaluation Frameworks for Different Classes of Deep Learning Problems
  • 23. 23 Deep learning on CPU, GPU, multi-GPU and clusters Single CPU Single CPU Single GPU Single CPU Multiple GPUs On-prem server with GPUs Cloud GPUs (AWS, Azure, etc.) Deep Learning on Cloud Whitepaper
  • 24. 24 Training in MATLAB is fast MATLAB is more than 4x faster than TensorFlow AlexNet CNN architecture trained on the ImageNet dataset, using batch size of 32, on a Windows 10 desktop with single NVIDIA GPU (Titan Xp). TensorFlow version 1.2.0.
  • 25. 25 Deep Learning Challenges Big Data ā–Ŗ Handling large amounts of data ā–Ŗ Labeling thousands of images & videos Training and Testing Deep Neural Networks ā–Ŗ Accessing reference models from research ā–Ŗ Understanding network behavior ā–Ŗ Tuning hyperparameters and refining architectures ā–Ŗ Training takes hours-days Seamless Deployment onto embedded hardware Real world systems use more than deep learning Deep learning frameworks do not include ā€œclassicalā€ computer vision Not a deep learning expert
  • 26. 26 Algorithm Design to Embedded Deployment Workflow Conventional Approach Desktop GPU High-level language Deep learning framework Large, complex software stack 1 Desktop GPU C++ C/C++ Low-level APIs Application-specific libraries 2 C++ Embedded GPU C/C++ Target-optimized libraries Optimize for memory & speed 3 Challenges • Integrating multiple libraries and packages • Verifying and maintaining multiple implementations • Algorithm & vendor lock-in
  • 27. 27 GPU Coder for Deployment: New Product in R2017b Neural Networks Deep Learning, machine learning Image Processing and Computer Vision Image filtering, feature detection/extraction Signal Processing and Communications FFT, filtering, cross correlation, 7x faster than state-of-art 700x faster than CPUs for feature extraction 20x faster than CPUs for FFTs GPU Coder Accelerated implementation of parallel algorithms on GPUs
  • 28. 28 Algorithm Design to Embedded Deployment Workflow with GPU Coder MATLAB algorithm (functional reference) Functional test1 Deployment unit-test 2 Desktop GPU C++ Deployment integration-test 3 Desktop GPU C++ Real-time test4 Embedded GPU .mex .lib Cross-compiled .lib Build type Call CUDA from MATLAB directly Call CUDA from (C++) hand- coded main() Call CUDA from (C++) hand-coded main().
  • 29. 29 CUDA Code Generation from GPU Coder
  • 30. 30
  • 31. 31 End-to-End Application: Lane Detection Transfer Learning AlexNet – 1000 class classification Lane detection CNN Post-processing (find left/right lane points) Image Parabolic lane coefficients in world coordinates Left lane co-efficients Right lane co-efficients Output of CNN is lane parabola coefficients according to: y = ax^2 + bx + c MATLAB : A SINGLE PLATFORM FOR DEEP LEARNING TRAINING & DEPLOYMENT https://tinyurl.com/ybaxnxjg
  • 32. 32 Alexnet Inference on Intel CPUs MATLAB (R2017b Release 2) TensorFlow MXNet Caffe2
  • 33. 33 Alexnet Inference on NVIDIA Titan Xp MATLAB GPU Coder (R2017b) TensorFlow (1.2.0) Caffe2 (0.8.1) Framespersecond Batch Size CPU Intel(R) Xeon(R) CPU E5-1650 v3 @ 3.50GHz GPU Pascal Titan Xp cuDNN v5 Testing platform MXNet (0.10) MATLAB (R2017b) 2x 7x5x
  • 34. 34 Alexnet inference on NVIDIA GPUs 0 1 2 3 4 5 6 7 8 9 CPU resident memory GPU peak memory (nvidia-smi) Memoryusage(GB) Batch Size1 16 32 64 CPU Intel(R) Xeon(R) CPU E5-1650 v3 @ 3.50 GHz GPU Tesla K40c Py-Caffe GPUCoder TensorFlow MATLABw/PCT C++-Caffe
  • 35. 35 Design Deep Neural Networks in MATLAB and Deploy with GPU Coder Design Deep Learning & Vision Algorithms Highlights ā–Ŗ Manage large image sets ā–Ŗ Easy access to models like AlexNet, GoogleNet ā–Ŗ Pre-built training frameworks ā–Ŗ Automate ground truth labeling apps Highlights ā–Ŗ Automate optimized CUDA code generation with GPU Coder ā–Ŗ Deployed models upto 4.5x faster than Caffe2 and 7x faster than Tensor High Performance Deployment
  • 36. 36 Deep Learning Onramp Free Introductory Course Available Here
  • 37. 37 Visit MathWorks Booth to Learn More HIGHWAY_SCENE Classification Car Car CarDetection Lane Lane Regression Semantic Segmentation