Object Detection
By Usman Qayyum
4, Dec, 2018
Talk Covers Three Papers (Object Detection -> Embedded Computing)
SqueezeNet (2016) + SSD (2016) = TinySSD (2018)
Image Classification/Object Detection
● Fast and robust object detection is the need of the hour for autonomous vehicles, smart video surveillance, facial detection, and many other applications.
● Detection means not only recognizing and classifying every object in an image, but also localizing each one by drawing the appropriate bounding box around it.
CNN Migration (Image Classification)
Object Detection as Classification
CNN
deer?
cat?
background?
Object Detection as Classification
with Sliding Window
CNN
deer?
cat?
background?
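A minimal sketch of the sliding-window idea on this slide, assuming square windows at a single scale. The function name `sliding_windows` and the 300x300 image size are illustrative, not from the talk. Each yielded box would be cropped and passed to the CNN classifier (deer? cat? background?).

```python
from typing import Iterator, Tuple

# (x_min, y_min, x_max, y_max) in pixels
Box = Tuple[int, int, int, int]


def sliding_windows(img_w: int, img_h: int, win: int, stride: int) -> Iterator[Box]:
    """Yield square windows of side `win`, shifted by `stride` pixels."""
    for y in range(0, img_h - win + 1, stride):
        for x in range(0, img_w - win + 1, stride):
            yield (x, y, x + win, y + win)


# Hypothetical 300x300 image, 100-pixel windows, 50-pixel stride.
windows = list(sliding_windows(img_w=300, img_h=300, win=100, stride=50))
print(len(windows))  # 25 crops to classify
```

Even one scale with a coarse stride yields 25 crops here. Covering several scales and aspect ratios multiplies that count, which is what motivates box proposals.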
Object Detection as Classification
with Box Proposals
Box Proposal Method: Selective Search
Segmentation as Selective Search for Object Recognition. van de Sande et al. ICCV 2011.
Idea behind Object Detectors
● Box Proposals
● Classifier Algorithm
RCNN
Rich feature hierarchies for accurate object detection and semantic segmentation.
Girshick et al. CVPR 2014.
https://people.eecs.berkeley.edu/~rbg/papers/r-cnn-cvpr.pdf
Fast-RCNN
Fast R-CNN. Girshick. ICCV 2015.
https://arxiv.org/abs/1504.08083
Idea: No need to recompute features for every box independently; compute them once per image and share them across boxes. Also regress refined bounding box coordinates.
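The "regress refined bounding box coordinates" step can be sketched as follows, assuming the center/size offset parameterization used in the R-CNN family. The function name `decode_box` and the numbers are illustrative only.

```python
import math


def decode_box(proposal, deltas):
    """Apply predicted offsets (tx, ty, tw, th) to a proposal box.

    `proposal` is (center_x, center_y, width, height). The center is shifted
    by a fraction of the box size, and the size is scaled in log space.
    """
    px, py, pw, ph = proposal
    tx, ty, tw, th = deltas
    return (px + pw * tx, py + ph * ty, pw * math.exp(tw), ph * math.exp(th))


# Hypothetical proposal centred at (50, 50), 20 wide and 40 tall.
refined = decode_box((50.0, 50.0, 20.0, 40.0), (0.5, -0.25, 0.0, 0.0))
print(refined)  # (60.0, 40.0, 20.0, 40.0)
```

Zero offsets leave the proposal unchanged, so the network only has to learn a correction to each box rather than absolute coordinates.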
Faster-RCNN
Ren et al. NIPS 2015.
https://arxiv.org/abs/1506.01497
Idea: Integrate the bounding box proposals as part of the CNN predictions.
YOLO: You Only Look Once
● Single Shot Detector
Redmon et al. CVPR 2016.
https://arxiv.org/abs/1506.02640
Idea: No bounding box proposals. Predict a class and a box for every location in a grid.
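A small sketch of the grid idea, assuming the YOLO v1 layout in which an S x S grid predicts B boxes (5 numbers each: x, y, w, h, confidence) plus C class scores per cell. The helper names are hypothetical. The values S=7, B=2, C=20 are taken from the YOLO paper's PASCAL VOC setup, not from this talk.

```python
def responsible_cell(cx, cy, img_w, img_h, S):
    """Return (row, col) of the grid cell containing the object centre."""
    col = min(int(cx / img_w * S), S - 1)
    row = min(int(cy / img_h * S), S - 1)
    return row, col


def output_size(S, B, C):
    """Number of values predicted for one image: S*S cells, each B*5 + C."""
    return S * S * (B * 5 + C)


print(responsible_cell(224, 224, 448, 448, S=7))  # (3, 3)
print(output_size(S=7, B=2, C=20))                # 1470, i.e. 7 x 7 x 30
```

The whole detection is a single fixed-size tensor produced in one forward pass, which is why no separate proposal stage is needed.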
SSD: Single Shot Detector
Liu et al. ECCV 2016.
Idea: Similar to YOLO, but with a denser grid map and multiscale grid maps, plus data augmentation, hard negative mining, and other design choices in the network.
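To make "denser, multiscale grid maps" concrete, the sketch below counts default boxes. It assumes the SSD300 configuration reported in the SSD paper (six feature maps of decreasing size, with 4 or 6 default boxes per location). These numbers are not given on the slide and are stated here as an assumption.

```python
# (feature map side length, default boxes per location), assumed SSD300 layout
SSD300_MAPS = [(38, 4), (19, 6), (10, 6), (5, 6), (3, 4), (1, 4)]


def count_default_boxes(maps):
    """Total default boxes across all feature maps."""
    return sum(side * side * per_cell for side, per_cell in maps)


ssd_boxes = count_default_boxes(SSD300_MAPS)
yolo_boxes = count_default_boxes([(7, 2)])  # single 7x7 grid, 2 boxes per cell

print(ssd_boxes)   # 8732
print(yolo_boxes)  # 98
```

Large feature maps cover small objects and small feature maps cover large ones, so the multiscale layout gives far more candidate boxes than a single coarse grid.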
SSD: Single Shot Detector
The overall objective loss function is a weighted sum of the localization loss (loc) and the confidence loss (conf):
$$L(x, c, l, g) = \frac{1}{N}\left(L_{conf}(x, c) + \alpha L_{loc}(x, l, g)\right)$$
N: the number of matched default boxes
c: the class confidences
l: the predicted boxes
g: the ground truth boxes
α: the weight of the localization term
x = 1 denotes that a given default box is matched to a ground truth box
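The weighted sum described on this slide can be sketched in plain Python. Several details are assumptions based on the SSD paper rather than on the slide: matching by IoU above 0.5, a smooth L1 localization term, α = 1, and a loss of 0 when no box is matched. All function names are illustrative.

```python
def iou(a, b):
    """Intersection over union of two (x_min, y_min, x_max, y_max) boxes."""
    ix = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    iy = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = ix * iy
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0


def is_matched(default_box, gt_box, threshold=0.5):
    """x = 1 if the default box overlaps the ground truth enough, else 0."""
    return 1 if iou(default_box, gt_box) > threshold else 0


def smooth_l1(d):
    """Smooth L1 penalty on one coordinate difference."""
    a = abs(d)
    return 0.5 * a * a if a < 1.0 else a - 0.5


def ssd_loss(conf_loss_sum, loc_loss_sum, num_matched, alpha=1.0):
    """(L_conf + alpha * L_loc) / N, defined as 0 when N == 0."""
    if num_matched == 0:
        return 0.0
    return (conf_loss_sum + alpha * loc_loss_sum) / num_matched


print(is_matched((0, 0, 2, 2), (0, 0, 2, 3)))  # 1 (IoU = 4/6)
print(ssd_loss(conf_loss_sum=6.0, loc_loss_sum=2.0, num_matched=4))  # 2.0
```

Dividing by N keeps the loss scale comparable between images with few and many matched boxes.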
Performance
Accuracy Vs Computation
AI Workload Migration
Embedded
(Mobile/Edge)
Server/Cloud
Execution/Inference
Training
Execution/Inference
Intelligence &
Analytics
Key Use Cases
Vision | Audio | Security
Benefits
Low Latency | Privacy
AI in Embedded Devices
How? (AI in Embedded Devices)
Pruning | Quantization
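The slide only names the two techniques. As a rough illustration, the sketch below assumes magnitude-based pruning and uniform 8-bit quantization, which are common variants but not necessarily the ones the talk had in mind. The function names and weight values are hypothetical.

```python
def prune_by_magnitude(weights, threshold):
    """Zero out weights whose absolute value is below `threshold`."""
    return [w if abs(w) >= threshold else 0.0 for w in weights]


def quantize_uint8(weights):
    """Map floats to integers in [0, 255] with one shared scale and offset."""
    lo, hi = min(weights), max(weights)
    scale = (hi - lo) / 255 if hi > lo else 1.0
    q = [round((w - lo) / scale) for w in weights]
    return q, scale, lo


def dequantize(q, scale, lo):
    """Recover approximate float weights from the 8-bit codes."""
    return [v * scale + lo for v in q]


pruned = prune_by_magnitude([0.5, -0.01, 0.2, 0.03], threshold=0.05)
print(pruned)  # [0.5, 0.0, 0.2, 0.0]

weights = [-1.0, -0.3, 0.25, 1.0]
codes, scale, lo = quantize_uint8(weights)
restored = dequantize(codes, scale, lo)
```

Pruning reduces the number of non-zero weights to store and multiply, while quantization cuts storage per weight from 32 bits to 8 at the cost of a rounding error of at most half a quantization step.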
SqueezeNet (Parameter Reduction)
● Strategy 1. Replace 3x3 filters with 1x1 filters
○ Parameters per filter: (3x3 filter) = 9 * (1x1 filter)
● Strategy 2. Decrease the number of input channels to 3x3 filters
○ Total # of parameters: (# of input channels) * (# of filters) * ( # of parameters per filter)
● Strategy 3. Downsample late in the network so that convolution layers have large activation maps
○ Size of activation maps is determined by: the size of the input data, and the choice of layers in which to downsample in the CNN architecture
Iandola, Forrest N., et al. "SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and <0.5MB model size."
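The parameter formulas in Strategies 1 and 2 can be checked with a few lines of arithmetic. This is a sketch that counts weights only (biases ignored). The channel counts are arbitrary illustrative values.

```python
def conv_weights(in_channels, n_filters, kernel):
    """Weights in a conv layer: in_channels * n_filters * (kernel * kernel)."""
    return in_channels * n_filters * kernel * kernel


w_3x3 = conv_weights(in_channels=64, n_filters=64, kernel=3)
w_1x1 = conv_weights(in_channels=64, n_filters=64, kernel=1)
w_3x3_fewer_inputs = conv_weights(in_channels=16, n_filters=64, kernel=3)

print(w_3x3, w_1x1, w_3x3 // w_1x1)  # 36864 4096 9   (Strategy 1)
print(w_3x3_fewer_inputs)            # 9216           (Strategy 2)
```

Swapping a 3x3 filter for a 1x1 filter cuts its weights by 9x, and feeding a 3x3 layer a quarter of the input channels cuts its weights by 4x.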
Strategy 1: Conv 1x1 or Kernel Reduction
Microarchitecture – Fire Module
Squeeze Layer
Set s1x1 < (e1x1 + e3x3), where s1x1 is the number of 1x1 filters in the squeeze layer and e1x1 and e3x3 are the numbers of 1x1 and 3x3 filters in the expand layer. This limits the number of input channels to the 3x3 filters.
● Strategy 2. Decrease the number of input channels to 3x3 filters
○ Total # of parameters: (# of input channels) * (# of filters) * (# of parameters per filter)
○ How much can we limit s1x1?
● Strategy 1. Replace 3x3 filters with 1x1 filters
○ Parameters per filter: (3x3 filter) = 9 * (1x1 filter)
○ How much can we replace 3x3 with 1x1 (e1x1 vs e3x3)?
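As a worked example of the Fire module's savings, the sketch below compares a Fire module with a plain 3x3 convolution producing the same number of output channels. It counts weights only (no biases). The sizes (96 input channels, s1x1=16, e1x1=64, e3x3=64) are assumed for illustration, and the helper names are hypothetical.

```python
def conv_weights(in_channels, n_filters, kernel):
    """Weights in a conv layer: in_channels * n_filters * kernel^2."""
    return in_channels * n_filters * kernel * kernel


def fire_weights(in_channels, s1x1, e1x1, e3x3):
    """Squeeze (1x1) layer followed by parallel 1x1 and 3x3 expand filters."""
    squeeze = conv_weights(in_channels, s1x1, 1)
    # The expand filters only see the s1x1 squeezed channels.
    expand = conv_weights(s1x1, e1x1, 1) + conv_weights(s1x1, e3x3, 3)
    return squeeze + expand


fire = fire_weights(in_channels=96, s1x1=16, e1x1=64, e3x3=64)
plain = conv_weights(in_channels=96, n_filters=128, kernel=3)

print(fire)   # 11776
print(plain)  # 110592
```

With these sizes the Fire module needs roughly 9x fewer weights than the plain layer, because the squeeze layer shrinks the input to the 3x3 filters from 96 channels to 16.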
Expand
● In the "expand" modules, what are the tradeoffs when we turn the knob between mostly 1x1 and mostly 3x3 filters?
● Hypothesis: if having more weights leads to higher accuracy, then having all 3x3 filters should give the highest accuracy
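The cost side of this knob is easy to tabulate. The sketch below assumes a fixed squeeze width of 16 and 128 expand filters in total (illustrative values) and varies the fraction of expand filters that are 3x3. It says nothing about accuracy, only about weight count.

```python
def expand_weights(s1x1, e_total, pct_3x3):
    """Weights in the expand layer when a fraction pct_3x3 of filters are 3x3."""
    e3x3 = round(e_total * pct_3x3)
    e1x1 = e_total - e3x3
    return s1x1 * e1x1 * 1 * 1 + s1x1 * e3x3 * 3 * 3


sweep = {pct: expand_weights(16, 128, pct) for pct in (0.0, 0.25, 0.5, 0.75, 1.0)}
for pct, weights in sweep.items():
    print(pct, weights)
```

The weight count grows linearly from 2048 (all 1x1) to 18432 (all 3x3), so the hypothesis on this slide amounts to asking whether accuracy keeps rising along that same line.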
Macroarchitecture
Strategy 3. Downsample late in the network so that convolution layers have large activation maps
The size of the activation maps is determined by the size of the input data and by the choice of layers in which to downsample in the CNN architecture.
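A small sketch of how the placement of downsampling changes activation map sizes. It assumes a 224x224 input, six layers, and padding such that a stride-s layer divides the spatial size by s (rounded up). The stride schedules are made up for the comparison.

```python
import math


def activation_sizes(input_size, strides):
    """Spatial side length of the activation map after each layer."""
    sizes = []
    size = input_size
    for stride in strides:
        size = math.ceil(size / stride)
        sizes.append(size)
    return sizes


early = activation_sizes(224, [2, 2, 2, 1, 1, 1])  # downsample early
late = activation_sizes(224, [1, 1, 1, 2, 2, 2])   # downsample late

print(early)  # [112, 56, 28, 28, 28, 28]
print(late)   # [224, 224, 224, 112, 56, 28]

# Total spatial positions per channel, summed over layers.
print(sum(s * s for s in early), sum(s * s for s in late))
```

Both schedules end at 28x28, but with late downsampling most layers operate on much larger maps, which is the condition Strategy 3 asks for (at a higher compute cost).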
Performance
TinySSD (SSD with Microarchitecture)
Thanks for your attention.

Object Detection using Deep Neural Networks
