SlideShare a Scribd company logo
2
Most read
5
Most read
18
Most read
Object Detection with deep learning
SUBMITTED TO: MR. B. SURESH SUBMITTED BY:
HIMANSHU MAURYA(9917102004)
SUSHANT SHRIVASTAVA(9917102023)
BHUVNESH KUMAR BHARDWAJ(9917102028)
1. INTRODUCTION TO
OBJECT DETECTION
○ Object detection is scanning and searching for an object in an image or
a video.
Fig. 1 Object detection
Literature Review.
• Object detection is a common term for computer vision techniques classifying and
locating objects in an image. Modern object detection is largely based on use of
convolutional neural networks Some of the most relevant system types today are Faster
R-CNN, R-FCN, Multibox Single Shot Detector (SSD) and YOLO (You Only Look
Once) [1]. Original R-CNN method worked by running a neural net classifier on samples
cropped from images using externally computed box proposals (=samples cropped with
externally computed box proposals; feature extraction done on all the cropped samples).
This approach was computationally expensive due to many crops.
• Single Shot Multibox Detector (SSD) differs from the R-CNN based approaches by not
requiring a second stage per-proposal classification operation. This makes it fast enough
for real-time detection applications. However, this comes with a price of reduced
precision . “SSD with MobileNet” refers to a model where model meta architecture is
SSD and the feature extractor type is MobileNet.
2. Generic object detection
● Generic object detection aims at locating and classifying
existing object in any one image and labelling them with
rectangular BBs to show the confidences of existences.
Fig. 2 Generic object detection
3. Basic architecture of CNN
Convolutional Neural Network (CNN) is a Deep Learning
algorithm which can take in an input image, assign importance to
various aspects/objects in the image and be able to differentiate
one from the other.[2]
Fig. 3 Basic architecture of CNN
4. Building the CNN
● Convolution
● Polling
● Flattening
4.1 Convolution
● Convolution preserves the spatial relationship between pixels
by learning image features using small squares of input data.
FIG. 4.1 Convolution
4.2 POOLING
● It reduces the dimensionality of each feature map but retains
the most important information.
FIG. 4.2 POOLING
4.3 FLATTENING
● Here the matrix is converted into a linear array so that to input
it into the nodes of our neural network.
FIG. 4.3 FLATTENING
5. Dataset & Preprocessing
COCO stands for Common Objects in Context, this dataset contains around 330K labelled images. COCO is
a large-scale object detection, segmentation, and captioning dataset.[3]
5.1 Features of dataset
· Object segmentation
· Recognition in context
· 330K images (>200K labeled)
· 1.5 million object instances
· 80 object categories
· 91 stuff categories
5.2 Data Preprocessing
● Since the model is pre trained, there is no need for data Preprocessing.
6. What is SSD?
● SSD(Single Shot Detector) is a is designed for object
detection in real-time.
FIG 5. Single Shot Detector.
7. Object detection using SSD algorithm.
● It is a three steps Process:
1. Region Proposal
2. Feature Generation
3. Classification
FIG. 6 Object detection using SSD
8. SSD FRAMEWORK
● Multi-scale feature maps for detection.
● Convolutional predictors for detection.
● Default boxes and aspect ratios.
FIG. 7 SSD FRAMEWORK
9. Feature extraction
● In this stage ,each region proposal is warped or cropped into
a fixed resolution and the SSD module is utilized to extract
features.
FIG. 8 Feature extraction
10. Classification and Localization
● Classify each region using MobileNet V1 Architecture for each
category by passing feature vector created from feature extraction
and scored region are then adjusted with bounding box regression.
● This architecture uses depthwise separable convolutions which
significantly reduces the number of parameters when compared to
the network with normal convolutions.
FIG. 9 Depth Wise Separable
Convolution
11. MobileNet V1 Architecture
● It uses Separable Convolution to reduce the model size and
complexity.
● Smaller model size: Fewer number of parameters.
● Smaller complexity: Fewer Multiplications and Additions
(Multi-Adds).
Fig. 10 MobileNet V1 Architecture
12. Advantages of MobileNet V1 Architecture
● The main advantages is their accuracy in image recognition
problem.
● It takes less time.
● Improve the quality of candidate bounding boxes.
13. Tools And Libraries
● Anaconda — Anaconda is a free and open source distribution of the Python and R programming languages
for data science and machine learning related applications.
● Spyder — Spyder is an open source cross-platform IDE for scientific programming in the Python language.
● Tensorflow — TensorFlow is an open-source software library for dataflow programming across a range of
tasks.
● NumPy- NumPy is a Python package which stands for ‘Numerical Python’. It is the core library for scientific
computing, which contains a powerful n-dimensional array object, provide tools for integrating C, C++ etc.
● Matplotlib- Matplotlib is a Python 2D plotting library which produces publication quality figures in a variety of
hardcopy formats and interactive environments across platforms.
● Urllib - Urllib is a Python module that can be used for opening URLs. It defines functions and classes to help in URL
actions. With Python you can also access and retrieve data from the internet like XML, HTML, JSON, etc.
References
1. Zhong-Qiu Zhao , Member, IEEE, Peng Zheng, Shou-Tao Xu, and Xindong Wu , Fellow, IEEE(2016)
2. https://medium.com/@RaghavPrabhu/understanding-of-convolutional-neural-network-cnn-deep-learning-
99760835f148
3. http://cocodataset.org/#home
LINKS TO FIGURES:-
1.
2. https://towardsdatascience.com/going-deep-into-object-detection-bed442d92b34
3. https://medium.com/datadriveninvestor/convolutional-neural-network-cnn-simplified-ecafd4ee52c5
4. https://towardsdatascience.com/a-comprehensive-guide-to-convolutional-neural-networks-the-eli5-way-3bd2b1164a53
5. https://www.researchgate.net/figure/The-architecture-of-Single-Shot-Multibox-Detector-SSD-It-considers- only-two-
stage-by_fig9_327491507
6. Wei Liu1, Dragomir Anguelov2, Dumitru Erhan3, Christian Szegedy3, Scott Reed4, Cheng-Yang Fu1, Alexander C.
Berg1(2016)
7. Sermanet,P.,Eigen,D.,Zhang,X.,Mathieu,M.,Fergus,R.,LeCun,Y.: Overfeat:Integrated recognition, localization and
detection using convolutional networks. In: ICLR. (2014)
8. https://towardsdatascience.com/cnn-application-on-structured-data-automated-feature-extraction-8f2cd28d9a7e
9. https://towardsdatascience.com/a-comprehensive-guide-to-convolutional-neural-networks-the-eli5-way-
3bd2b1164a53
10.https://medium.com/@RaghavPrabhu/understanding-of-convolutional-neural-network-cnn-deep-learning-
99760835f148
https://machinethink.net/blog/object-detection/
THANK YOU.

More Related Content

PDF
Introduction to object detection
PPTX
Object Detection using Deep Neural Networks
PPTX
Object detection
PPTX
Object Detection & Tracking
PDF
Deep learning based object detection basics
PPTX
Object detection presentation
PPTX
Deep learning based object detection
PPTX
Deep learning for object detection
Introduction to object detection
Object Detection using Deep Neural Networks
Object detection
Object Detection & Tracking
Deep learning based object detection basics
Object detection presentation
Deep learning based object detection
Deep learning for object detection

What's hot (20)

PDF
Moving Object Detection And Tracking Using CNN
PPTX
Object detection
PPTX
Deep Learning in Computer Vision
PDF
Faster R-CNN: Towards real-time object detection with region proposal network...
PDF
[PR12] You Only Look Once (YOLO): Unified Real-Time Object Detection
PPTX
Moving object detection in video surveillance
PPTX
Histogram Specification or Matching Problem
PPTX
Image classification using CNN
PPTX
Handwritten Digit Recognition(Convolutional Neural Network) PPT
PPTX
Object Recognition
PPTX
Deep learning presentation
PPTX
Real Time Object Tracking
PPTX
Computer vision
PPTX
Computer Vision
PDF
Faster R-CNN - PR012
PPTX
Introduction to CNN
PDF
Feature selection
PDF
Emotion detection using cnn.pptx
PPTX
Introduction to Machine Learning
PPTX
Object Detection with Tensorflow
Moving Object Detection And Tracking Using CNN
Object detection
Deep Learning in Computer Vision
Faster R-CNN: Towards real-time object detection with region proposal network...
[PR12] You Only Look Once (YOLO): Unified Real-Time Object Detection
Moving object detection in video surveillance
Histogram Specification or Matching Problem
Image classification using CNN
Handwritten Digit Recognition(Convolutional Neural Network) PPT
Object Recognition
Deep learning presentation
Real Time Object Tracking
Computer vision
Computer Vision
Faster R-CNN - PR012
Introduction to CNN
Feature selection
Emotion detection using cnn.pptx
Introduction to Machine Learning
Object Detection with Tensorflow
Ad

Similar to Object detection with deep learning (20)

PDF
ObjectDetectionUsingMachineLearningandNeuralNetworks.pdf
PDF
IRJET- Real-Time Object Detection System using Caffe Model
PDF
Object Detetcion using SSD-MobileNet
PDF
kanimozhi2019.pdf
PDF
Convolutional Neural Network Based Real Time Object Detection Using YOLO V4
PDF
Modern convolutional object detectors
PPTX
Object detection with Tensorflow Api
PDF
“Modern Machine Vision from Basics to Advanced Deep Learning,” a Presentation...
PPTX
Deep learning based object detection
PDF
Object Detection for Autonomous Cars using AI/ML
PDF
Object Detection An Overview
PPTX
Anomaly Detection with Azure and .net
PPTX
Anomaly Detection with Azure and .NET
PDF
Applying convolutional neural networks for limited-memory application
PDF
IRJET- Real-Time Object Detection using Deep Learning: A Survey
PPTX
SMART RECOGNITION FOR OBJECT DETECTION.pptx
PDF
IRJET- Object Detection in an Image using Deep Learning
PDF
物件偵測與辨識技術
PDF
Object Detection and Tracking AI Robot
PPTX
seminar ppt.pptx
ObjectDetectionUsingMachineLearningandNeuralNetworks.pdf
IRJET- Real-Time Object Detection System using Caffe Model
Object Detetcion using SSD-MobileNet
kanimozhi2019.pdf
Convolutional Neural Network Based Real Time Object Detection Using YOLO V4
Modern convolutional object detectors
Object detection with Tensorflow Api
“Modern Machine Vision from Basics to Advanced Deep Learning,” a Presentation...
Deep learning based object detection
Object Detection for Autonomous Cars using AI/ML
Object Detection An Overview
Anomaly Detection with Azure and .net
Anomaly Detection with Azure and .NET
Applying convolutional neural networks for limited-memory application
IRJET- Real-Time Object Detection using Deep Learning: A Survey
SMART RECOGNITION FOR OBJECT DETECTION.pptx
IRJET- Object Detection in an Image using Deep Learning
物件偵測與辨識技術
Object Detection and Tracking AI Robot
seminar ppt.pptx
Ad

Recently uploaded (20)

PDF
TFEC-4-2020-Design-Guide-for-Timber-Roof-Trusses.pdf
PPTX
MET 305 2019 SCHEME MODULE 2 COMPLETE.pptx
PDF
keyrequirementskkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkk
PDF
Unit I ESSENTIAL OF DIGITAL MARKETING.pdf
PDF
Model Code of Practice - Construction Work - 21102022 .pdf
PPT
Introduction, IoT Design Methodology, Case Study on IoT System for Weather Mo...
PDF
Digital Logic Computer Design lecture notes
PDF
The CXO Playbook 2025 – Future-Ready Strategies for C-Suite Leaders Cerebrai...
PDF
Evaluating the Democratization of the Turkish Armed Forces from a Normative P...
DOCX
ASol_English-Language-Literature-Set-1-27-02-2023-converted.docx
PDF
Embodied AI: Ushering in the Next Era of Intelligent Systems
PPTX
CYBER-CRIMES AND SECURITY A guide to understanding
PPTX
additive manufacturing of ss316l using mig welding
PPTX
OOP with Java - Java Introduction (Basics)
PPTX
Current and future trends in Computer Vision.pptx
PDF
Well-logging-methods_new................
PPTX
Infosys Presentation by1.Riyan Bagwan 2.Samadhan Naiknavare 3.Gaurav Shinde 4...
PDF
composite construction of structures.pdf
PPTX
Construction Project Organization Group 2.pptx
PDF
BMEC211 - INTRODUCTION TO MECHATRONICS-1.pdf
TFEC-4-2020-Design-Guide-for-Timber-Roof-Trusses.pdf
MET 305 2019 SCHEME MODULE 2 COMPLETE.pptx
keyrequirementskkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkk
Unit I ESSENTIAL OF DIGITAL MARKETING.pdf
Model Code of Practice - Construction Work - 21102022 .pdf
Introduction, IoT Design Methodology, Case Study on IoT System for Weather Mo...
Digital Logic Computer Design lecture notes
The CXO Playbook 2025 – Future-Ready Strategies for C-Suite Leaders Cerebrai...
Evaluating the Democratization of the Turkish Armed Forces from a Normative P...
ASol_English-Language-Literature-Set-1-27-02-2023-converted.docx
Embodied AI: Ushering in the Next Era of Intelligent Systems
CYBER-CRIMES AND SECURITY A guide to understanding
additive manufacturing of ss316l using mig welding
OOP with Java - Java Introduction (Basics)
Current and future trends in Computer Vision.pptx
Well-logging-methods_new................
Infosys Presentation by1.Riyan Bagwan 2.Samadhan Naiknavare 3.Gaurav Shinde 4...
composite construction of structures.pdf
Construction Project Organization Group 2.pptx
BMEC211 - INTRODUCTION TO MECHATRONICS-1.pdf

Object detection with deep learning

  • 1. Object Detection with deep learning SUBMITTED TO: MR. B. SURESH SUBMITTED BY: HIMANSHU MAURYA(9917102004) SUSHANT SHRIVASTAVA(9917102023) BHUVNESH KUMAR BHARDWAJ(9917102028)
  • 2. 1. INTRODUCTION TO OBJECT DETECTION ○ Object detection is scanning and searching for an object in an image or a video. Fig. 1 Object detection
  • 3. Literature Review. • Object detection is a common term for computer vision techniques classifying and locating objects in an image. Modern object detection is largely based on use of convolutional neural networks Some of the most relevant system types today are Faster R-CNN, R-FCN, Multibox Single Shot Detector (SSD) and YOLO (You Only Look Once) [1]. Original R-CNN method worked by running a neural net classifier on samples cropped from images using externally computed box proposals (=samples cropped with externally computed box proposals; feature extraction done on all the cropped samples). This approach was computationally expensive due to many crops. • Single Shot Multibox Detector (SSD) differs from the R-CNN based approaches by not requiring a second stage per-proposal classification operation. This makes it fast enough for real-time detection applications. However, this comes with a price of reduced precision . “SSD with MobileNet” refers to a model where model meta architecture is SSD and the feature extractor type is MobileNet.
  • 4. 2. Generic object detection ● Generic object detection aims at locating and classifying existing object in any one image and labelling them with rectangular BBs to show the confidences of existences. Fig. 2 Generic object detection
  • 5. 3. Basic architecture of CNN Convolutional Neural Network (CNN) is a Deep Learning algorithm which can take in an input image, assign importance to various aspects/objects in the image and be able to differentiate one from the other.[2] Fig. 3 Basic architecture of CNN
  • 6. 4. Building the CNN ● Convolution ● Polling ● Flattening
  • 7. 4.1 Convolution ● Convolution preserves the spatial relationship between pixels by learning image features using small squares of input data. FIG. 4.1 Convolution
  • 8. 4.2 POOLING ● It reduces the dimensionality of each feature map but retains the most important information. FIG. 4.2 POOLING
  • 9. 4.3 FLATTENING ● Here the matrix is converted into a linear array so that to input it into the nodes of our neural network. FIG. 4.3 FLATTENING
  • 10. 5. Dataset & Preprocessing COCO stands for Common Objects in Context, this dataset contains around 330K labelled images. COCO is a large-scale object detection, segmentation, and captioning dataset.[3] 5.1 Features of dataset · Object segmentation · Recognition in context · 330K images (>200K labeled) · 1.5 million object instances · 80 object categories · 91 stuff categories 5.2 Data Preprocessing ● Since the model is pre trained, there is no need for data Preprocessing.
  • 11. 6. What is SSD? ● SSD(Single Shot Detector) is a is designed for object detection in real-time. FIG 5. Single Shot Detector.
  • 12. 7. Object detection using SSD algorithm. ● It is a three steps Process: 1. Region Proposal 2. Feature Generation 3. Classification FIG. 6 Object detection using SSD
  • 13. 8. SSD FRAMEWORK ● Multi-scale feature maps for detection. ● Convolutional predictors for detection. ● Default boxes and aspect ratios. FIG. 7 SSD FRAMEWORK
  • 14. 9. Feature extraction ● In this stage ,each region proposal is warped or cropped into a fixed resolution and the SSD module is utilized to extract features. FIG. 8 Feature extraction
  • 15. 10. Classification and Localization ● Classify each region using MobileNet V1 Architecture for each category by passing feature vector created from feature extraction and scored region are then adjusted with bounding box regression. ● This architecture uses depthwise separable convolutions which significantly reduces the number of parameters when compared to the network with normal convolutions. FIG. 9 Depth Wise Separable Convolution
  • 16. 11. MobileNet V1 Architecture ● It uses Separable Convolution to reduce the model size and complexity. ● Smaller model size: Fewer number of parameters. ● Smaller complexity: Fewer Multiplications and Additions (Multi-Adds). Fig. 10 MobileNet V1 Architecture
  • 17. 12. Advantages of MobileNet V1 Architecture ● The main advantages is their accuracy in image recognition problem. ● It takes less time. ● Improve the quality of candidate bounding boxes.
  • 18. 13. Tools And Libraries ● Anaconda — Anaconda is a free and open source distribution of the Python and R programming languages for data science and machine learning related applications. ● Spyder — Spyder is an open source cross-platform IDE for scientific programming in the Python language. ● Tensorflow — TensorFlow is an open-source software library for dataflow programming across a range of tasks. ● NumPy- NumPy is a Python package which stands for ‘Numerical Python’. It is the core library for scientific computing, which contains a powerful n-dimensional array object, provide tools for integrating C, C++ etc. ● Matplotlib- Matplotlib is a Python 2D plotting library which produces publication quality figures in a variety of hardcopy formats and interactive environments across platforms. ● Urllib - Urllib is a Python module that can be used for opening URLs. It defines functions and classes to help in URL actions. With Python you can also access and retrieve data from the internet like XML, HTML, JSON, etc.
  • 19. References 1. Zhong-Qiu Zhao , Member, IEEE, Peng Zheng, Shou-Tao Xu, and Xindong Wu , Fellow, IEEE(2016) 2. https://medium.com/@RaghavPrabhu/understanding-of-convolutional-neural-network-cnn-deep-learning- 99760835f148 3. http://cocodataset.org/#home LINKS TO FIGURES:- 1. 2. https://towardsdatascience.com/going-deep-into-object-detection-bed442d92b34 3. https://medium.com/datadriveninvestor/convolutional-neural-network-cnn-simplified-ecafd4ee52c5 4. https://towardsdatascience.com/a-comprehensive-guide-to-convolutional-neural-networks-the-eli5-way-3bd2b1164a53 5. https://www.researchgate.net/figure/The-architecture-of-Single-Shot-Multibox-Detector-SSD-It-considers- only-two- stage-by_fig9_327491507 6. Wei Liu1, Dragomir Anguelov2, Dumitru Erhan3, Christian Szegedy3, Scott Reed4, Cheng-Yang Fu1, Alexander C. Berg1(2016) 7. Sermanet,P.,Eigen,D.,Zhang,X.,Mathieu,M.,Fergus,R.,LeCun,Y.: Overfeat:Integrated recognition, localization and detection using convolutional networks. In: ICLR. (2014) 8. https://towardsdatascience.com/cnn-application-on-structured-data-automated-feature-extraction-8f2cd28d9a7e 9. https://towardsdatascience.com/a-comprehensive-guide-to-convolutional-neural-networks-the-eli5-way- 3bd2b1164a53 10.https://medium.com/@RaghavPrabhu/understanding-of-convolutional-neural-network-cnn-deep-learning- 99760835f148 https://machinethink.net/blog/object-detection/