You Only Look Once:
Unified, Real-Time Object Detection
Joseph Redmon, Santosh Divvala, Ross Girshick, Ali Farhadi (2016)
The YOLO Detection System
The YOLO detection system (1) resizes the input image to 448 × 448, (2) runs a single convolutional network on the image, and (3) thresholds the resulting detections by the model’s confidence.
https://www.jeremyjordan.me/object-detection-one-stage/
Non-maximum suppression
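The thresholding in step (3) and non-maximum suppression can be sketched together as below. This is a minimal greedy version for a single class, not the authors' implementation: the corner-format boxes, the function names `iou` and `nms`, and both threshold values are assumptions made for illustration.

```python
from typing import List, Sequence, Tuple

# Assumed box format: (x_min, y_min, x_max, y_max)
Box = Tuple[float, float, float, float]


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0


def nms(boxes: Sequence[Box], scores: Sequence[float],
        score_threshold: float = 0.25, iou_threshold: float = 0.5) -> List[int]:
    """Return indices of kept boxes, highest score first."""
    # Drop low-confidence detections, then visit the rest by descending score.
    order = sorted((i for i, s in enumerate(scores) if s >= score_threshold),
                   key=lambda i: scores[i], reverse=True)
    keep: List[int] = []
    for i in order:
        # Keep a box only if it does not overlap a kept box too strongly.
        if all(iou(boxes[i], boxes[k]) <= iou_threshold for k in keep):
            keep.append(i)
    return keep


boxes = [(0, 0, 10, 10), (1, 1, 11, 11), (20, 20, 30, 30), (0, 0, 5, 5)]
scores = [0.9, 0.8, 0.7, 0.1]
kept = nms(boxes, scores)
```

In this made-up example the second box overlaps the first with IoU 81/119 and is suppressed, and the fourth box falls below the score threshold, so `kept` holds the indices of the first and third boxes.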
Bounding Box, Confidence and Class Probability
YOLO reframes object detection as a regression problem.
• The image is divided into an S × S grid. Each grid cell predicts B bounding boxes (x, y, w, h), a confidence score for each box, and C class probabilities.
• These predictions are encoded as an S × S × (B ∗ 5 + C) tensor.
The confidence of the bounding box
Formally we define confidence as Pr(Object) ∗ IOU, where IOU is the intersection over union between the predicted box and the ground truth. If no object exists in that cell, the confidence score should be zero.
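Written out, the confidence definition above is the first line below. The second line is the class-specific score that the YOLO paper forms at test time by multiplying the conditional class probability with the box confidence; it is included here only as context for how the confidence is used.

```latex
\text{confidence} = \Pr(\text{Object}) \cdot \mathrm{IOU}^{\text{truth}}_{\text{pred}}

\Pr(\text{Class}_i \mid \text{Object}) \cdot \Pr(\text{Object}) \cdot \mathrm{IOU}^{\text{truth}}_{\text{pred}}
  = \Pr(\text{Class}_i) \cdot \mathrm{IOU}^{\text{truth}}_{\text{pred}}
```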
The Neural Network Architecture
For evaluating YOLO on PASCAL VOC, we use S = 7, B = 2. PASCAL VOC has 20 labelled
classes so C = 20. Our final prediction is a 7 × 7 × (2∗5 + 20) tensor.
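The tensor size can be checked with a few lines of Python. The sketch below also splits one cell's vector into boxes and class probabilities; the ordering inside the vector (all boxes first, then classes) and the function names are assumptions for illustration, since the slides do not specify the layout.

```python
from typing import List, Sequence, Tuple


def output_shape(S: int, B: int, C: int) -> Tuple[int, int, int]:
    """Shape of the prediction tensor: S x S x (B*5 + C)."""
    return (S, S, B * 5 + C)


def split_cell(cell: Sequence[float], B: int, C: int):
    """Split one grid cell's vector into B boxes and C class probabilities.

    Assumed layout: [x, y, w, h, confidence] repeated B times, then C class values.
    """
    if len(cell) != B * 5 + C:
        raise ValueError("cell vector has the wrong length")
    boxes: List[Tuple[float, ...]] = [tuple(cell[5 * b:5 * b + 5]) for b in range(B)]
    class_probs = list(cell[5 * B:])
    return boxes, class_probs


shape = output_shape(S=7, B=2, C=20)          # PASCAL VOC settings from the text
boxes, class_probs = split_cell(list(range(30)), B=2, C=20)
```

With S = 7, B = 2 and C = 20, each cell carries 2∗5 + 20 = 30 values, so the full prediction has 7 ∗ 7 ∗ 30 = 1470 numbers.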
Loss Function
The size of the bounding box
The confidence of the bounding box
The probability of the class
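For reference, the labelled parts correspond to terms of the sum-squared-error loss given in the YOLO paper, written out below. The first line covers box position, the second box size, the third and fourth confidence (for cells with and without an object), and the last class probability. The symbols follow the paper, which sets λ_coord = 5 and λ_noobj = 0.5.

```latex
\begin{aligned}
\mathcal{L} ={}& \lambda_{\text{coord}} \sum_{i=0}^{S^2} \sum_{j=0}^{B} \mathbb{1}_{ij}^{\text{obj}}
      \left[ (x_i - \hat{x}_i)^2 + (y_i - \hat{y}_i)^2 \right] \\
 &+ \lambda_{\text{coord}} \sum_{i=0}^{S^2} \sum_{j=0}^{B} \mathbb{1}_{ij}^{\text{obj}}
      \left[ \left(\sqrt{w_i} - \sqrt{\hat{w}_i}\right)^2 + \left(\sqrt{h_i} - \sqrt{\hat{h}_i}\right)^2 \right] \\
 &+ \sum_{i=0}^{S^2} \sum_{j=0}^{B} \mathbb{1}_{ij}^{\text{obj}} \left( C_i - \hat{C}_i \right)^2 \\
 &+ \lambda_{\text{noobj}} \sum_{i=0}^{S^2} \sum_{j=0}^{B} \mathbb{1}_{ij}^{\text{noobj}} \left( C_i - \hat{C}_i \right)^2 \\
 &+ \sum_{i=0}^{S^2} \mathbb{1}_{i}^{\text{obj}} \sum_{c \in \text{classes}} \left( p_i(c) - \hat{p}_i(c) \right)^2
\end{aligned}
```

The square roots on width and height make the same absolute error count more for small boxes than for large ones.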
Evaluation Metric
Confusion Matrix
Intersection Over Union (IOU) and Object Detection
https://devblogs.nvidia.com/exploring-spacenet-dataset-using-digits/
Recall-Precision Curve and Average Precision
https://acutecaretesting.org/en/articles/precision-recall-curves-what-are-they-and-how-are-they-used
Ideally, precision does not decrease as recall increases.
In general, Average Precision (AP) is defined as the area under the precision-recall curve.
https://medium.com/@jonathan_hui/map-mean-average-precision-for-object-detection-45c121a31173
The dataset contains only 5 apples. We collect all the predictions made for apples in all the images and rank them in descending order of predicted confidence.
The second column indicates whether the prediction is correct or not. In this example, a prediction is correct if IoU ≥ 0.5.
Average Precision and mean Average Precision
Average Precision (AP) is the area under the precision-recall curve.
For the 11-point interpolated AP, the recall axis from 0 to 1.0 is divided into 11 equally spaced points (0, 0.1, ..., 1.0), and the interpolated precision is averaged over those points.
mAP (mean average precision) is the average of the AP for each class.
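The 11-point interpolated AP and mAP can be sketched as follows. The ranked correctness flags are made-up illustration data for a class with 5 ground-truth objects, not the table from the slides, and the function names are assumptions.

```python
from typing import Dict, List, Sequence, Tuple


def precision_recall(correct_flags: Sequence[bool],
                     num_ground_truth: int) -> List[Tuple[float, float]]:
    """(precision, recall) after each prediction, ranked by descending confidence."""
    points = []
    true_positives = 0
    for rank, is_correct in enumerate(correct_flags, start=1):
        true_positives += int(is_correct)
        points.append((true_positives / rank, true_positives / num_ground_truth))
    return points


def ap_11_point(correct_flags: Sequence[bool], num_ground_truth: int) -> float:
    """Average of interpolated precision at recall 0, 0.1, ..., 1.0."""
    points = precision_recall(correct_flags, num_ground_truth)
    total = 0.0
    for i in range(11):
        level = i / 10
        # Interpolated precision: best precision at any recall >= this level.
        total += max((p for p, r in points if r >= level), default=0.0)
    return total / 11


def mean_ap(ap_per_class: Dict[str, float]) -> float:
    """mAP: plain mean of the per-class AP values."""
    return sum(ap_per_class.values()) / len(ap_per_class)


# Hypothetical ranked predictions for one class (True = IoU >= 0.5 with an unmatched object).
flags = [True, True, False, False, False, True, True, False, False, True]
ap = ap_11_point(flags, num_ground_truth=5)
```

For these flags the interpolated precision is 1.0 at the five recall levels up to 0.4, 4/7 at the four levels from 0.5 to 0.8, and 0.5 at 0.9 and 1.0, which gives an AP of about 0.753.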
Experimental Results
Comparison to Other Real-Time Systems
Fast YOLO uses a neural network with fewer convolutional layers (9 instead of 24) and fewer filters in those layers.
YOLO is 10 mAP points more accurate than the fast version while still running well above real-time speed.
VOC 2007 Error Analysis
• Correct: correct class and IOU > .5
• Localization: correct class, .1 < IOU < .5
• Similar: class is similar, IOU > .1
• Other: class is wrong, IOU > .1
• Background: IOU < .1 for any object
Localization errors account for more of YOLO’s errors than all other sources combined. Fast R-CNN makes far fewer localization errors but many more background errors.
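The error categories above translate into a small decision function. This is a sketch under assumptions: the caller is expected to supply the IOU with the best-matching ground-truth object and how the predicted class relates to it (`"correct"`, `"similar"` or `"other"`), and the handling of the exact boundary values .1 and .5 is a choice made here, since the list leaves them open.

```python
def error_type(iou: float, class_match: str) -> str:
    """Classify one detection into the error-analysis categories.

    iou         -- overlap with the best-matching ground-truth object
    class_match -- "correct", "similar" or "other" (assumed to be precomputed)
    """
    if class_match not in ("correct", "similar", "other"):
        raise ValueError("unknown class_match value")
    if iou < 0.1:
        return "background"          # barely overlaps any object
    if class_match == "correct":
        # Assumption: an IOU of exactly .5 counts as a localization error.
        return "correct" if iou > 0.5 else "localization"
    if class_match == "similar":
        return "similar"
    return "other"


examples = [
    error_type(0.7, "correct"),
    error_type(0.3, "correct"),
    error_type(0.3, "similar"),
    error_type(0.6, "other"),
    error_type(0.05, "correct"),
]
```

Counting these labels over the top detections of each class gives the kind of breakdown compared between YOLO and Fast R-CNN.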
Qualitative Results