SlideShare a Scribd company logo
Semantic
Segmentation
Hello!
I am Frederick Apina
Machine Learning Engineer @ParrotAI
I am here because I love to give
presentations.
2
“When I think about strong
innovations in term of
automation, cognitive computing,
and artificial intelligence, they will
be coming a lot from Tanzania as
well.”
3
1.
What is semantic
segmentation?
5
6
Limitations
Still a bit rough since we’re only
drawing bounding boxes and don’t
really get an accurate idea of
object shape.
7
What if!?
8
Semantic Segmentation
Semantic Segmentation is to
label each pixel of an image with a
corresponding class of what is being
represented.
✗ commonly referred to as dense prediction.
2.
Applications of
Semantic
Segmentation
10
Autonomous Vehicles
11
Medical Surgeries
12
Medical Surgeries
13
Medical Images Diagnostics
3.
Representing the
Task
15
Our goal is to take either a RGB color image or a grayscale image and
output a segmentation map where each pixel contains a class label
represented as an integer.
16
We create our target by one-hot encoding the class labels - essentially
creating an output channel for each of the possible classes.
17
We can easily inspect a target by overlaying it onto the observation.
When we overlay a single channel of our target (or prediction), we refer to this
as a mask which illuminates the regions of an image where a specific class is
present.
3.
Constructing an
Architecture
A naive approach…
20
✗ Recall that for deep convolutional networks,
earlier layers tend to learn low-level concepts
while later layers develop more high-level (and
specialized) feature mappings. In order to
maintain expressiveness, we typically need to
increase the number of feature maps (channels)
as we get deeper in the network.
21
Solution?
Lucky for us..
One popular approach for image segmentation models is to follow
an encoder/decoder structure.
U-Net Architecture..
Consists of a
contracting path
to capture
context and
a symmetric expa
nding path that
enables precise
localization.
Advanced U-Net variants
The standard U-Net model consists of a series of
convolution operations for each "block" in the architecture.
Proposed: swap out the basic stacked convolution blocks in
favor of residual blocks. This residual block introduces short skip
connections (within the block) alongside the existing long skip
connections (between the corresponding feature maps of
encoder and decoder modules) found in the standard U-Net
structure.
Tiramisu: Full Convolution DenseNet
Tiramisu adopts the UNet design with downsampling, bottleneck, and upsampling paths
and skip connections. It replaces convolution and max pooling layers with Dense blocks
from the DenseNet architecture. Dense blocks contain residual connections.
Defining loss function
The most commonly used loss function for the task of image segmentation is a pixel-wise cross
entropy loss. This loss examines each pixel individually, comparing the class predictions (depth-wise
pixel vector) to our one-hot encoded target vector.
Deep Learning is an continuously-growing and a
relatively new concept, the vast amount of
resources can be a touch overwhelming for those
either looking to get into the field, or those
already engraved in it. A good way of cooping is to
get a good general knowledge of machine learning
and then find a good structured path to follow (be
a project or research).
27
Conclusion
28
Thanks!
Any questions?
You can find me at:
✗ Fred@parrotai.co.tz

More Related Content

What's hot (20)

PPTX
Digit recognition using neural network
shachibattar
 
PPTX
Machine Learning - Convolutional Neural Network
Richard Kuo
 
PPTX
Computer Vision for Beginners
Sanghamitra Deb
 
PDF
Offline Character Recognition Using Monte Carlo Method and Neural Network
ijaia
 
PPT
Person re-identification, PhD Day 2011
Riccardo Satta
 
PPTX
Dissimilarity-based people re-identification and search for intelligent video...
Riccardo Satta
 
PPT
Exploiting Dissimilarity Representations for Person Re-Identification
Riccardo Satta
 
PDF
Using Multi-layered Feed-forward Neural Network (MLFNN) Architecture as Bidir...
IOSR Journals
 
PDF
Handwritten Digit Recognition using Convolutional Neural Networks
IRJET Journal
 
PPTX
Convolutional neural network from VGG to DenseNet
SungminYou
 
PPTX
Comparison of Learning Algorithms for Handwritten Digit Recognition
Safaa Alnabulsi
 
PDF
GTSRB Traffic Sign recognition using machine learning
Rupali Aher
 
DOCX
IEEE 2014 MATLAB IMAGE PROCESSING PROJECTS Fingerprint compression-based-on-...
IEEEBEBTECHSTUDENTPROJECTS
 
PDF
Kq3518291832
IJERA Editor
 
PDF
Manifold learning with application to object recognition
zukun
 
PPTX
Image classification with Deep Neural Networks
Yogendra Tamang
 
PDF
A survey on the layers of convolutional Neural Network
Sasanko Sekhar Gantayat
 
DOCX
Digit recognition using mnist database
btandale
 
PPTX
Transfer Learning in NLP: A Survey
NUPUR YADAV
 
Digit recognition using neural network
shachibattar
 
Machine Learning - Convolutional Neural Network
Richard Kuo
 
Computer Vision for Beginners
Sanghamitra Deb
 
Offline Character Recognition Using Monte Carlo Method and Neural Network
ijaia
 
Person re-identification, PhD Day 2011
Riccardo Satta
 
Dissimilarity-based people re-identification and search for intelligent video...
Riccardo Satta
 
Exploiting Dissimilarity Representations for Person Re-Identification
Riccardo Satta
 
Using Multi-layered Feed-forward Neural Network (MLFNN) Architecture as Bidir...
IOSR Journals
 
Handwritten Digit Recognition using Convolutional Neural Networks
IRJET Journal
 
Convolutional neural network from VGG to DenseNet
SungminYou
 
Comparison of Learning Algorithms for Handwritten Digit Recognition
Safaa Alnabulsi
 
GTSRB Traffic Sign recognition using machine learning
Rupali Aher
 
IEEE 2014 MATLAB IMAGE PROCESSING PROJECTS Fingerprint compression-based-on-...
IEEEBEBTECHSTUDENTPROJECTS
 
Kq3518291832
IJERA Editor
 
Manifold learning with application to object recognition
zukun
 
Image classification with Deep Neural Networks
Yogendra Tamang
 
A survey on the layers of convolutional Neural Network
Sasanko Sekhar Gantayat
 
Digit recognition using mnist database
btandale
 
Transfer Learning in NLP: A Survey
NUPUR YADAV
 

Similar to Introduction to Segmentation in Computer vision (20)

PPTX
AaSeminar_Template.pptx
ManojGowdaKb
 
PDF
SimCLR: A Simple Framework for Contrastive Learning of Visual Representations
ynxm25hpxp
 
PPTX
Image Segmentation: Approaches and Challenges
Apache MXNet
 
PDF
Intro to Semantic Segmentation Using Deep Learning
Deep Learning Analytical Solutions​​
 
PDF
IRJET- Semantic Segmentation using Deep Learning
IRJET Journal
 
PDF
Semantic Segmentation - Fully Convolutional Networks for Semantic Segmentation
岳華 杜
 
PDF
Image segmentation with deep learning
Antonio Rueda-Toicen
 
PDF
Image Segmentation (D3L1 2017 UPC Deep Learning for Computer Vision)
Universitat Politècnica de Catalunya
 
PPTX
Semantic segmentation with Convolutional Neural Network Approaches
UMBC
 
PPTX
image_segmentation_ppt.pptx
fgdg12
 
PPTX
U-Netpresentation.pptx
NoorUlHaq47
 
PPTX
U-Net (1).pptx
Changjin Lee
 
PDF
Semantic Segmentation - Míriam Bellver - UPC Barcelona 2018
Universitat Politècnica de Catalunya
 
PDF
A brief introduction to recent segmentation methods
Shunta Saito
 
PDF
#6 PyData Warsaw: Deep learning for image segmentation
Matthew Opala
 
PPTX
Review-image-segmentation-by-deep-learning
Trong-An Bui
 
PPTX
UNetEliyaLaialy (2).pptx
NoorUlHaq47
 
PPTX
vision_image_segmentation.pptx
vrushalikanawade2
 
PDF
Image Segmentation with Deep Learning - Xavier Giro & Carles Ventura - ISSonD...
Universitat Politècnica de Catalunya
 
PPTX
Semantic Segmentation on Satellite Imagery
RAHUL BHOJWANI
 
AaSeminar_Template.pptx
ManojGowdaKb
 
SimCLR: A Simple Framework for Contrastive Learning of Visual Representations
ynxm25hpxp
 
Image Segmentation: Approaches and Challenges
Apache MXNet
 
Intro to Semantic Segmentation Using Deep Learning
Deep Learning Analytical Solutions​​
 
IRJET- Semantic Segmentation using Deep Learning
IRJET Journal
 
Semantic Segmentation - Fully Convolutional Networks for Semantic Segmentation
岳華 杜
 
Image segmentation with deep learning
Antonio Rueda-Toicen
 
Image Segmentation (D3L1 2017 UPC Deep Learning for Computer Vision)
Universitat Politècnica de Catalunya
 
Semantic segmentation with Convolutional Neural Network Approaches
UMBC
 
image_segmentation_ppt.pptx
fgdg12
 
U-Netpresentation.pptx
NoorUlHaq47
 
U-Net (1).pptx
Changjin Lee
 
Semantic Segmentation - Míriam Bellver - UPC Barcelona 2018
Universitat Politècnica de Catalunya
 
A brief introduction to recent segmentation methods
Shunta Saito
 
#6 PyData Warsaw: Deep learning for image segmentation
Matthew Opala
 
Review-image-segmentation-by-deep-learning
Trong-An Bui
 
UNetEliyaLaialy (2).pptx
NoorUlHaq47
 
vision_image_segmentation.pptx
vrushalikanawade2
 
Image Segmentation with Deep Learning - Xavier Giro & Carles Ventura - ISSonD...
Universitat Politècnica de Catalunya
 
Semantic Segmentation on Satellite Imagery
RAHUL BHOJWANI
 
Ad

Recently uploaded (20)

PPTX
RESEARCH-FINAL-GROUP-3, about the final .pptx
gwapokoha1
 
PDF
5991-5857_Agilent_MS_Theory_EN (1).pdf. pdf
NohaSalah45
 
PPT
Reliability Monitoring of Aircrfat commerce
Rizk2
 
PDF
Business Automation Solution with Excel 1.1.pdf
Vivek Kedia
 
DOCX
ACCOMPLISHMENT AS OF MAY 15 RCT ACCOMPLISHMENT AS OF MAY 15 RCT ACCOMPLISHMEN...
JoemarAgbayani1
 
PPTX
Feb 2021 Ransomware Recovery presentation.pptx
enginsayin1
 
PPTX
big data eco system fundamentals of data science
arivukarasi
 
PDF
Loading Data into Snowflake (Bulk & Stream)
Accentfuture
 
PPTX
Discrete Logarithm Problem in Cryptography (1).pptx
meshablinx38
 
PDF
Blood pressure (3).pdfbdbsbsbhshshshhdhdhshshs
hernandezemma379
 
PDF
Datàaaaaaaaaaengineeeeeeeeeeeeeeeeeeeeeee
juadsr96
 
DOCX
🧩 1. Solvent R-WPS Office work scientific
NohaSalah45
 
PDF
SaleServicereport and SaleServicereport
2251330007
 
PPTX
Project_Update_Summary.for the use from PM
Odysseas Lekatsas
 
PDF
TCU EVALUATION FACULTY TCU Taguig City 1st Semester 2017-2018
MELJUN CORTES
 
PDF
Kafka Use Cases Real-World Applications
Accentfuture
 
PPTX
Data Analytics using sparkabcdefghi.pptx
KarkuzhaliS3
 
PPTX
Generative AI Boost Data Governance and Quality- Tejasvi Addagada
Tejasvi Addagada
 
PDF
Orchestrating Data Workloads With Airflow.pdf
ssuserae5511
 
PPTX
Module-2_3-1eentzyssssssssssssssssssssss.pptx
ShahidHussain66691
 
RESEARCH-FINAL-GROUP-3, about the final .pptx
gwapokoha1
 
5991-5857_Agilent_MS_Theory_EN (1).pdf. pdf
NohaSalah45
 
Reliability Monitoring of Aircrfat commerce
Rizk2
 
Business Automation Solution with Excel 1.1.pdf
Vivek Kedia
 
ACCOMPLISHMENT AS OF MAY 15 RCT ACCOMPLISHMENT AS OF MAY 15 RCT ACCOMPLISHMEN...
JoemarAgbayani1
 
Feb 2021 Ransomware Recovery presentation.pptx
enginsayin1
 
big data eco system fundamentals of data science
arivukarasi
 
Loading Data into Snowflake (Bulk & Stream)
Accentfuture
 
Discrete Logarithm Problem in Cryptography (1).pptx
meshablinx38
 
Blood pressure (3).pdfbdbsbsbhshshshhdhdhshshs
hernandezemma379
 
Datàaaaaaaaaaengineeeeeeeeeeeeeeeeeeeeeee
juadsr96
 
🧩 1. Solvent R-WPS Office work scientific
NohaSalah45
 
SaleServicereport and SaleServicereport
2251330007
 
Project_Update_Summary.for the use from PM
Odysseas Lekatsas
 
TCU EVALUATION FACULTY TCU Taguig City 1st Semester 2017-2018
MELJUN CORTES
 
Kafka Use Cases Real-World Applications
Accentfuture
 
Data Analytics using sparkabcdefghi.pptx
KarkuzhaliS3
 
Generative AI Boost Data Governance and Quality- Tejasvi Addagada
Tejasvi Addagada
 
Orchestrating Data Workloads With Airflow.pdf
ssuserae5511
 
Module-2_3-1eentzyssssssssssssssssssssss.pptx
ShahidHussain66691
 
Ad

Introduction to Segmentation in Computer vision

  • 2. Hello! I am Frederick Apina Machine Learning Engineer @ParrotAI I am here because I love to give presentations. 2
  • 3. “When I think about strong innovations in term of automation, cognitive computing, and artificial intelligence, they will be coming a lot from Tanzania as well.” 3
  • 5. 5
  • 6. 6 Limitations Still a bit rough since we’re only drawing bounding boxes and don’t really get an accurate idea of object shape.
  • 8. 8 Semantic Segmentation Semantic Segmentation is to label each pixel of an image with a corresponding class of what is being represented. ✗ commonly referred to as dense prediction.
  • 15. 15 Our goal is to take either a RGB color image or a grayscale image and output a segmentation map where each pixel contains a class label represented as an integer.
  • 16. 16 We create our target by one-hot encoding the class labels - essentially creating an output channel for each of the possible classes.
  • 17. 17 We can easily inspect a target by overlaying it onto the observation. When we overlay a single channel of our target (or prediction), we refer to this as a mask which illuminates the regions of an image where a specific class is present.
  • 20. 20 ✗ Recall that for deep convolutional networks, earlier layers tend to learn low-level concepts while later layers develop more high-level (and specialized) feature mappings. In order to maintain expressiveness, we typically need to increase the number of feature maps (channels) as we get deeper in the network.
  • 22. Lucky for us.. One popular approach for image segmentation models is to follow an encoder/decoder structure.
  • 23. U-Net Architecture.. Consists of a contracting path to capture context and a symmetric expa nding path that enables precise localization.
  • 24. Advanced U-Net variants The standard U-Net model consists of a series of convolution operations for each "block" in the architecture. Proposed: swap out the basic stacked convolution blocks in favor of residual blocks. This residual block introduces short skip connections (within the block) alongside the existing long skip connections (between the corresponding feature maps of encoder and decoder modules) found in the standard U-Net structure.
  • 25. Tiramisu: Full Convolution DenseNet Tiramisu adopts the UNet design with downsampling, bottleneck, and upsampling paths and skip connections. It replaces convolution and max pooling layers with Dense blocks from the DenseNet architecture. Dense blocks contain residual connections.
  • 26. Defining loss function The most commonly used loss function for the task of image segmentation is a pixel-wise cross entropy loss. This loss examines each pixel individually, comparing the class predictions (depth-wise pixel vector) to our one-hot encoded target vector.
  • 27. Deep Learning is an continuously-growing and a relatively new concept, the vast amount of resources can be a touch overwhelming for those either looking to get into the field, or those already engraved in it. A good way of cooping is to get a good general knowledge of machine learning and then find a good structured path to follow (be a project or research). 27 Conclusion