SlideShare a Scribd company logo
Intro to Semantic Segmentation Using Deep Learning
================================================================
Semantic segmentation is the task of classifying each and every pixel in an image
into a class as shown in the image below. Here you can see that all persons are red,
the road is purple, the vehicles are blue, street signs are yellow etc.
Semantic segmentation is different from instance segmentation which is that
different objects of the same class will have different labels as in person1, person2
and hence different colours. The picture below very crisply illustrates the difference
between instance and semantic segmentation. If you are interested in learning more
about classification and object detection, please check out my blog here.
One important question can be why do we need this granularity of understanding
pixel by pixel location?
Some examples that come to mind are:
i) Self Driving Cars — May need to know exactly where another car is on the road or
the location of a human crossing the road
ii) Robotic systems — Robots that say join two parts together will perform better if
they know the exact locations of the two parts
iii) Damage Detection - It may be important in this case to know the exact extent of
damage
Deep Learning Model Architectures for Semantic Segmentation
Lets now talk about 3 model architectures that do semantic segmentation.
1. Fully Convolutional Network (FCN)
FCN is a popular algorithm for doing semantic segmentation. This model uses
various blocks of convolution and max pool layers to first decompress an image to
1/32th of its original size. It then makes a class prediction at this level of granularity.
Finally it uses up sampling and deconvolution layers to resize the image to its
original dimensions.
These models typically don't have any fully connected layers. The goal of down
sampling steps is to capture semantic/contextual information while the goal of up
sampling is to recover spatial information. Also there are no limitations on image
size. The final image is the same size as the original image. To fully recover the fine
grained spatial information lost in down sampling, skip connections are used. A skip
connection is a connection that bypasses at least one layer. Here it is used to pass
information from the down sampling step to the up sampling step. Merging features
from various resolution levels helps combining context information with spatial
information
Contacts Us:-
Address: - 110 Fontainbleau Drive, Toronto
Telephone: - 647-550-0256
Email: - deeplearning33@gmail.com

More Related Content

What's hot (12)

PDF
NEW ONTOLOGY RETRIEVAL IMAGE METHOD IN 5K COREL IMAGES
ijcax
 
DOC
SchuurmansLecture.doc
butest
 
PPTX
SCENE TEXT RECOGNITION IN MOBILE APPLICATION BY CHARACTER DESCRIPTOR AND STRU...
Cheriyan K M
 
DOCX
Multiview alignment hashing for
jpstudcorner
 
PPT
Rafi Zachut's slides on class specific segmentation
wolf
 
PDF
Nips 2016 tutorial generative adversarial networks review
Minho Heo
 
PDF
Unimodal Multi-Feature Fusion and one-dimensional Hidden Markov Models for Lo...
IJECEIAES
 
PDF
IEEE PROJECT TOPICS &ABSTRACTS on image processing
aswin tbbc
 
PPTX
Static Spatial Graph Features
Niklas Elmqvist
 
PPT
A Review of Relational Machine Learning(SRL) for Knowledge Graphs
yalda akbarzadeh
 
DOCX
Bt9301 computer graphics
smumbahelp
 
PDF
A MULTI-STREAM HMM APPROACH TO OFFLINE HANDWRITTEN ARABIC WORD RECOGNITION
ijnlc
 
NEW ONTOLOGY RETRIEVAL IMAGE METHOD IN 5K COREL IMAGES
ijcax
 
SchuurmansLecture.doc
butest
 
SCENE TEXT RECOGNITION IN MOBILE APPLICATION BY CHARACTER DESCRIPTOR AND STRU...
Cheriyan K M
 
Multiview alignment hashing for
jpstudcorner
 
Rafi Zachut's slides on class specific segmentation
wolf
 
Nips 2016 tutorial generative adversarial networks review
Minho Heo
 
Unimodal Multi-Feature Fusion and one-dimensional Hidden Markov Models for Lo...
IJECEIAES
 
IEEE PROJECT TOPICS &ABSTRACTS on image processing
aswin tbbc
 
Static Spatial Graph Features
Niklas Elmqvist
 
A Review of Relational Machine Learning(SRL) for Knowledge Graphs
yalda akbarzadeh
 
Bt9301 computer graphics
smumbahelp
 
A MULTI-STREAM HMM APPROACH TO OFFLINE HANDWRITTEN ARABIC WORD RECOGNITION
ijnlc
 

Similar to Intro to Semantic Segmentation Using Deep Learning (20)

PPTX
AaSeminar_Template.pptx
ManojGowdaKb
 
PPTX
Introduction to Segmentation in Computer vision
ParrotAI
 
PDF
Semantic Segmentation - Míriam Bellver - UPC Barcelona 2018
Universitat Politècnica de Catalunya
 
PDF
SimCLR: A Simple Framework for Contrastive Learning of Visual Representations
ynxm25hpxp
 
PPTX
Image Segmentation: Approaches and Challenges
Apache MXNet
 
PPTX
Semantic segmentation with Convolutional Neural Network Approaches
UMBC
 
PDF
Image Segmentation (D3L1 2017 UPC Deep Learning for Computer Vision)
Universitat Politècnica de Catalunya
 
PDF
IRJET- Semantic Segmentation using Deep Learning
IRJET Journal
 
PDF
Semantic Segmentation - Fully Convolutional Networks for Semantic Segmentation
岳華 杜
 
PPTX
Image Segmentation Using Deep Learning : A survey
NUPUR YADAV
 
PPTX
Review-image-segmentation-by-deep-learning
Trong-An Bui
 
PPTX
Semantic Segmentation on Satellite Imagery
RAHUL BHOJWANI
 
PPTX
DefenseTalk_Trimmed
Abhishek Sharma
 
PPTX
cityscapes Semantic Segmentation using FCN, U Net and U Net++.pptx
faizalmistry5
 
PDF
A brief introduction to recent segmentation methods
Shunta Saito
 
PDF
Semantic Video Segmentation with Using Ensemble of Particular Classifiers and...
ITIIIndustries
 
PDF
The Future of Health Monitoring: Advances in Wearable Sensor Data Processing
IgMin Publications Inc.
 
PPTX
Image segmentation hj_cho
Hyungjoo Cho
 
PDF
Image Segmentation with Deep Learning - Xavier Giro & Carles Ventura - ISSonD...
Universitat Politècnica de Catalunya
 
PDF
"Semantic Segmentation for Scene Understanding: Algorithms and Implementation...
Edge AI and Vision Alliance
 
AaSeminar_Template.pptx
ManojGowdaKb
 
Introduction to Segmentation in Computer vision
ParrotAI
 
Semantic Segmentation - Míriam Bellver - UPC Barcelona 2018
Universitat Politècnica de Catalunya
 
SimCLR: A Simple Framework for Contrastive Learning of Visual Representations
ynxm25hpxp
 
Image Segmentation: Approaches and Challenges
Apache MXNet
 
Semantic segmentation with Convolutional Neural Network Approaches
UMBC
 
Image Segmentation (D3L1 2017 UPC Deep Learning for Computer Vision)
Universitat Politècnica de Catalunya
 
IRJET- Semantic Segmentation using Deep Learning
IRJET Journal
 
Semantic Segmentation - Fully Convolutional Networks for Semantic Segmentation
岳華 杜
 
Image Segmentation Using Deep Learning : A survey
NUPUR YADAV
 
Review-image-segmentation-by-deep-learning
Trong-An Bui
 
Semantic Segmentation on Satellite Imagery
RAHUL BHOJWANI
 
DefenseTalk_Trimmed
Abhishek Sharma
 
cityscapes Semantic Segmentation using FCN, U Net and U Net++.pptx
faizalmistry5
 
A brief introduction to recent segmentation methods
Shunta Saito
 
Semantic Video Segmentation with Using Ensemble of Particular Classifiers and...
ITIIIndustries
 
The Future of Health Monitoring: Advances in Wearable Sensor Data Processing
IgMin Publications Inc.
 
Image segmentation hj_cho
Hyungjoo Cho
 
Image Segmentation with Deep Learning - Xavier Giro & Carles Ventura - ISSonD...
Universitat Politècnica de Catalunya
 
"Semantic Segmentation for Scene Understanding: Algorithms and Implementation...
Edge AI and Vision Alliance
 
Ad

Recently uploaded (20)

PDF
99 Bottles of Trust on the Wall — Operational Principles for Trust in Cyber C...
treyka
 
PPSX
Usergroup - OutSystems Architecture.ppsx
Kurt Vandevelde
 
PPTX
Enabling the Digital Artisan – keynote at ICOCI 2025
Alan Dix
 
PDF
🚀 Let’s Build Our First Slack Workflow! 🔧.pdf
SanjeetMishra29
 
PDF
Bitkom eIDAS Summit | European Business Wallet: Use Cases, Macroeconomics, an...
Carsten Stoecker
 
PPTX
Reimaginando la Ciberdefensa: De Copilots a Redes de Agentes
Cristian Garcia G.
 
PDF
Bridging CAD, IBM TRIRIGA & GIS with FME: The Portland Public Schools Case
Safe Software
 
PPTX
Paycifi - Programmable Trust_Breakfast_PPTXT
FinTech Belgium
 
PDF
Understanding The True Cost of DynamoDB Webinar
ScyllaDB
 
PDF
Hello I'm "AI" Your New _________________
Dr. Tathagat Varma
 
PDF
Kubernetes - Architecture & Components.pdf
geethak285
 
PDF
Proactive Server and System Monitoring with FME: Using HTTP and System Caller...
Safe Software
 
PDF
''Taming Explosive Growth: Building Resilience in a Hyper-Scaled Financial Pl...
Fwdays
 
PDF
Understanding AI Optimization AIO, LLMO, and GEO
CoDigital
 
PDF
“A Re-imagination of Embedded Vision System Design,” a Presentation from Imag...
Edge AI and Vision Alliance
 
PPTX
CapCut Pro PC Crack Latest Version Free Free
josanj305
 
PDF
Automating the Geo-Referencing of Historic Aerial Photography in Flanders
Safe Software
 
PDF
ArcGIS Utility Network Migration - The Hunter Water Story
Safe Software
 
PDF
Why aren't you using FME Flow's CPU Time?
Safe Software
 
PPTX
Smarter Governance with AI: What Every Board Needs to Know
OnBoard
 
99 Bottles of Trust on the Wall — Operational Principles for Trust in Cyber C...
treyka
 
Usergroup - OutSystems Architecture.ppsx
Kurt Vandevelde
 
Enabling the Digital Artisan – keynote at ICOCI 2025
Alan Dix
 
🚀 Let’s Build Our First Slack Workflow! 🔧.pdf
SanjeetMishra29
 
Bitkom eIDAS Summit | European Business Wallet: Use Cases, Macroeconomics, an...
Carsten Stoecker
 
Reimaginando la Ciberdefensa: De Copilots a Redes de Agentes
Cristian Garcia G.
 
Bridging CAD, IBM TRIRIGA & GIS with FME: The Portland Public Schools Case
Safe Software
 
Paycifi - Programmable Trust_Breakfast_PPTXT
FinTech Belgium
 
Understanding The True Cost of DynamoDB Webinar
ScyllaDB
 
Hello I'm "AI" Your New _________________
Dr. Tathagat Varma
 
Kubernetes - Architecture & Components.pdf
geethak285
 
Proactive Server and System Monitoring with FME: Using HTTP and System Caller...
Safe Software
 
''Taming Explosive Growth: Building Resilience in a Hyper-Scaled Financial Pl...
Fwdays
 
Understanding AI Optimization AIO, LLMO, and GEO
CoDigital
 
“A Re-imagination of Embedded Vision System Design,” a Presentation from Imag...
Edge AI and Vision Alliance
 
CapCut Pro PC Crack Latest Version Free Free
josanj305
 
Automating the Geo-Referencing of Historic Aerial Photography in Flanders
Safe Software
 
ArcGIS Utility Network Migration - The Hunter Water Story
Safe Software
 
Why aren't you using FME Flow's CPU Time?
Safe Software
 
Smarter Governance with AI: What Every Board Needs to Know
OnBoard
 
Ad

Intro to Semantic Segmentation Using Deep Learning

  • 1. Intro to Semantic Segmentation Using Deep Learning ================================================================ Semantic segmentation is the task of classifying each and every pixel in an image into a class as shown in the image below. Here you can see that all persons are red, the road is purple, the vehicles are blue, street signs are yellow etc. Semantic segmentation is different from instance segmentation which is that different objects of the same class will have different labels as in person1, person2 and hence different colours. The picture below very crisply illustrates the difference between instance and semantic segmentation. If you are interested in learning more about classification and object detection, please check out my blog here.
  • 2. One important question can be why do we need this granularity of understanding pixel by pixel location? Some examples that come to mind are: i) Self Driving Cars — May need to know exactly where another car is on the road or the location of a human crossing the road ii) Robotic systems — Robots that say join two parts together will perform better if they know the exact locations of the two parts iii) Damage Detection - It may be important in this case to know the exact extent of damage Deep Learning Model Architectures for Semantic Segmentation Lets now talk about 3 model architectures that do semantic segmentation. 1. Fully Convolutional Network (FCN) FCN is a popular algorithm for doing semantic segmentation. This model uses various blocks of convolution and max pool layers to first decompress an image to 1/32th of its original size. It then makes a class prediction at this level of granularity. Finally it uses up sampling and deconvolution layers to resize the image to its original dimensions. These models typically don't have any fully connected layers. The goal of down sampling steps is to capture semantic/contextual information while the goal of up sampling is to recover spatial information. Also there are no limitations on image size. The final image is the same size as the original image. To fully recover the fine grained spatial information lost in down sampling, skip connections are used. A skip
  • 3. connection is a connection that bypasses at least one layer. Here it is used to pass information from the down sampling step to the up sampling step. Merging features from various resolution levels helps combining context information with spatial information Contacts Us:- Address: - 110 Fontainbleau Drive, Toronto Telephone: - 647-550-0256 Email: - [email protected]