Introduction to Segmentation in Computer vision

0 likes203 views

Semantic segmentation is a dense prediction task that labels each pixel of an image with a class. It has applications in autonomous vehicles, medical imaging, and surgeries. Popular architectures for semantic segmentation include U-Net, which uses an encoder-decoder structure with skip connections, and Tiramisu, which uses dense blocks. The loss function commonly used is pixel-wise cross entropy loss, which examines predictions at each pixel.

Data & Analytics

Hello!
I am Frederick Apina
Machine Learning Engineer @ParrotAI
I am here because I love to give
presentations.
2

“When I think about strong
innovations in term of
automation, cognitive computing,
and artificial intelligence, they will
be coming a lot from Tanzania as
well.”
3

6
Limitations
Still a bit rough since we’re only
drawing bounding boxes and don’t
really get an accurate idea of
object shape.

8
Semantic Segmentation
Semantic Segmentation is to
label each pixel of an image with a
corresponding class of what is being
represented.
✗ commonly referred to as dense prediction.

2.
Applications of
Semantic
Segmentation

15
Our goal is to take either a RGB color image or a grayscale image and
output a segmentation map where each pixel contains a class label
represented as an integer.

16
We create our target by one-hot encoding the class labels - essentially
creating an output channel for each of the possible classes.

17
We can easily inspect a target by overlaying it onto the observation.
When we overlay a single channel of our target (or prediction), we refer to this
as a mask which illuminates the regions of an image where a specific class is
present.

20
✗ Recall that for deep convolutional networks,
earlier layers tend to learn low-level concepts
while later layers develop more high-level (and
specialized) feature mappings. In order to
maintain expressiveness, we typically need to
increase the number of feature maps (channels)
as we get deeper in the network.

Lucky for us..
One popular approach for image segmentation models is to follow
an encoder/decoder structure.

U-Net Architecture..
Consists of a
contracting path
to capture
context and
a symmetric expa
nding path that
enables precise
localization.

Advanced U-Net variants
The standard U-Net model consists of a series of
convolution operations for each "block" in the architecture.
Proposed: swap out the basic stacked convolution blocks in
favor of residual blocks. This residual block introduces short skip
connections (within the block) alongside the existing long skip
connections (between the corresponding feature maps of
encoder and decoder modules) found in the standard U-Net
structure.

Tiramisu: Full Convolution DenseNet
Tiramisu adopts the UNet design with downsampling, bottleneck, and upsampling paths
and skip connections. It replaces convolution and max pooling layers with Dense blocks
from the DenseNet architecture. Dense blocks contain residual connections.

Defining loss function
The most commonly used loss function for the task of image segmentation is a pixel-wise cross
entropy loss. This loss examines each pixel individually, comparing the class predictions (depth-wise
pixel vector) to our one-hot encoded target vector.

Deep Learning is an continuously-growing and a
relatively new concept, the vast amount of
resources can be a touch overwhelming for those
either looking to get into the field, or those
already engraved in it. A good way of cooping is to
get a good general knowledge of machine learning
and then find a good structured path to follow (be
a project or research).
27
Conclusion

28
Thanks!
Any questions?
You can find me at:
✗ Fred@parrotai.co.tz

More Related Content

What's hot (20)

PPTX

Digit recognition using neural networkshachibattar

PPTX

Machine Learning - Convolutional Neural NetworkRichard Kuo

PPTX

Computer Vision for BeginnersSanghamitra Deb

PDF

Offline Character Recognition Using Monte Carlo Method and Neural Networkijaia

PPT

Person re-identification, PhD Day 2011Riccardo Satta

PPTX

Dissimilarity-based people re-identification and search for intelligent video...Riccardo Satta

PPT

Exploiting Dissimilarity Representations for Person Re-IdentificationRiccardo Satta

PDF

Using Multi-layered Feed-forward Neural Network (MLFNN) Architecture as Bidir...IOSR Journals

PDF

Handwritten Digit Recognition using Convolutional Neural NetworksIRJET Journal

PPTX

Convolutional neural network from VGG to DenseNetSungminYou

PPTX

Comparison of Learning Algorithms for Handwritten Digit RecognitionSafaa Alnabulsi

PDF

GTSRB Traffic Sign recognition using machine learningRupali Aher

DOCX

IEEE 2014 MATLAB IMAGE PROCESSING PROJECTS Fingerprint compression-based-on-...IEEEBEBTECHSTUDENTPROJECTS

PDF

Kq3518291832IJERA Editor

PDF

Manifold learning with application to object recognitionzukun

PPTX

Image classification with Deep Neural NetworksYogendra Tamang

PDF

A survey on the layers of convolutional Neural NetworkSasanko Sekhar Gantayat

DOCX

Digit recognition using mnist databasebtandale

PPTX

CnnNirthika Rajendran

PPTX

Transfer Learning in NLP: A SurveyNUPUR YADAV

Digit recognition using neural networkshachibattar

Machine Learning - Convolutional Neural NetworkRichard Kuo

Computer Vision for BeginnersSanghamitra Deb

Offline Character Recognition Using Monte Carlo Method and Neural Networkijaia

Person re-identification, PhD Day 2011Riccardo Satta

Dissimilarity-based people re-identification and search for intelligent video...Riccardo Satta

Exploiting Dissimilarity Representations for Person Re-IdentificationRiccardo Satta

Using Multi-layered Feed-forward Neural Network (MLFNN) Architecture as Bidir...IOSR Journals

Handwritten Digit Recognition using Convolutional Neural NetworksIRJET Journal

Convolutional neural network from VGG to DenseNetSungminYou

Comparison of Learning Algorithms for Handwritten Digit RecognitionSafaa Alnabulsi

GTSRB Traffic Sign recognition using machine learningRupali Aher

IEEE 2014 MATLAB IMAGE PROCESSING PROJECTS Fingerprint compression-based-on-...IEEEBEBTECHSTUDENTPROJECTS

Kq3518291832IJERA Editor

Manifold learning with application to object recognitionzukun

Image classification with Deep Neural NetworksYogendra Tamang

A survey on the layers of convolutional Neural NetworkSasanko Sekhar Gantayat

Digit recognition using mnist databasebtandale

CnnNirthika Rajendran

Transfer Learning in NLP: A SurveyNUPUR YADAV

Similar to Introduction to Segmentation in Computer vision (20)

PPTX

AaSeminar_Template.pptxManojGowdaKb

PDF

SimCLR: A Simple Framework for Contrastive Learning of Visual Representationsynxm25hpxp

PPTX

Image Segmentation: Approaches and ChallengesApache MXNet

PDF

Intro to Semantic Segmentation Using Deep LearningDeep Learning Analytical Solutions

PDF

IRJET- Semantic Segmentation using Deep LearningIRJET Journal

PDF

Semantic Segmentation - Fully Convolutional Networks for Semantic Segmentation岳華杜

PDF

Image segmentation with deep learningAntonio Rueda-Toicen

PDF

Image Segmentation (D3L1 2017 UPC Deep Learning for Computer Vision)Universitat Politècnica de Catalunya

PPTX

Semantic segmentation with Convolutional Neural Network ApproachesUMBC

PPTX

image_segmentation_ppt.pptxfgdg12

PPTX

U-Netpresentation.pptxNoorUlHaq47

PPTX

U-Net (1).pptxChangjin Lee

PDF

Semantic Segmentation - Míriam Bellver - UPC Barcelona 2018Universitat Politècnica de Catalunya

PDF

A brief introduction to recent segmentation methodsShunta Saito

PDF

#6 PyData Warsaw: Deep learning for image segmentationMatthew Opala

PPTX

Review-image-segmentation-by-deep-learningTrong-An Bui

PPTX

UNetEliyaLaialy (2).pptxNoorUlHaq47

PPTX

vision_image_segmentation.pptxvrushalikanawade2

PDF

Image Segmentation with Deep Learning - Xavier Giro & Carles Ventura - ISSonD...Universitat Politècnica de Catalunya

PPTX

Semantic Segmentation on Satellite ImageryRAHUL BHOJWANI

AaSeminar_Template.pptxManojGowdaKb

SimCLR: A Simple Framework for Contrastive Learning of Visual Representationsynxm25hpxp

Image Segmentation: Approaches and ChallengesApache MXNet

Intro to Semantic Segmentation Using Deep LearningDeep Learning Analytical Solutions

IRJET- Semantic Segmentation using Deep LearningIRJET Journal

Semantic Segmentation - Fully Convolutional Networks for Semantic Segmentation岳華杜

Image segmentation with deep learningAntonio Rueda-Toicen

Image Segmentation (D3L1 2017 UPC Deep Learning for Computer Vision)Universitat Politècnica de Catalunya

Semantic segmentation with Convolutional Neural Network ApproachesUMBC

image_segmentation_ppt.pptxfgdg12

U-Netpresentation.pptxNoorUlHaq47

U-Net (1).pptxChangjin Lee

Semantic Segmentation - Míriam Bellver - UPC Barcelona 2018Universitat Politècnica de Catalunya

A brief introduction to recent segmentation methodsShunta Saito

#6 PyData Warsaw: Deep learning for image segmentationMatthew Opala

Review-image-segmentation-by-deep-learningTrong-An Bui

UNetEliyaLaialy (2).pptxNoorUlHaq47

vision_image_segmentation.pptxvrushalikanawade2

Image Segmentation with Deep Learning - Xavier Giro & Carles Ventura - ISSonD...Universitat Politècnica de Catalunya

Semantic Segmentation on Satellite ImageryRAHUL BHOJWANI

Recently uploaded (20)

PPTX

RESEARCH-FINAL-GROUP-3, about the final .pptxgwapokoha1

PDF

5991-5857_Agilent_MS_Theory_EN (1).pdf. pdfNohaSalah45

PPT

Reliability Monitoring of Aircrfat commerceRizk2

PDF

Business Automation Solution with Excel 1.1.pdfVivek Kedia

DOCX

ACCOMPLISHMENT AS OF MAY 15 RCT ACCOMPLISHMENT AS OF MAY 15 RCT ACCOMPLISHMEN...JoemarAgbayani1

PPTX

Feb 2021 Ransomware Recovery presentation.pptxenginsayin1

PPTX

big data eco system fundamentals of data sciencearivukarasi

PDF

Loading Data into Snowflake (Bulk & Stream)Accentfuture

PPTX

Discrete Logarithm Problem in Cryptography (1).pptxmeshablinx38

PDF

Blood pressure (3).pdfbdbsbsbhshshshhdhdhshshshernandezemma379

PDF

Datàaaaaaaaaaengineeeeeeeeeeeeeeeeeeeeeeejuadsr96

DOCX

🧩 1. Solvent R-WPS Office work scientificNohaSalah45

PDF

SaleServicereport and SaleServicereport2251330007

PPTX

Project_Update_Summary.for the use from PMOdysseas Lekatsas

PDF

TCU EVALUATION FACULTY TCU Taguig City 1st Semester 2017-2018MELJUN CORTES

PDF

Kafka Use Cases Real-World ApplicationsAccentfuture

PPTX

Data Analytics using sparkabcdefghi.pptxKarkuzhaliS3

PPTX

Generative AI Boost Data Governance and Quality- Tejasvi AddagadaTejasvi Addagada

PDF

Orchestrating Data Workloads With Airflow.pdfssuserae5511

PPTX

Module-2_3-1eentzyssssssssssssssssssssss.pptxShahidHussain66691

RESEARCH-FINAL-GROUP-3, about the final .pptxgwapokoha1

5991-5857_Agilent_MS_Theory_EN (1).pdf. pdfNohaSalah45

Reliability Monitoring of Aircrfat commerceRizk2

Business Automation Solution with Excel 1.1.pdfVivek Kedia

ACCOMPLISHMENT AS OF MAY 15 RCT ACCOMPLISHMENT AS OF MAY 15 RCT ACCOMPLISHMEN...JoemarAgbayani1

Feb 2021 Ransomware Recovery presentation.pptxenginsayin1

big data eco system fundamentals of data sciencearivukarasi

Loading Data into Snowflake (Bulk & Stream)Accentfuture

Discrete Logarithm Problem in Cryptography (1).pptxmeshablinx38

Blood pressure (3).pdfbdbsbsbhshshshhdhdhshshshernandezemma379

Datàaaaaaaaaaengineeeeeeeeeeeeeeeeeeeeeeejuadsr96

🧩 1. Solvent R-WPS Office work scientificNohaSalah45

SaleServicereport and SaleServicereport2251330007

Project_Update_Summary.for the use from PMOdysseas Lekatsas

TCU EVALUATION FACULTY TCU Taguig City 1st Semester 2017-2018MELJUN CORTES

Kafka Use Cases Real-World ApplicationsAccentfuture

Data Analytics using sparkabcdefghi.pptxKarkuzhaliS3

Generative AI Boost Data Governance and Quality- Tejasvi AddagadaTejasvi Addagada

Orchestrating Data Workloads With Airflow.pdfssuserae5511

Module-2_3-1eentzyssssssssssssssssssssss.pptxShahidHussain66691

Introduction to Segmentation in Computer vision

1. Semantic Segmentation

2. Hello! I am Frederick Apina Machine Learning Engineer @ParrotAI I am here because I love to give presentations. 2

3. “When I think about strong innovations in term of automation, cognitive computing, and artificial intelligence, they will be coming a lot from Tanzania as well.” 3

4. 1. What is semantic segmentation?

5. 5

6. 6 Limitations Still a bit rough since we’re only drawing bounding boxes and don’t really get an accurate idea of object shape.

7. 7 What if!?

8. 8 Semantic Segmentation Semantic Segmentation is to label each pixel of an image with a corresponding class of what is being represented. ✗ commonly referred to as dense prediction.

9. 2. Applications of Semantic Segmentation

10. 10 Autonomous Vehicles

11. 11 Medical Surgeries

12. 12 Medical Surgeries

13. 13 Medical Images Diagnostics

14. 3. Representing the Task

15. 15 Our goal is to take either a RGB color image or a grayscale image and output a segmentation map where each pixel contains a class label represented as an integer.

16. 16 We create our target by one-hot encoding the class labels - essentially creating an output channel for each of the possible classes.

17. 17 We can easily inspect a target by overlaying it onto the observation. When we overlay a single channel of our target (or prediction), we refer to this as a mask which illuminates the regions of an image where a specific class is present.

18. 3. Constructing an Architecture

19. A naive approach…

20. 20 ✗ Recall that for deep convolutional networks, earlier layers tend to learn low-level concepts while later layers develop more high-level (and specialized) feature mappings. In order to maintain expressiveness, we typically need to increase the number of feature maps (channels) as we get deeper in the network.

21. 21 Solution?

22. Lucky for us.. One popular approach for image segmentation models is to follow an encoder/decoder structure.

23. U-Net Architecture.. Consists of a contracting path to capture context and a symmetric expa nding path that enables precise localization.

24. Advanced U-Net variants The standard U-Net model consists of a series of convolution operations for each "block" in the architecture. Proposed: swap out the basic stacked convolution blocks in favor of residual blocks. This residual block introduces short skip connections (within the block) alongside the existing long skip connections (between the corresponding feature maps of encoder and decoder modules) found in the standard U-Net structure.

25. Tiramisu: Full Convolution DenseNet Tiramisu adopts the UNet design with downsampling, bottleneck, and upsampling paths and skip connections. It replaces convolution and max pooling layers with Dense blocks from the DenseNet architecture. Dense blocks contain residual connections.

26. Defining loss function The most commonly used loss function for the task of image segmentation is a pixel-wise cross entropy loss. This loss examines each pixel individually, comparing the class predictions (depth-wise pixel vector) to our one-hot encoded target vector.

27. Deep Learning is an continuously-growing and a relatively new concept, the vast amount of resources can be a touch overwhelming for those either looking to get into the field, or those already engraved in it. A good way of cooping is to get a good general knowledge of machine learning and then find a good structured path to follow (be a project or research). 27 Conclusion

28. 28 Thanks! Any questions? You can find me at: ✗ [email protected]