Semantic segmentation with Convolutional Neural Network Approaches

Presenter : Aydin Ayanzadeh
Email: Ayanzadeh17@itu.edu.tr
Computer vision-Dr.-Ing. Hazım Kemal EKENEL, Spring 2018

Agenda
- Introduction
- Project description
- Related work
- Fine-tuning the model
- Experimental results
- Demo of work
- WBS
- Gantt chart

Introduction
● Microsoft Common Objects in Context (COCO)
○ 82,783 images for training, 40,504 for validation
● VOC 2012, PASCAL-Context, PASCAL, Cityscapes, etc.
○ Instance semantic segmentation
○ Semantic segmentation
[1] http://cocodataset.org/#home

Scene Segmentation
Importance of Semantic Segmentation
● Autonomous driving
● Medical imaging

Mask R-CNN
● State-of-the-art multi-task model for visual scene understanding:
○ object detection
○ classification
○ instance segmentation
● Highly modular and easy to train
● Extends Faster R-CNN to pixel-level segmentation
● Based on Faster R-CNN + mask branch, RoIAlign

Mask R-CNN
● Proposals are evaluated by Intersection over Union (IoU) with the ground-truth boxes:
○ the best regions are kept as positive examples
○ the worst (IoU < 0.3) are treated as negatives for training
https://www.pyimagesearch.com/2016/11/07/intersection-over-union-iou-for-object-detection/
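
The proposal evaluation above can be sketched in a few lines of Python. This is a minimal illustration, not the Mask R-CNN implementation: the function names, the (x1, y1, x2, y2) box format, and the positive threshold of 0.7 are assumptions made for the example (the slide only states the negative threshold, IoU < 0.3).

```python
def box_iou(a, b):
    """IoU of two boxes given as (x1, y1, x2, y2) with x1 < x2 and y1 < y2."""
    # Corners of the intersection rectangle.
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    # Width and height are clamped at zero when the boxes do not overlap.
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def label_proposal(proposal, gt_boxes, pos_thresh=0.7, neg_thresh=0.3):
    """Label a proposal by its best IoU with any ground-truth box.

    pos_thresh is an assumed value; neg_thresh follows the slide (IoU < 0.3).
    """
    best = max((box_iou(proposal, gt) for gt in gt_boxes), default=0.0)
    if best >= pos_thresh:
        return "positive"
    if best < neg_thresh:
        return "negative"
    return "ignored"  # in between: used neither as positive nor as negative
```

For example, `box_iou((0, 0, 2, 2), (1, 1, 3, 3))` gives 1/7, because the boxes share 1 unit of area out of a union of 7.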

Requirements for training
● 8 GPUs (mini-batch of 8 images, one per GPU)
● The model is trained for 24k iterations
● Training takes about 2 days on a single 8-GPU machine (about 16 days with one GPU)
● Nvidia Tesla M40 GPU (without additional features)
● It can run at 3 fps at test time

Fine-Tuning the model
● Truncate the last layer (the softmax layer) of the pre-trained network
● Replace it with a new softmax layer that is relevant to our own problem
Pros and cons of the tuned model
● Disadvantage: segmentation accuracy is lower than that of the original model
● Advantage: it is much faster
https://www.slideshare.net/AndrKarpitenko/practical-deep-learning
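
The head replacement described on this slide can be illustrated with a small framework-free sketch. Everything here is hypothetical: the model is represented as a plain list of layer dictionaries, `replace_head` is a made-up helper, and freezing the remaining layers is an assumption (the slide does not say which layers are retrained).

```python
import random


def replace_head(model, num_new_classes, seed=0):
    """Return a new model whose last layer is re-initialised for a new task.

    `model` is a list of dicts with keys "name", "weights" (rows = outputs,
    columns = inputs) and "trainable". The input list is not modified.
    """
    rng = random.Random(seed)
    *body, old_head = model
    # The new head must accept the same number of inputs as the old one.
    in_features = len(old_head["weights"][0])
    new_head = {
        "name": "new_softmax",
        "weights": [
            [rng.gauss(0.0, 0.01) for _ in range(in_features)]
            for _ in range(num_new_classes)
        ],
        "trainable": True,
    }
    # Assumption: keep the pre-trained body fixed and train only the new head.
    frozen_body = [dict(layer, trainable=False) for layer in body]
    return frozen_body + [new_head]


# Toy "pre-trained" network: 4 inputs -> 3 features -> 80 classes.
pretrained = [
    {"name": "features", "weights": [[0.1] * 4 for _ in range(3)], "trainable": True},
    {"name": "softmax", "weights": [[0.2] * 3 for _ in range(80)], "trainable": True},
]
tuned = replace_head(pretrained, num_new_classes=2)
```

Training only a small new head is one plausible reason fine-tuning is much faster than training the full model, at some cost in accuracy.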

Dataset Analysis
● Mask R-CNN performs detection, classification, and instance segmentation
● Based on Faster R-CNN + mask branch, RoIAlign
● State-of-the-art detection and instance segmentation on MS COCO and Cityscapes

DeepLab
● DeepLab v1
● DeepLab v2
● DeepLab v3

Convolution
Dilated Convolution
https://towardsdatascience.com/types-of-convolutions-in-deep-learning-717013397f4d

Atrous Convolution
● A small field of view gives accurate localization
● A large field of view assimilates more context
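
To make the field-of-view trade-off concrete, here is a minimal 1-D sketch of atrous (dilated) convolution in plain Python. The function names are illustrative and not taken from DeepLab; the sketch uses "valid" boundaries with no padding, and, as is usual in deep learning, it does not flip the kernel.

```python
def field_of_view(kernel_size, rate):
    """Number of input positions spanned by a kernel with the given dilation rate."""
    return (kernel_size - 1) * rate + 1


def dilated_conv1d(signal, kernel, rate=1):
    """1-D convolution whose kernel taps are spaced `rate` samples apart.

    rate=1 is an ordinary convolution; a larger rate widens the field of
    view without adding any weights.
    """
    span = field_of_view(len(kernel), rate)
    out = []
    for start in range(len(signal) - span + 1):
        out.append(sum(w * signal[start + i * rate] for i, w in enumerate(kernel)))
    return out


signal = [1, 2, 3, 4, 5, 6, 7]
kernel = [1, 1, 1]
dense = dilated_conv1d(signal, kernel, rate=1)   # each output sees 3 inputs
atrous = dilated_conv1d(signal, kernel, rate=2)  # each output sees a span of 5
```

With the same three weights, rate 2 covers a span of 5 samples instead of 3, which is how atrous convolution gathers more context at no extra parameter cost.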

Experimental Results
● Complicated images

Evaluation Metric
● Pixel Accuracy (PA)
● Mean Pixel Accuracy (MPA)
● Mean Intersection over Union (MIoU)
● Frequency Weighted Intersection over Union (FWIoU)
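
The four metrics listed above are commonly computed from a confusion matrix. The sketch below follows the usual definitions under stated assumptions: `conf[i][j]` counts pixels of true class `i` predicted as class `j`, averages are taken only over classes that occur in the ground truth, and the function names are illustrative.

```python
def confusion_matrix(truth, pred, num_classes):
    """Count pixels of each (true class, predicted class) pair from flat label lists."""
    conf = [[0] * num_classes for _ in range(num_classes)]
    for t, p in zip(truth, pred):
        conf[t][p] += 1
    return conf


def segmentation_metrics(conf):
    """Return PA, MPA, MIoU and FWIoU for a non-empty confusion matrix."""
    k = len(conf)
    total = sum(sum(row) for row in conf)
    true_px = [sum(conf[i]) for i in range(k)]                       # pixels per true class
    pred_px = [sum(conf[j][i] for j in range(k)) for i in range(k)]  # pixels per predicted class
    correct = [conf[i][i] for i in range(k)]
    # Assumption: skip classes absent from the ground truth to avoid 0/0.
    present = [i for i in range(k) if true_px[i] > 0]
    iou = {i: correct[i] / (true_px[i] + pred_px[i] - correct[i]) for i in present}
    return {
        "PA": sum(correct) / total,
        "MPA": sum(correct[i] / true_px[i] for i in present) / len(present),
        "MIoU": sum(iou.values()) / len(present),
        "FWIoU": sum(true_px[i] * iou[i] for i in present) / total,
    }


# Four pixels, two classes: one pixel of class 0 is mislabelled as class 1.
conf = confusion_matrix([0, 0, 1, 1], [0, 1, 1, 1], num_classes=2)
metrics = segmentation_metrics(conf)
```

In this toy case PA and MPA are both 0.75, while MIoU and FWIoU are both 7/12, because IoU also penalises the false positive on class 1.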

Future work
● Real-time Segmentation
● Face Segmentation
Discussion
● Complicated images
● Image quality
● Dataset size

WBS
[Figure: work breakdown structure chart of the project. Box labels: Project; Project implementation; Datasets; Collecting the datasets; Organizing and categorizing datasets; Preliminary steps of project; Research among state-of-the-art methods; Finding nominal methods; Building the proposed approach; Analyzing the performance of approaches; Extended steps of project; Implement additional techniques; Milestone of extended step of project]

