SlideShare a Scribd company logo
Unsupervised Correction of Training Labels
project taken under program
Presented by:
Charmi Chokshi
Final year BTech student of
Information and Communication Technology
Ahmedabad University
Duration: May 31, 2018 till July End
Guided by:
Mr. Pankaj Bodani (Scientist-SE)
Space Application Centre-ISRO
Outline
● Objectives
● Introduction
● Methodology
● Unsupervised Sementic Segmentation
Algorithms
● Discussion
● Region Growing Algorithm
● Conclusion
● Future work
● References
2
Objective
● To work on the preprocessing (Data Cleaning) step of the image
segmentation problem using Deep Learning
● To create accurate Training Labels as the input to Neural Network
● To investigate/compare the use of different unsupervised image
segmentation techniques for boundary correction
3
Introduction
4
What is Semantic Segmentation?
● In computer vision, semantic segmentation is the process of partitioning a
digital image into multiple meaningful segments
● Semantic segmentation is typically used to locate objects and boundaries in
images.
● More precisely, it is the process of assigning a label to every pixel in an image
such that pixels with the same label share certain characteristics
5
Application of Image Segmentation
● Driverless car
● Medical imaging
● Object detection
○ Face detection
○ Pedestrian detection
○ Brake light detection
○ Locate objects in satellite images (roads, forests, crops, water bodies, etc.)
● Recognition Tasks
● Traffic control systems
6
What is Unsupervised Sementic Segmentation?
● Unsupervised = Learning without the help of teacher!
● No Labeled Training Data available for the model to learn from
● The goal of this unsupervised machine learning technique is to find
similarities in the data point and group similar data points together which
will give us insight into underlying patterns of different groups
● “Clustering” is the process of grouping similar entities together
Supervised Segmentation
Input: Raw image, Labelled Image
Output: Segmented Image
Unsupervised Segmentation
Input: Raw image
Output: Segmented Image
7
Methodology
8
Methodology
Methodology consist of following 2 stages:
1. Unsupervised Sementic Segmentation using Clustering algorithms
2. Post-processing using Seed based Region Growing algorithm
Input Data
● Satellite: IRS Resourcesat–2
● Sensor: LISS-IV
● Spatial Resolution: 5.8 m
● No of Bands: 3
● Bit depth: 16 – 10 bit quantization (1024 different tones can be
assigned to a pixel)
● Size: More than 3000 X 3000
● Cities: Vadodara, Jabalpur, Sagar, Satana, Ujjain, Varanasi, Rampur 9
Input Images
Vadodara Varanasi Ujjain
10
Raw and Manually Labelled Images
11
Raw and Manually Labelled Images
12
Methodology (step-1 of 2)
13
DBSCAN
● DBSCAN: Density-based spatial clustering of
applications with noise
● Its clusters are defined as areas of higher density than
the remainder of the data set
● It clusters water bodies accurately in our dataset
● DBSCAN(eps=3.5, min_samples=5, metric=’euclidean’, n_jobs=1)
● eps: The maximum distance between two samples for them to be
considered as in the same neighborhood
● min_samples: The number of samples in a neighborhood for a
point to be considered as a core point
14
K-means
● k-means clustering aims to partition n observations into k clusters in which
each observation belongs to the cluster with the nearest mean, serving as a
prototype of the cluster
● KMeans(n_clusters=3, n_init=10, n_jobs=1)
● n_init: Number of time the k-means algorithm will be run with different centroid seeds
15
K-means Results
16
K=3 K=10
Mean Shift
17
Intuitive Description
18
Intuitive Description
neighboring
pixels are
determined
19
Intuitive Description
new color
mean value
is calculated
20
Intuitive Description
move that
window to
the area of
maximum
pixel
density
21
Intuitive Description
new
center for
the next
iteration
22
Intuitive Description
iterate until
the spatial
and the
color mean
stops
changing
23
Mean Shift
● Mean Shift: Finding modes in a set of data samples, manifesting an underlying
probability density function (PDF) in RN
● It is a procedure for locating the maxima of a density function given discrete
data sampled from that function
● Thus, it is using a non-parametric density gradient estimation
24
Non-Parametric Density Estimation
25
Non-Parametric Density Estimation
26
Non-Parametric Density Estimation
● for each channel, separate colour histograms will be generated
27
Mean Shift Results
Down Sampled by:
0 1 2 3
28
N-Cut (Graph-Cut)
● Given an image or image sequence, set up
a undirected weighted graph G = (V; E)
● The nodes are pixels
● The weight on the edge connecting two
nodes is the measure of the similarity
between the two nodes in terms of colour,
texture etc.
● The objective of normalized partitioning is
to optimize the cut value
29
Watershed
● The watershed transformation treats the
image it operates upon like a topographic
map
● Intuitively, a drop of water falling on a
topographic relief flows towards the
"nearest" minimum
● The "nearest" minimum is that minimum
which lies at the end of the path of steepest
descent
30
Discussion
● DBSCAN: clustered only water bodies
● K-mean: loss of data
● Mean shift: overall good result
● N-cut: poorest result (time complexity is
too high)
● Watershed: not good for this dataset
Reference [1]
31
Methodology (step-2 of 2)
32
Region Growing Algorithm
33
Region Growing
● Group pixels or sub-regions into
larger regions when homogeneity
criterion is satisfied (assumed
criterion is a range of pixel value in all
3 bands)
● Region grows around the seed point
based on similar properties (grey level,
texture, color) (9 seed points have
been assumed)
● It is better in noisy image where edges
are hard to identify
34
Region Growing Video
35
Output of RG on Mean Shift & Raw Image
As of now, step 2 has directly been performed on raw image. But after tweaking the parameters, output
of mean shift can be used
36
Results
Manually
Labelled
Input
Output
Of RG
Vadodara City Jabalpur CitySagar City 37
Results
Manually Labelled Input Output
Sagar City
Jabalpur City
38
Results
Manually Labelled Input Output
Jabalpur City
Jabalpur City
39
Conclusion
● As of now, RGB Threshold of 14 and B&W
pixel voting of 50% applied on RAW
image without preprocessing gives best
result (based on subjective assessment)
● I will now tune parameters for
unsupervised segmentation and try to
arrive at results which are better than
using RAW image
40
Future Work
● Twiking of Hyperparameters such as RGB Threshold value and Voting value
of B&W pixels for better accuracy
● Implementation of Progressive Thresholding technique to improve voting
Labelled
Input Output
Region
Growing
Voting Value
(Fix 50%)
RGB
Threshold
(10,15,20)
Feedback output image as
new labelled image
41
References
[1] Xia, Xide, and Brian Kulis. "W-Net: A Deep Model
for Fully Unsupervised Image Segmentation." arXiv
preprint arXiv:1711.08506 (2017).
[2] Shi, Jianbo, and Jitendra Malik. "Normalized cuts
and image segmentation." IEEE Transactions on
pattern analysis and machine intelligence 22.8
(2000): 888-905.
[3] Zhou, Yong-mei, Sheng-yi Jiang, and Mei-lin Yin.
"A region-based image segmentation method with
mean-shift clustering algorithm." Fuzzy Systems and
Knowledge Discovery, 2008. FSKD'08. Fifth
International Conference on. Vol. 2. IEEE, 2008.
42
Thank you!

More Related Content

PDF
Remotely Sensed Image (RSI) Analysis for feature extraction using Color map I...
DOCX
Ijdmta v1i1
PPTX
Cahall Final Intern Presentation
PDF
C1104011322
PDF
A binarization technique for extraction of devanagari text from camera based ...
PDF
Dk34681688
PDF
G04544346
PDF
A New Method for Indoor-outdoor Image Classification Using Color Correlated T...
Remotely Sensed Image (RSI) Analysis for feature extraction using Color map I...
Ijdmta v1i1
Cahall Final Intern Presentation
C1104011322
A binarization technique for extraction of devanagari text from camera based ...
Dk34681688
G04544346
A New Method for Indoor-outdoor Image Classification Using Color Correlated T...

What's hot (20)

PDF
Sign Language Recognition Using Image Processing For Mute People
PDF
PDE BASED FEATURES FOR TEXTURE ANALYSIS USING WAVELET TRANSFORM
PDF
Cj36511514
PDF
A Novel PSNR-B Approach for Evaluating the Quality of De-blocked Images
PDF
Reduced-reference Video Quality Metric Using Spatial Information in Salient R...
PDF
Jc3515691575
PDF
Research Inventy : International Journal of Engineering and Science
PDF
Av4301248253
PDF
improving differently illuminant images with fuzzy membership based saturatio...
PDF
Automatic dominant region segmentation for natural images
PDF
Wavelet-Based Color Histogram on Content-Based Image Retrieval
PDF
Parking detection system using background subtraction and HSV color segmentation
PPT
Image inpainting
PDF
IRJET- Crowd Density Estimation using Image Processing
PDF
Dissertation synopsis for imagedenoising(noise reduction )using non local me...
PDF
H017416670
PDF
A Review on Image Inpainting to Restore Image
PDF
Sample Paper Techscribe
PDF
Fuzzy Region Merging Using Fuzzy Similarity Measurement on Image Segmentation
PDF
An improved hdr image processing using fast global tone mapping
Sign Language Recognition Using Image Processing For Mute People
PDE BASED FEATURES FOR TEXTURE ANALYSIS USING WAVELET TRANSFORM
Cj36511514
A Novel PSNR-B Approach for Evaluating the Quality of De-blocked Images
Reduced-reference Video Quality Metric Using Spatial Information in Salient R...
Jc3515691575
Research Inventy : International Journal of Engineering and Science
Av4301248253
improving differently illuminant images with fuzzy membership based saturatio...
Automatic dominant region segmentation for natural images
Wavelet-Based Color Histogram on Content-Based Image Retrieval
Parking detection system using background subtraction and HSV color segmentation
Image inpainting
IRJET- Crowd Density Estimation using Image Processing
Dissertation synopsis for imagedenoising(noise reduction )using non local me...
H017416670
A Review on Image Inpainting to Restore Image
Sample Paper Techscribe
Fuzzy Region Merging Using Fuzzy Similarity Measurement on Image Segmentation
An improved hdr image processing using fast global tone mapping
Ad

Similar to Charmi chokshi ppt (20)

PPTX
Semantic Segmentation on Satellite Imagery
PPTX
FastV2C-HandNet - ICICC 2020
PDF
International Journal of Computational Engineering Research(IJCER)
PPTX
Plant Disease Detection.pptx
PDF
Comparative study of two methods for Handwritten Devanagari Numeral Recognition
PDF
IRJET - Change Detection in Satellite Images using Convolutional Neural N...
PDF
An improved hdr image processing using fast global
PDF
IRJET- A Review on Plant Disease Detection using Image Processing
PDF
Detecting Boundaries for Image Segmentation and Object Recognition
PDF
IRJET- Universal Currency Identifier
PDF
A version of watershed algorithm for color image segmentation
PPTX
[NS][Lab_Seminar_240611]Graph R-CNN.pptx
PDF
Comparative Study of Spatial Domain Image Steganography Techniques
PDF
1388586134 10545195
PDF
Analysis and Implementation Image Segmentation Through k-mean Algorithm with ...
PDF
A Robust Image Watermarking Technique using Luminance Based Area Selection an...
PDF
Bangla Optical Digits Recognition using Edge Detection Method
PDF
A STUDY AND ANALYSIS OF DIFFERENT EDGE DETECTION TECHNIQUES
PDF
Adaptive CSLBP compressed image hashing
PDF
A PROJECT REPORT ON REMOVAL OF UNNECESSARY OBJECTS FROM PHOTOS USING MASKING
Semantic Segmentation on Satellite Imagery
FastV2C-HandNet - ICICC 2020
International Journal of Computational Engineering Research(IJCER)
Plant Disease Detection.pptx
Comparative study of two methods for Handwritten Devanagari Numeral Recognition
IRJET - Change Detection in Satellite Images using Convolutional Neural N...
An improved hdr image processing using fast global
IRJET- A Review on Plant Disease Detection using Image Processing
Detecting Boundaries for Image Segmentation and Object Recognition
IRJET- Universal Currency Identifier
A version of watershed algorithm for color image segmentation
[NS][Lab_Seminar_240611]Graph R-CNN.pptx
Comparative Study of Spatial Domain Image Steganography Techniques
1388586134 10545195
Analysis and Implementation Image Segmentation Through k-mean Algorithm with ...
A Robust Image Watermarking Technique using Luminance Based Area Selection an...
Bangla Optical Digits Recognition using Edge Detection Method
A STUDY AND ANALYSIS OF DIFFERENT EDGE DETECTION TECHNIQUES
Adaptive CSLBP compressed image hashing
A PROJECT REPORT ON REMOVAL OF UNNECESSARY OBJECTS FROM PHOTOS USING MASKING
Ad

More from Charmi Chokshi (14)

PPTX
Deep learning with tensorflow
PDF
From ELIZA to Alexa and Beyond
PDF
Report smart dustbin
PDF
Os prj ppt
PDF
magazine
PDF
PPTX
Trade relations US & India; the changing faces
PPTX
Public vs private sector banks in india
PPTX
Market analysis of pencil
PPTX
Indian agriculture sector, Issues Related to Minimum Support Price, Subsidies...
PPTX
Face recognition system using Hidden Markov Model
PPTX
Educational Resource Management System
PPTX
Dictionary implementation using TRIE
PPTX
8 bit mips-processor
Deep learning with tensorflow
From ELIZA to Alexa and Beyond
Report smart dustbin
Os prj ppt
magazine
Trade relations US & India; the changing faces
Public vs private sector banks in india
Market analysis of pencil
Indian agriculture sector, Issues Related to Minimum Support Price, Subsidies...
Face recognition system using Hidden Markov Model
Educational Resource Management System
Dictionary implementation using TRIE
8 bit mips-processor

Recently uploaded (20)

PDF
A SYSTEMATIC REVIEW OF APPLICATIONS IN FRAUD DETECTION
PPTX
Current and future trends in Computer Vision.pptx
PPTX
UNIT-1 - COAL BASED THERMAL POWER PLANTS
PPTX
FINAL REVIEW FOR COPD DIANOSIS FOR PULMONARY DISEASE.pptx
PDF
Enhancing Cyber Defense Against Zero-Day Attacks using Ensemble Neural Networks
PPTX
Engineering Ethics, Safety and Environment [Autosaved] (1).pptx
PPTX
Geodesy 1.pptx...............................................
PPTX
Sustainable Sites - Green Building Construction
DOCX
573137875-Attendance-Management-System-original
PDF
737-MAX_SRG.pdf student reference guides
PDF
null (2) bgfbg bfgb bfgb fbfg bfbgf b.pdf
PPTX
MET 305 2019 SCHEME MODULE 2 COMPLETE.pptx
PPTX
Fundamentals of safety and accident prevention -final (1).pptx
PDF
BMEC211 - INTRODUCTION TO MECHATRONICS-1.pdf
PDF
PREDICTION OF DIABETES FROM ELECTRONIC HEALTH RECORDS
PPTX
6ME3A-Unit-II-Sensors and Actuators_Handouts.pptx
PDF
Artificial Superintelligence (ASI) Alliance Vision Paper.pdf
PDF
R24 SURVEYING LAB MANUAL for civil enggi
PPTX
M Tech Sem 1 Civil Engineering Environmental Sciences.pptx
PDF
Embodied AI: Ushering in the Next Era of Intelligent Systems
A SYSTEMATIC REVIEW OF APPLICATIONS IN FRAUD DETECTION
Current and future trends in Computer Vision.pptx
UNIT-1 - COAL BASED THERMAL POWER PLANTS
FINAL REVIEW FOR COPD DIANOSIS FOR PULMONARY DISEASE.pptx
Enhancing Cyber Defense Against Zero-Day Attacks using Ensemble Neural Networks
Engineering Ethics, Safety and Environment [Autosaved] (1).pptx
Geodesy 1.pptx...............................................
Sustainable Sites - Green Building Construction
573137875-Attendance-Management-System-original
737-MAX_SRG.pdf student reference guides
null (2) bgfbg bfgb bfgb fbfg bfbgf b.pdf
MET 305 2019 SCHEME MODULE 2 COMPLETE.pptx
Fundamentals of safety and accident prevention -final (1).pptx
BMEC211 - INTRODUCTION TO MECHATRONICS-1.pdf
PREDICTION OF DIABETES FROM ELECTRONIC HEALTH RECORDS
6ME3A-Unit-II-Sensors and Actuators_Handouts.pptx
Artificial Superintelligence (ASI) Alliance Vision Paper.pdf
R24 SURVEYING LAB MANUAL for civil enggi
M Tech Sem 1 Civil Engineering Environmental Sciences.pptx
Embodied AI: Ushering in the Next Era of Intelligent Systems

Charmi chokshi ppt

  • 1. Unsupervised Correction of Training Labels project taken under program Presented by: Charmi Chokshi Final year BTech student of Information and Communication Technology Ahmedabad University Duration: May 31, 2018 till July End Guided by: Mr. Pankaj Bodani (Scientist-SE) Space Application Centre-ISRO
  • 2. Outline ● Objectives ● Introduction ● Methodology ● Unsupervised Sementic Segmentation Algorithms ● Discussion ● Region Growing Algorithm ● Conclusion ● Future work ● References 2
  • 3. Objective ● To work on the preprocessing (Data Cleaning) step of the image segmentation problem using Deep Learning ● To create accurate Training Labels as the input to Neural Network ● To investigate/compare the use of different unsupervised image segmentation techniques for boundary correction 3
  • 5. What is Semantic Segmentation? ● In computer vision, semantic segmentation is the process of partitioning a digital image into multiple meaningful segments ● Semantic segmentation is typically used to locate objects and boundaries in images. ● More precisely, it is the process of assigning a label to every pixel in an image such that pixels with the same label share certain characteristics 5
  • 6. Application of Image Segmentation ● Driverless car ● Medical imaging ● Object detection ○ Face detection ○ Pedestrian detection ○ Brake light detection ○ Locate objects in satellite images (roads, forests, crops, water bodies, etc.) ● Recognition Tasks ● Traffic control systems 6
  • 7. What is Unsupervised Sementic Segmentation? ● Unsupervised = Learning without the help of teacher! ● No Labeled Training Data available for the model to learn from ● The goal of this unsupervised machine learning technique is to find similarities in the data point and group similar data points together which will give us insight into underlying patterns of different groups ● “Clustering” is the process of grouping similar entities together Supervised Segmentation Input: Raw image, Labelled Image Output: Segmented Image Unsupervised Segmentation Input: Raw image Output: Segmented Image 7
  • 9. Methodology Methodology consist of following 2 stages: 1. Unsupervised Sementic Segmentation using Clustering algorithms 2. Post-processing using Seed based Region Growing algorithm Input Data ● Satellite: IRS Resourcesat–2 ● Sensor: LISS-IV ● Spatial Resolution: 5.8 m ● No of Bands: 3 ● Bit depth: 16 – 10 bit quantization (1024 different tones can be assigned to a pixel) ● Size: More than 3000 X 3000 ● Cities: Vadodara, Jabalpur, Sagar, Satana, Ujjain, Varanasi, Rampur 9
  • 11. Raw and Manually Labelled Images 11
  • 12. Raw and Manually Labelled Images 12
  • 14. DBSCAN ● DBSCAN: Density-based spatial clustering of applications with noise ● Its clusters are defined as areas of higher density than the remainder of the data set ● It clusters water bodies accurately in our dataset ● DBSCAN(eps=3.5, min_samples=5, metric=’euclidean’, n_jobs=1) ● eps: The maximum distance between two samples for them to be considered as in the same neighborhood ● min_samples: The number of samples in a neighborhood for a point to be considered as a core point 14
  • 15. K-means ● k-means clustering aims to partition n observations into k clusters in which each observation belongs to the cluster with the nearest mean, serving as a prototype of the cluster ● KMeans(n_clusters=3, n_init=10, n_jobs=1) ● n_init: Number of time the k-means algorithm will be run with different centroid seeds 15
  • 20. Intuitive Description new color mean value is calculated 20
  • 21. Intuitive Description move that window to the area of maximum pixel density 21
  • 23. Intuitive Description iterate until the spatial and the color mean stops changing 23
  • 24. Mean Shift ● Mean Shift: Finding modes in a set of data samples, manifesting an underlying probability density function (PDF) in RN ● It is a procedure for locating the maxima of a density function given discrete data sampled from that function ● Thus, it is using a non-parametric density gradient estimation 24
  • 27. Non-Parametric Density Estimation ● for each channel, separate colour histograms will be generated 27
  • 28. Mean Shift Results Down Sampled by: 0 1 2 3 28
  • 29. N-Cut (Graph-Cut) ● Given an image or image sequence, set up a undirected weighted graph G = (V; E) ● The nodes are pixels ● The weight on the edge connecting two nodes is the measure of the similarity between the two nodes in terms of colour, texture etc. ● The objective of normalized partitioning is to optimize the cut value 29
  • 30. Watershed ● The watershed transformation treats the image it operates upon like a topographic map ● Intuitively, a drop of water falling on a topographic relief flows towards the "nearest" minimum ● The "nearest" minimum is that minimum which lies at the end of the path of steepest descent 30
  • 31. Discussion ● DBSCAN: clustered only water bodies ● K-mean: loss of data ● Mean shift: overall good result ● N-cut: poorest result (time complexity is too high) ● Watershed: not good for this dataset Reference [1] 31
  • 34. Region Growing ● Group pixels or sub-regions into larger regions when homogeneity criterion is satisfied (assumed criterion is a range of pixel value in all 3 bands) ● Region grows around the seed point based on similar properties (grey level, texture, color) (9 seed points have been assumed) ● It is better in noisy image where edges are hard to identify 34
  • 36. Output of RG on Mean Shift & Raw Image As of now, step 2 has directly been performed on raw image. But after tweaking the parameters, output of mean shift can be used 36
  • 38. Results Manually Labelled Input Output Sagar City Jabalpur City 38
  • 39. Results Manually Labelled Input Output Jabalpur City Jabalpur City 39
  • 40. Conclusion ● As of now, RGB Threshold of 14 and B&W pixel voting of 50% applied on RAW image without preprocessing gives best result (based on subjective assessment) ● I will now tune parameters for unsupervised segmentation and try to arrive at results which are better than using RAW image 40
  • 41. Future Work ● Twiking of Hyperparameters such as RGB Threshold value and Voting value of B&W pixels for better accuracy ● Implementation of Progressive Thresholding technique to improve voting Labelled Input Output Region Growing Voting Value (Fix 50%) RGB Threshold (10,15,20) Feedback output image as new labelled image 41
  • 42. References [1] Xia, Xide, and Brian Kulis. "W-Net: A Deep Model for Fully Unsupervised Image Segmentation." arXiv preprint arXiv:1711.08506 (2017). [2] Shi, Jianbo, and Jitendra Malik. "Normalized cuts and image segmentation." IEEE Transactions on pattern analysis and machine intelligence 22.8 (2000): 888-905. [3] Zhou, Yong-mei, Sheng-yi Jiang, and Mei-lin Yin. "A region-based image segmentation method with mean-shift clustering algorithm." Fuzzy Systems and Knowledge Discovery, 2008. FSKD'08. Fifth International Conference on. Vol. 2. IEEE, 2008. 42