SlideShare a Scribd company logo
SCENE TEXT RECOGNITION IN MOBILE 
APPLICATION BY CHARACTER 
DESCRIPTOR AND STRUCTURE 
COMFIGURATION 
CHERIYAN K M
INTRODUCING….. 
 Valuable information form an image. 
 To extract an information. 
 Automatic and Effective scene text detection. 
 Recognition algorithm. 
 Factors affecting on extraction. 
 Cluttered background. 
 Difference in text pattern. 
 Difficult to model the structure of character. 
 Lake of discriminative pixel level appearance. 
 Structure features from non-text background outliers. 
 Different word , may diff. characters , in various fonts , 
styles and size.
 Two activities; 
 Text detection. 
 Localize the image region containing the text characters. 
 Based on 
 Color uniformity and 
 Horizontal alignment of text char. 
 Text recognition. 
 Transform pixel-based text into reliable codes. 
 Distinguish diff. text characters , Properly compose the text 
word. 
 62 identity category of text characters. 
9 (0-9) 
26 (a-z) 
26 (A-Z) 
 Two schemes; 
 Character recognizer to predict the category of text char. 
 Binary character classifier to predict the existence of ctgry.
RELATED WORKS 
 Optical Character Recognizer (OCR) system. 
 Many algorithms are proposed; 
 Weinmen:- combined the Gabor-based appearance 
model. 
 Neumann:- based on extremal region. 
 Smith:- based on SIFT. 
 Mishra:- adopted conditional random field. 
 Lu:- modeled the inner character structure. 
 Coates:- extracted local features of character patches.
SCENE TEXT RECOGNITION IN MOBILE APPLICATION BY CHARACTER DESCRIPTOR AND STRUCTURE CONFIGURATION
LAYOUT BASED SCENE TEXT DETECTION 
 A text; 
 Instruction 
 Identifier 
 Uniform color 
 Aligned arrangement 
Two processes are employed to complete layout 
analysis 
1. Color Decomposition 
2. Horizontal Alignment 
Improved to compatible with mobile app
LAYOUT ANALYSIS OF COLOR 
DECOMPOSITION 
 Boundary clustering algorithm base on bigram color 
uniformity. 
 Group pixels of same color into a layer. 
 Character boundary boarder b/w txt and bg.(color 
pair) 
 Create a vector of color pair (txt and bg).
SCENE TEXT RECOGNITION IN MOBILE APPLICATION BY CHARACTER DESCRIPTOR AND STRUCTURE CONFIGURATION
LAYOUT ANALYSIS OF HORIZONTAL 
ALIGNMENT 
Text information(string) 
Several 
character 
members 
In similar 
size 
Approximately 
horizontal 
alignment 
The geometrical properties to detect the existence of text characters
 Adjacent character grouping algorithm 
Bounding box>siblings>similar size & vertical location>merge
 For non-horizontal strings-> ± /6 degree set as 
range.
STRUCTURE BASED SCENE TEXT 
RECOGNITION 
 To extract text information. 
 Binary classification problem. 
 Character classes(Queried characters). 
 Binary classifier:- to distinguish character class 
from other classes or bg outliers. 
 Eg: Character class A predict patches containing A as 
positive. And other as negative. 
 Two activities; 
1. Character descriptor. 
2. Stroke configuration.
CHARACTER DESCRIPTOR 
 Extract structure features. 
 4 different key points features; 
1. Harris Detector:- To extract Key points from corner and 
junction. 
2. MSER Detector:-To extract Key point from stroke 
component. 
3. Dense Detector:- To extract Key point uniformly. 
4. Random Detector:- To extract the preset number of 
Key points in a random pattern.
SCENE TEXT RECOGNITION IN MOBILE APPLICATION BY CHARACTER DESCRIPTOR AND STRUCTURE CONFIGURATION
Flowchart of our proposed character descriptor 
HOG:-features are Calculated as observed feature vector x. 
(Histogram of Oriented Gradient) 
•Selected as local feature descriptor( compatibility 
with all 4 key point detectors).
 SIFT and SURF are not employed 
 Normalization of character patches(128x128). 
 Feature Quantization: to aggregate the extracted 
features 
 Bag-of-Words(BOW) Medel:- Applied to key points from 
all 4 feature detector. 
 Gaussian Mixture Model(GMM):Applied to key points 
from DD & RD.(fixed number and location of key point) 
 Now mapping both into characteristic Histogram as 
feature representation. 
 Cascading BOW and GMM-based feature repr. ,we 
get Character Descriptor.
CHARACTER STROKE CONFIGURATION 
 Stroke:- Region bounded by two parallel boundary 
segments. 
 Stroke width 
 Stroke orientation 
 Characters are connected strokes with 
configuration. 
 Structure Map of Strokes is stroke configuration.( 
is consistant) 
Eg: B have 1 vertical stroke 
2 arc strokes. 
B
 Synthesized character generator: Estimate stroke 
configuration from computer s/w.(Provide accurate 
skeleton and boundary) 
 Synthetic font training dataset(20000 are selected 
out off 67400 character patches) 
 Contain 62 class of characters(128x128 pixel) 
 Compose Stroke Configuration 
Step1 
 Discrete Contour Evaluation(DCE):obtain boundary and 
skeleton. Skeleton pruning on the basis of DCE. 
 DCE simplifies the character(using polygon and small 
no. of vertices) 
 DCE and Skeleton pruning are invariant to deformation 
and scaling.
Step2 
 Estimate stroke width and orientation 
 Width: length along normal 
 Orientation: tangent 
 Sampling from character boundary 
 128 samples. 
 So that no. of samples = length 
 Estimating 
 Taking two neighboring sample point to fit a line. 
 Approximately collinear. 
 A quadratic curve.
Step3 
 Calculate Skeleton-based stroke map 
 Consistency of stroke width and orientation. 
Width no larger than 3 Orientation no larger than /8 
 Construct stroke section: If sample point satisfying 
the stroke related features. 
 Construct junction sections: If they are not. 
 Skeleton points are extracted.
SCENE TEXT RECOGNITION IN MOBILE APPLICATION BY CHARACTER DESCRIPTOR AND STRUCTURE CONFIGURATION
STROKE ALIGNMENT METHOD 
 To handle various fonts, styles …..etc 
 Mean value of all stroke configuration. 
 Mean value,
 D=Distance b/w stroke configurations of two 
samples 
 S=Mean value of stroke configurations. 
 Ti=Transformations applied on strokes of i-th stroke 
configuration. 
 g(Ti)=Amplitude of the transformation.
DEMO SYSTEM
THANK YOU

More Related Content

PPTX
Text detection and recognition from natural scenes
PPTX
Detecting text from natural images with Stroke Width Transform
PDF
Text Detection Strategies
PPTX
Text Detection From Image
PPTX
Text extraction from images
PPTX
Image to text Converter
PPTX
Text extraction from natural scene image, a survey
PDF
Optical Character Recognition
Text detection and recognition from natural scenes
Detecting text from natural images with Stroke Width Transform
Text Detection Strategies
Text Detection From Image
Text extraction from images
Image to text Converter
Text extraction from natural scene image, a survey
Optical Character Recognition

What's hot (20)

PDF
C04741319
PPTX
Text Detection and Recognition
PDF
Text Extraction from Image using Python
PDF
IRJET- Devnagari Text Detection
PDF
Scene text recognition in mobile applications by character descriptor and str...
PDF
Self-Directing Text Detection and Removal from Images with Smoothing
PPTX
Texture features based text extraction from images using DWT and K-means clus...
PDF
Text Extraction System by Eliminating Non-Text Regions
PDF
Optical Character Recognition
PDF
Ay32333339
PDF
International Journal of Engineering Research and Development
PDF
A NOVEL FEATURE SET FOR RECOGNITION OF SIMILAR SHAPED HANDWRITTEN HINDI CHARA...
PPTX
Handwritten and Machine Printed Text Separation in Document Images using the ...
PDF
K2 Algorithm-based Text Detection with An Adaptive Classifier Threshold
PDF
F045053236
PDF
A Survey Paper on Character Recognition
PDF
IRJET- Object Detection using Hausdorff Distance
PDF
SEGMENTATION OF CHARACTERS WITHOUT MODIFIERS FROM A PRINTED BANGLA TEXT
PDF
A MULTI-STREAM HMM APPROACH TO OFFLINE HANDWRITTEN ARABIC WORD RECOGNITION
PDF
A MULTI-STREAM HMM APPROACH TO OFFLINE HANDWRITTEN ARABIC WORD RECOGNITION
C04741319
Text Detection and Recognition
Text Extraction from Image using Python
IRJET- Devnagari Text Detection
Scene text recognition in mobile applications by character descriptor and str...
Self-Directing Text Detection and Removal from Images with Smoothing
Texture features based text extraction from images using DWT and K-means clus...
Text Extraction System by Eliminating Non-Text Regions
Optical Character Recognition
Ay32333339
International Journal of Engineering Research and Development
A NOVEL FEATURE SET FOR RECOGNITION OF SIMILAR SHAPED HANDWRITTEN HINDI CHARA...
Handwritten and Machine Printed Text Separation in Document Images using the ...
K2 Algorithm-based Text Detection with An Adaptive Classifier Threshold
F045053236
A Survey Paper on Character Recognition
IRJET- Object Detection using Hausdorff Distance
SEGMENTATION OF CHARACTERS WITHOUT MODIFIERS FROM A PRINTED BANGLA TEXT
A MULTI-STREAM HMM APPROACH TO OFFLINE HANDWRITTEN ARABIC WORD RECOGNITION
A MULTI-STREAM HMM APPROACH TO OFFLINE HANDWRITTEN ARABIC WORD RECOGNITION
Ad

Viewers also liked (17)

PPTX
Optical Character Recognition( OCR )
PDF
Mobile to server face recognition. Skripsi 1.
PPSX
PPTX
Presen_Segmentation
PDF
Introduction to graphs and their ability to represent images
PPTX
[RakutenTechConf2013] [C4-1] Text detection in product images
ODP
Face recognition application
PDF
राजभाषा हिंदी-सूचना और प्रौद्योगि‍की विषय पर हिंदी कार्यशाला
PDF
CHARACTER RECOGNITION USING NEURAL NETWORK WITHOUT FEATURE EXTRACTION FOR KAN...
PPT
Devanagari Character Recognition
PPT
character recognition: Scope and challenges
PPTX
Facial recognition locker for android
ODP
Emotion detection from text using data mining and text mining
PPTX
Text extraction From Digital image
PPTX
Optical Character Recognition (OCR)
DOCX
Android Face Recognition App Locker
PPT
Face recognition ppt
Optical Character Recognition( OCR )
Mobile to server face recognition. Skripsi 1.
Presen_Segmentation
Introduction to graphs and their ability to represent images
[RakutenTechConf2013] [C4-1] Text detection in product images
Face recognition application
राजभाषा हिंदी-सूचना और प्रौद्योगि‍की विषय पर हिंदी कार्यशाला
CHARACTER RECOGNITION USING NEURAL NETWORK WITHOUT FEATURE EXTRACTION FOR KAN...
Devanagari Character Recognition
character recognition: Scope and challenges
Facial recognition locker for android
Emotion detection from text using data mining and text mining
Text extraction From Digital image
Optical Character Recognition (OCR)
Android Face Recognition App Locker
Face recognition ppt
Ad

Similar to SCENE TEXT RECOGNITION IN MOBILE APPLICATION BY CHARACTER DESCRIPTOR AND STRUCTURE CONFIGURATION (20)

PPTX
Pattern_Recognition_via_Character_Recogn.pptx
PPTX
Image feature extraction
PDF
A binary graphics recognition algorithm based on fitting function
PDF
Performance analysis of chain code descriptor for hand shape classification
DOCX
Autocad commands
PDF
SEGMENTATION AND RECOGNITION OF HANDWRITTEN DIGIT NUMERAL STRING USING A MULT...
PDF
Segmentation and recognition of handwritten digit numeral string using a mult...
PDF
SEGMENTATION AND RECOGNITION OF HANDWRITTEN DIGIT NUMERAL STRING USING A MULT...
PPTX
Representation and recognition of handwirten digits using deformable templates
PPTX
Attributes of output Primitive
PDF
Chap_9_Representation_and_Description.pdf
PDF
Chap_9_Representation_and_Description.pdf
PDF
Paper id 24201433
PPTX
Digital Image Processing, Computer Science
PPTX
Introduction to image processing and pattern recognition
PPT
Signature recognition using clustering techniques dissertati
PDF
Optical character recognition performance analysis of sif and ldf based ocr
PDF
Artificial Neural Network For Recognition Of Handwritten Devanagari Character
PDF
L017116064
PPTX
LSDI 2.pptx
Pattern_Recognition_via_Character_Recogn.pptx
Image feature extraction
A binary graphics recognition algorithm based on fitting function
Performance analysis of chain code descriptor for hand shape classification
Autocad commands
SEGMENTATION AND RECOGNITION OF HANDWRITTEN DIGIT NUMERAL STRING USING A MULT...
Segmentation and recognition of handwritten digit numeral string using a mult...
SEGMENTATION AND RECOGNITION OF HANDWRITTEN DIGIT NUMERAL STRING USING A MULT...
Representation and recognition of handwirten digits using deformable templates
Attributes of output Primitive
Chap_9_Representation_and_Description.pdf
Chap_9_Representation_and_Description.pdf
Paper id 24201433
Digital Image Processing, Computer Science
Introduction to image processing and pattern recognition
Signature recognition using clustering techniques dissertati
Optical character recognition performance analysis of sif and ldf based ocr
Artificial Neural Network For Recognition Of Handwritten Devanagari Character
L017116064
LSDI 2.pptx

Recently uploaded (20)

PPTX
OOP with Java - Java Introduction (Basics)
PPTX
Welding lecture in detail for understanding
PPTX
M Tech Sem 1 Civil Engineering Environmental Sciences.pptx
PDF
BMEC211 - INTRODUCTION TO MECHATRONICS-1.pdf
PDF
Operating System & Kernel Study Guide-1 - converted.pdf
PDF
Mitigating Risks through Effective Management for Enhancing Organizational Pe...
PDF
R24 SURVEYING LAB MANUAL for civil enggi
PDF
Evaluating the Democratization of the Turkish Armed Forces from a Normative P...
PPT
Project quality management in manufacturing
PDF
PPT on Performance Review to get promotions
PPTX
web development for engineering and engineering
PPTX
CH1 Production IntroductoryConcepts.pptx
DOCX
573137875-Attendance-Management-System-original
PPTX
CARTOGRAPHY AND GEOINFORMATION VISUALIZATION chapter1 NPTE (2).pptx
PDF
Embodied AI: Ushering in the Next Era of Intelligent Systems
PPTX
Recipes for Real Time Voice AI WebRTC, SLMs and Open Source Software.pptx
PPTX
MET 305 2019 SCHEME MODULE 2 COMPLETE.pptx
PPTX
CYBER-CRIMES AND SECURITY A guide to understanding
PPTX
Foundation to blockchain - A guide to Blockchain Tech
PPT
CRASH COURSE IN ALTERNATIVE PLUMBING CLASS
OOP with Java - Java Introduction (Basics)
Welding lecture in detail for understanding
M Tech Sem 1 Civil Engineering Environmental Sciences.pptx
BMEC211 - INTRODUCTION TO MECHATRONICS-1.pdf
Operating System & Kernel Study Guide-1 - converted.pdf
Mitigating Risks through Effective Management for Enhancing Organizational Pe...
R24 SURVEYING LAB MANUAL for civil enggi
Evaluating the Democratization of the Turkish Armed Forces from a Normative P...
Project quality management in manufacturing
PPT on Performance Review to get promotions
web development for engineering and engineering
CH1 Production IntroductoryConcepts.pptx
573137875-Attendance-Management-System-original
CARTOGRAPHY AND GEOINFORMATION VISUALIZATION chapter1 NPTE (2).pptx
Embodied AI: Ushering in the Next Era of Intelligent Systems
Recipes for Real Time Voice AI WebRTC, SLMs and Open Source Software.pptx
MET 305 2019 SCHEME MODULE 2 COMPLETE.pptx
CYBER-CRIMES AND SECURITY A guide to understanding
Foundation to blockchain - A guide to Blockchain Tech
CRASH COURSE IN ALTERNATIVE PLUMBING CLASS

SCENE TEXT RECOGNITION IN MOBILE APPLICATION BY CHARACTER DESCRIPTOR AND STRUCTURE CONFIGURATION

  • 1. SCENE TEXT RECOGNITION IN MOBILE APPLICATION BY CHARACTER DESCRIPTOR AND STRUCTURE COMFIGURATION CHERIYAN K M
  • 2. INTRODUCING…..  Valuable information form an image.  To extract an information.  Automatic and Effective scene text detection.  Recognition algorithm.  Factors affecting on extraction.  Cluttered background.  Difference in text pattern.  Difficult to model the structure of character.  Lake of discriminative pixel level appearance.  Structure features from non-text background outliers.  Different word , may diff. characters , in various fonts , styles and size.
  • 3.  Two activities;  Text detection.  Localize the image region containing the text characters.  Based on  Color uniformity and  Horizontal alignment of text char.  Text recognition.  Transform pixel-based text into reliable codes.  Distinguish diff. text characters , Properly compose the text word.  62 identity category of text characters. 9 (0-9) 26 (a-z) 26 (A-Z)  Two schemes;  Character recognizer to predict the category of text char.  Binary character classifier to predict the existence of ctgry.
  • 4. RELATED WORKS  Optical Character Recognizer (OCR) system.  Many algorithms are proposed;  Weinmen:- combined the Gabor-based appearance model.  Neumann:- based on extremal region.  Smith:- based on SIFT.  Mishra:- adopted conditional random field.  Lu:- modeled the inner character structure.  Coates:- extracted local features of character patches.
  • 6. LAYOUT BASED SCENE TEXT DETECTION  A text;  Instruction  Identifier  Uniform color  Aligned arrangement Two processes are employed to complete layout analysis 1. Color Decomposition 2. Horizontal Alignment Improved to compatible with mobile app
  • 7. LAYOUT ANALYSIS OF COLOR DECOMPOSITION  Boundary clustering algorithm base on bigram color uniformity.  Group pixels of same color into a layer.  Character boundary boarder b/w txt and bg.(color pair)  Create a vector of color pair (txt and bg).
  • 9. LAYOUT ANALYSIS OF HORIZONTAL ALIGNMENT Text information(string) Several character members In similar size Approximately horizontal alignment The geometrical properties to detect the existence of text characters
  • 10.  Adjacent character grouping algorithm Bounding box>siblings>similar size & vertical location>merge
  • 11.  For non-horizontal strings-> ± /6 degree set as range.
  • 12. STRUCTURE BASED SCENE TEXT RECOGNITION  To extract text information.  Binary classification problem.  Character classes(Queried characters).  Binary classifier:- to distinguish character class from other classes or bg outliers.  Eg: Character class A predict patches containing A as positive. And other as negative.  Two activities; 1. Character descriptor. 2. Stroke configuration.
  • 13. CHARACTER DESCRIPTOR  Extract structure features.  4 different key points features; 1. Harris Detector:- To extract Key points from corner and junction. 2. MSER Detector:-To extract Key point from stroke component. 3. Dense Detector:- To extract Key point uniformly. 4. Random Detector:- To extract the preset number of Key points in a random pattern.
  • 15. Flowchart of our proposed character descriptor HOG:-features are Calculated as observed feature vector x. (Histogram of Oriented Gradient) •Selected as local feature descriptor( compatibility with all 4 key point detectors).
  • 16.  SIFT and SURF are not employed  Normalization of character patches(128x128).  Feature Quantization: to aggregate the extracted features  Bag-of-Words(BOW) Medel:- Applied to key points from all 4 feature detector.  Gaussian Mixture Model(GMM):Applied to key points from DD & RD.(fixed number and location of key point)  Now mapping both into characteristic Histogram as feature representation.  Cascading BOW and GMM-based feature repr. ,we get Character Descriptor.
  • 17. CHARACTER STROKE CONFIGURATION  Stroke:- Region bounded by two parallel boundary segments.  Stroke width  Stroke orientation  Characters are connected strokes with configuration.  Structure Map of Strokes is stroke configuration.( is consistant) Eg: B have 1 vertical stroke 2 arc strokes. B
  • 18.  Synthesized character generator: Estimate stroke configuration from computer s/w.(Provide accurate skeleton and boundary)  Synthetic font training dataset(20000 are selected out off 67400 character patches)  Contain 62 class of characters(128x128 pixel)  Compose Stroke Configuration Step1  Discrete Contour Evaluation(DCE):obtain boundary and skeleton. Skeleton pruning on the basis of DCE.  DCE simplifies the character(using polygon and small no. of vertices)  DCE and Skeleton pruning are invariant to deformation and scaling.
  • 19. Step2  Estimate stroke width and orientation  Width: length along normal  Orientation: tangent  Sampling from character boundary  128 samples.  So that no. of samples = length  Estimating  Taking two neighboring sample point to fit a line.  Approximately collinear.  A quadratic curve.
  • 20. Step3  Calculate Skeleton-based stroke map  Consistency of stroke width and orientation. Width no larger than 3 Orientation no larger than /8  Construct stroke section: If sample point satisfying the stroke related features.  Construct junction sections: If they are not.  Skeleton points are extracted.
  • 22. STROKE ALIGNMENT METHOD  To handle various fonts, styles …..etc  Mean value of all stroke configuration.  Mean value,
  • 23.  D=Distance b/w stroke configurations of two samples  S=Mean value of stroke configurations.  Ti=Transformations applied on strokes of i-th stroke configuration.  g(Ti)=Amplitude of the transformation.