International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 06 Issue: 08 | Aug 2019 www.irjet.net p-ISSN: 2395-0072
© 2019, IRJET | Impact Factor value: 7.34 | ISO 9001:2008 Certified Journal | Page 356
Handwritten Decimal Image Compression using Deep Stacked
Autoencoder
Swati Pachare1, Shubhada Thakare2
1P.G. Scholar, Dept. of Electronics Engineering, GCOE, Amravati
2Assistant Professor, Dept. of Electronics Engineering, GCOE, Amravati
----------------------------------------------------------------------***---------------------------------------------------------------------
Abstract - Image compression is a technique that identifies
internal data redundancy and then derives a compact
representation. It has long been a necessary and active topic
of research in the image processing domain. Earlier image
compression algorithms, such as JPEG and JPEG2000, depend on
the encoder/decoder (codec) block diagram. They use fixed
transform matrices, i.e. the Discrete Cosine Transform (DCT)
and the wavelet transform, together with quantization and an
entropy coder to compress the image. Image compression is
important in image processing applications such as data
storage, image classification and image recognition.
However, image compression with auto-encoders has so far
received only a small number of improvements. Therefore, the
proposed work studies and demonstrates an image compression
algorithm using a deep neural network (DNN). Deep learning
has much potential to enhance performance in various
computer vision tasks.
Key Words: Image compression, image processing, auto-
encoder, deep neural network
1. INTRODUCTION
Image compression is the art and science of reducing the
data required to represent an image. It is one of the most
useful and commercially successful techniques in the area of
digital image processing. Compression enables image
transmission at very low bandwidth and decreases the space
needed to store the data. Image compression has become
necessary owing to the increased demand for information
transfer and storage.
J. Jiang presents a review of image compression with neural
networks, showing that existing image compression
technologies such as the MPEG, H.26X and JPEG standards are
being developed with the assistance of neural networks to
provide improvement over traditional algorithms [1].
Chao Dong et al. show that lossy compression methods such as
JPEG introduce ringing effects and blocking artifacts.
Wavelet-transform thresholding and the shape-adaptive DCT
are used to remove ringing and blocking artifacts, but they
give a blurred output. To eliminate the undesired artifacts,
the Artifacts Reduction Convolutional Neural Network
(AR-CNN) is used; it shows impressive results, suppressing
the blocking artifacts while retaining edge patterns and
sharp details [2]. G. E. Hinton and R. R. Salakhutdinov
proposed a dimensionality reduction method using the
Restricted Boltzmann Machine (RBM), which outperforms the
Principal Component Analysis (PCA) method [3]. Adna Sento
proposed an image compression method using an auto-encoder
algorithm in which the extended Kalman filter is the
learning algorithm that updates the weights of the
network [4].
Hongda Shen et al. [5] proposed a lossless compression
method for curated erythrocyte images using stacked
auto-encoders and their variants. The dimensionality
reduction preserves all discriminative features of the
images. This method gives good compression performance
compared with JP2K-LM, JPEG-LS and CALIC. J. Almotiri et
al. [6] present a comparison between the auto-encoder and
Principal Component Analysis (PCA) in the compression and
classification of data. The auto-encoder gives 98.1%
accuracy while PCA gives 97.2% accuracy. Robert Torfason et
al. considered two distinct computer vision tasks performed
on compressed image representations: image classification
and semantic segmentation. When classification and image
compression are trained jointly, they observe an increase in
MS-SSIM and SSIM and, at the same time, better segmentation
and classification precision [7].
A. B. Said et al. [8] present combined compression and
classification of EMG and EEG signals by means of a deep
learning approach. The deep architecture extracts the
features and also reconstructs the data using greedy
layer-wise training. The experiment is conducted on the DEAP
dataset, which consists of EEG, EMG and multiple
physiological signals recorded from 32 participants. For
both EEG and EMG data there are 23,040 samples of
dimensionality 896. Comparison with DWT and compressed
sensing shows that this approach performs better, with a
high compression ratio. J. Papitha, G. Merlin Nancy and D.
Nedumaran compared eight compression algorithms on 200 or
more MR images to evaluate their quality and performance
[9]. Gaurav Kumar and Pradeep Kumar Bhatia [10] give a brief
comparison between wavelet, Discrete Cosine Transform and
neural network methods used for image compression. The
comparison is based on performance parameters such as PSNR,
retained energy and output image size. In this comparison,
wavelet-based image compression shows the best results
compared with ANN and DCT. W. K. Yeo et al. proposed a
method to compress MRI images using a feed-forward neural
network (FNN) trained with the back-propagation algorithm.
Standard algorithms such as JPEG and JPEG 2000 perform
better than the FNN in terms of image quality and
compression ratio, although the FNN is aimed at applications
in the medical field [11].
2. PROPOSED SYSTEM
This section covers the compression and decompression of
handwritten decimal numeral images from the dataset. A
stacked auto-encoder is used for the compression and
reconstruction of the images. The auto-encoder encodes the
image into its compressed representation, and the image is
reconstructed from this compressed representation.
Fig- 1: Block Diagram
1. The Modified National Institute of Standards and
Technology (MNIST) database is used. It is a collection of
handwritten digit images consisting of 60,000 training
images and 10,000 testing images [12].
2. Image compression with stacked auto-encoder:
Auto-encoders [13] can be stacked to form a deep network by
feeding the internal representation (output code) of the
auto-encoder at the layer below as input to the layer under
consideration. The unsupervised pre-training of the
architecture is done one layer at a time. Once the first k
layers are trained, the (k + 1)th layer can be trained using
the internal representation of the kth layer. The advantage
of this architecture is that, by using more hidden layers
than a single auto-encoder, high-dimensional input data can
be reduced to a much smaller code representing the important
features.
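The layer-wise scheme described above can be written compactly. The notation below is an assumption introduced for illustration and does not come from the paper: x is the input vector, h^(k) the code of the kth layer, f and g the encoder and decoder activation functions, W and b the encoder weights and biases, the tilde versions the decoder weights and biases, and N the number of training samples. Each auto-encoder is trained to minimise the reconstruction error E^(k) of its own input, which is the code produced by the layer below.

```latex
\begin{align}
h^{(k)} &= f\left(W^{(k)} h^{(k-1)} + b^{(k)}\right), \qquad h^{(0)} = x \\
\hat{h}^{(k-1)} &= g\left(\tilde{W}^{(k)} h^{(k)} + \tilde{b}^{(k)}\right) \\
E^{(k)} &= \frac{1}{N} \sum_{n=1}^{N} \left\lVert h^{(k-1)}_{n} - \hat{h}^{(k-1)}_{n} \right\rVert^{2}
\end{align}
```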
Fig-2: Stacked Autoencoder
Training of Auto-encoder
Step 1: Start
Step 2: Create and configure the auto-encoder network
Step 3: Initialize the weights and biases
Step 4: Train the auto-encoder network with the Scaled
Conjugate Gradient (SCG) method
Step 5: The output of the hidden layer of the 1st
auto-encoder network is given as input to the 2nd
auto-encoder
Step 6: Repeat step 4
Step 7: The output of the hidden layer of the 2nd
auto-encoder network is given as input to the 3rd
auto-encoder
Step 8: Repeat step 4
Step 9: After all three auto-encoders are trained, the
output of the hidden layer of the 3rd auto-encoder is given
as input to the softmax layer, which applies the softmax
activation function
Step 10: All these auto-encoders are stacked together to
form the deep stacked auto-encoder
Step 11: End
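The steps above can be sketched in a few lines of code. This is only a minimal illustration under several assumptions, not the authors' implementation: it uses plain stochastic gradient descent in place of SCG, a toy 8->6->4->2 stack in place of the real network, random binary vectors in place of digit images, and it omits the softmax layer of Step 9. The names TinyAutoencoder, train_stack, compress and decompress are hypothetical.

```python
import math
import random


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


class TinyAutoencoder:
    """One-hidden-layer auto-encoder with sigmoid units (illustrative only)."""

    def __init__(self, n_in, n_hidden, rng):
        # Step 3: small random weights, zero biases.
        self.W1 = [[rng.uniform(-0.5, 0.5) for _ in range(n_in)] for _ in range(n_hidden)]
        self.b1 = [0.0] * n_hidden
        self.W2 = [[rng.uniform(-0.5, 0.5) for _ in range(n_hidden)] for _ in range(n_in)]
        self.b2 = [0.0] * n_in

    def encode(self, x):
        return [sigmoid(sum(w * xi for w, xi in zip(row, x)) + b)
                for row, b in zip(self.W1, self.b1)]

    def decode(self, h):
        return [sigmoid(sum(w * hi for w, hi in zip(row, h)) + b)
                for row, b in zip(self.W2, self.b2)]

    def train(self, data, epochs=200, lr=0.5):
        # Step 4 stand-in: stochastic gradient descent on squared error, NOT SCG.
        for _ in range(epochs):
            for x in data:
                h = self.encode(x)
                y = self.decode(h)
                # Error signal at the sigmoid output units.
                d_out = [(yi - xi) * yi * (1 - yi) for yi, xi in zip(y, x)]
                # Error signal back-propagated to the hidden units.
                d_hid = [hj * (1 - hj) * sum(self.W2[i][j] * d_out[i] for i in range(len(x)))
                         for j, hj in enumerate(h)]
                for i, di in enumerate(d_out):
                    for j, hj in enumerate(h):
                        self.W2[i][j] -= lr * di * hj
                    self.b2[i] -= lr * di
                for j, dj in enumerate(d_hid):
                    for i, xi in enumerate(x):
                        self.W1[j][i] -= lr * dj * xi
                    self.b1[j] -= lr * dj


def train_stack(data, hidden_sizes, rng):
    """Greedy layer-wise training: each layer reconstructs the code below it."""
    stack, inputs = [], data
    for n_hidden in hidden_sizes:
        ae = TinyAutoencoder(len(inputs[0]), n_hidden, rng)  # Step 2
        ae.train(inputs)                                     # Steps 4, 6, 8
        stack.append(ae)                                     # Step 10
        inputs = [ae.encode(x) for x in inputs]              # Steps 5 and 7
    return stack


def compress(stack, x):
    for ae in stack:
        x = ae.encode(x)
    return x


def decompress(stack, code):
    for ae in reversed(stack):
        code = ae.decode(code)
    return code


rng = random.Random(0)
toy_data = [[float(rng.random() < 0.5) for _ in range(8)] for _ in range(6)]
stack = train_stack(toy_data, [6, 4, 2], rng)
code = compress(stack, toy_data[0])
restored = decompress(stack, code)
print(len(code), len(restored))
```

Decompression here reuses the decoder halves of the trained auto-encoders in reverse order, so no separate decompression method is needed once the stack is trained.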
The Scaled Conjugate Gradient (SCG) [14] training algorithm
is used. SCG is a supervised learning algorithm. It is fully
automated, includes no critical user-dependent parameters,
and avoids the time-consuming line search that CGB and BFGS
use in each iteration to determine an appropriate step size.
SCG belongs to the class of conjugate gradient methods,
which show super-linear convergence on most problems. The
speed-up depends on the convergence criterion, i.e. the
greater the demanded reduction in error, the greater the
speed-up.
3. EXPERIMENTATION
1. Data collection
The dataset chosen for the analysis of the deep networks is
MNIST, which contains a set of handwritten digit samples
from 0 to 9. Each digit is a grey-scale image of size
20 × 20 pixels, with values in the range [0, 1].
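As a small illustration of this input format, the sketch below flattens a 20 × 20 grey-scale image into the 400-element vector that an auto-encoder would take as input and scales it to [0, 1]. It assumes the raw pixels are 8-bit grey levels (0 to 255), which the paper does not state; the function name to_unit_vector and the synthetic test image are hypothetical.

```python
def to_unit_vector(image_rows, max_value=255):
    """Flatten a 2-D list of grey levels into a 1-D list scaled to [0, 1]."""
    return [pixel / max_value for row in image_rows for pixel in row]


# Hypothetical 20 x 20 image: a bright vertical stroke on a dark background.
image = [[255 if 9 <= col <= 10 else 0 for col in range(20)] for _ in range(20)]
vector = to_unit_vector(image)
print(len(vector), min(vector), max(vector))  # 400 0.0 1.0
```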
Fig- 3: MNIST Sample Images
2. Training of auto-encoder
The MNIST handwritten digits dataset has been used for this
experiment. The architecture consists of three hidden
layers, where the first layer has 100 units, the second
layer has 50 units and the third layer has 25 units. This
structure can be denoted as [400->100->50->25]. The three
auto-encoders are trained one by one. After training, all
three auto-encoders are stacked together to form a deep
architecture.
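The reduction achieved at each hidden layer follows directly from the layer sizes. The sketch below computes it, under the assumption that the ratio is measured as the number of input values divided by the number of code values, with every value stored at the same precision. The paper does not report compression ratios in this form, and compression_ratios is a hypothetical helper.

```python
def compression_ratios(layer_sizes):
    """Ratio of the input dimension to each hidden-layer code dimension."""
    n_input = layer_sizes[0]
    return [n_input / size for size in layer_sizes[1:]]


layer_sizes = [400, 100, 50, 25]
for depth, ratio in enumerate(compression_ratios(layer_sizes), start=1):
    print(f"hidden layer {depth}: {layer_sizes[depth]} values, ratio {ratio:.0f}:1")
# hidden layer 1: 100 values, ratio 4:1
# hidden layer 2: 50 values, ratio 8:1
# hidden layer 3: 25 values, ratio 16:1
```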
Fig- 4 : Auto-encoder Layer 1
Fig-5 : Auto-encoder Layer 2
Fig- 6 : Auto-encoder Layer 3
Fig-7: Stacked Auto-encoder
Fig- 8: MSE obtained at layer 1
Fig- 9: MSE obtained at layer 2
Fig- 10: MSE obtained at layer 3
3. Testing of Auto-encoder
For testing, the 10,000 test images from the MNIST dataset
are used. All trained neural networks are saved as .mat
files, which can be loaded directly when testing is carried
out. Images are loaded from the dataset one at a time or as
an array, the compressed representation at each layer is
shown in the output window, and the decompressed image is
obtained after compression.
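One way to quantify how close a decompressed image is to its input is the mean squared error (MSE), with PSNR derived from it. The sketch below is illustrative only: it assumes pixel values in [0, 1], so the peak value is 1.0, and the two short vectors are made-up numbers, not results from the paper.

```python
import math


def mse(original, reconstructed):
    """Mean squared error between two equal-length pixel vectors."""
    return sum((a - b) ** 2 for a, b in zip(original, reconstructed)) / len(original)


def psnr(original, reconstructed, peak=1.0):
    """Peak signal-to-noise ratio in dB; infinite for a perfect reconstruction."""
    err = mse(original, reconstructed)
    return math.inf if err == 0 else 10 * math.log10(peak ** 2 / err)


# Made-up example vectors standing in for an input and its decompressed output.
original = [0.0, 1.0, 0.5, 0.25]
reconstructed = [0.1, 0.9, 0.5, 0.25]
print(round(mse(original, reconstructed), 4), round(psnr(original, reconstructed), 2))
```

A lower MSE, or equivalently a higher PSNR, indicates that less information was lost in the compressed code.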
4. EXPERIMENTAL RESULTS
Fig-11: Input Image
Fig-12: Compressed image at hidden layer 1
Fig-13: Compressed image at hidden layer 2
Fig- 14: Compressed image at hidden layer 3
Fig-15: Decompressed image output
5. CONCLUSION
In this paper, various image compression techniques are
reviewed and compared. The main purpose of image compression
is to reduce storage and bandwidth requirements. According
to the results of the papers discussed above, deep learning
meets this goal better than machine learning and
conventional image compression techniques. An advantage of
deep learning is that it learns features and classifies
automatically. Its main disadvantage is that training on a
large dataset requires more time, but once training is done,
testing takes less time. The dimensionality of the input
image can be reduced very precisely from a very large
dataset of images. Each layer gives a more compressed
representation, so adding layers yields a more compressed
image. Decompression of the image is also easy because no
separate decompression method needs to be applied.
REFERENCES
[1] J. Jiang, "Image compression with neural networks - A
survey", Signal Processing: Image Communication, vol. 14,
pp. 737-760, 1999
[2] C. Dong, Y. Deng, C. C. Loy and X. Tang, "Compression
Artifacts Reduction by a Deep Convolutional Network",
2015 IEEE International Conference on Computer Vision
(ICCV), Santiago, 2015, pp. 576-584
[3] G. E. Hinton and R. Salakhutdinov, "Reducing the
Dimensionality of Data with Neural Networks", Science,
vol. 313, no. 5786, pp. 504-507, 2006
[4] Adna Sento, "Image Compression with Auto-encoder
Algorithm using Deep Neural Network (DNN)", 2016
IEEE Management and Innovation Technology
International Conference (MITiCON-2016), pp. 99-103
[5] Hongda Shen, W. David Pan, Yuhang Dong and
Mohammad Alim, "Lossless Compression of Curated
Erythrocyte Images Using Deep Autoencoders for
Malaria Infection Diagnosis", 2016 IEEE
[6] Jasem Almotiri, Khaled Elleithy and Abdelrahman Elleithy,
"Comparison of Autoencoder and Principal Component
Analysis Followed by Neural Network for E-Learning
Using Handwritten Recognition", 2017 IEEE
[7] Robert Torfason, Fabian Mentzer, Eirikur Agustsson,
Michael Tschannen, Radu Timofte and Luc Van Gool,
"Towards image understanding from deep
compression without decoding", published as a
conference paper at ICLR 2018
[8] Ahmed Ben Said, Amr Mohamed, Tarek Elfouly, Khaled
Harras and Z. Jane Wang, "Multimodal deep learning
approach for joint EEG-EMG data compression and
classification", 2017 IEEE
[9] J. Papitha, G. Merlin Nancy and D. Nedumaran,
"Compression Techniques on MR Image - A Comparative
Study", 2013 IEEE
[10] Gaurav Kumar and Pradeep Kumar Bhatia, "Empirical
analysis of Image Compression using Wavelets, Discrete
Cosine Transform and Neural Network", 2016
International Conference on Computing for Sustainable
Global Development (INDIACom), pp. 3862-3866
[11] W. K. Yeo, David F. W. Yap, T. H. Oh, D. P. Andito, S. L. Kok,
Y. H. Ho and M. K. Suaidi, "Grayscale medical image
compression using feedforward neural network", 2011
International Conference on Computer Applications and
Industrial Electronics (ICCAIE 2011), pp. 633-638
[12] http://yann.lecun.com/exdb/mnist
[13] Josh Patterson and Adam Gibson, "Deep Learning: A
Practitioner's Approach", published by O'Reilly Media,
Inc., 1005 Gravenstein Highway North, Sebastopol, CA
95472
[14] Martin Fodslette Møller, "A Scaled Conjugate Gradient
Algorithm for Fast Supervised Learning", Neural
Networks, vol. 6, pp. 525-533, 1993
[15] Salman Khan, Hossein Rahmani, Syed Afaq Ali Shah and
Mohammed Bennamoun, "A Guide to Convolutional
Neural Networks for Computer Vision", a publication in
the Morgan and Claypool Publishers series

More Related Content

What's hot (20)

PDF
Improved block based segmentation for jpeg compressed document images
eSAT Journals
 
PDF
Improved block based segmentation for jpeg
eSAT Publishing House
 
PDF
DYNAMIC NETWORK ANOMALY INTRUSION DETECTION USING MODIFIED SOM
cscpconf
 
PDF
Development of 3D convolutional neural network to recognize human activities ...
journalBEEI
 
PDF
International Journal of Computational Engineering Research(IJCER)
ijceronline
 
PDF
IRJET - Symmetric Image Registration based on Intensity and Spatial Informati...
IRJET Journal
 
PDF
Fuzzy Type Image Fusion Using SPIHT Image Compression Technique
IJERA Editor
 
PDF
40120140507006
IAEME Publication
 
PDF
Kq3518291832
IJERA Editor
 
PDF
Lossless Image Compression Techniques Comparative Study
IRJET Journal
 
PDF
IRJET - Effective Workflow for High-Performance Recognition of Fruits using M...
IRJET Journal
 
PPT
Common image compression formats
Clyde Lettsome
 
PDF
Face recognition using assemble of low frequency of DCT features
journalBEEI
 
PDF
International Journal on Soft Computing ( IJSC )
ijsc
 
PDF
Comparison of different Fingerprint Compression Techniques
sipij
 
PDF
Novel approach for hybrid MAC scheme for balanced energy and transmission in ...
IJECEIAES
 
PDF
A Comparative Case Study on Compression Algorithm for Remote Sensing Images
DR.P.S.JAGADEESH KUMAR
 
PDF
Ieee projects 2012 2013 - Digital Image Processing
K Sundaresh Ka
 
PDF
AN EFFICIENT M-ARY QIM DATA HIDING ALGORITHM FOR THE APPLICATION TO IMAGE ERR...
IJNSA Journal
 
PDF
D0325016021
theijes
 
Improved block based segmentation for jpeg compressed document images
eSAT Journals
 
Improved block based segmentation for jpeg
eSAT Publishing House
 
DYNAMIC NETWORK ANOMALY INTRUSION DETECTION USING MODIFIED SOM
cscpconf
 
Development of 3D convolutional neural network to recognize human activities ...
journalBEEI
 
International Journal of Computational Engineering Research(IJCER)
ijceronline
 
IRJET - Symmetric Image Registration based on Intensity and Spatial Informati...
IRJET Journal
 
Fuzzy Type Image Fusion Using SPIHT Image Compression Technique
IJERA Editor
 
40120140507006
IAEME Publication
 
Kq3518291832
IJERA Editor
 
Lossless Image Compression Techniques Comparative Study
IRJET Journal
 
IRJET - Effective Workflow for High-Performance Recognition of Fruits using M...
IRJET Journal
 
Common image compression formats
Clyde Lettsome
 
Face recognition using assemble of low frequency of DCT features
journalBEEI
 
International Journal on Soft Computing ( IJSC )
ijsc
 
Comparison of different Fingerprint Compression Techniques
sipij
 
Novel approach for hybrid MAC scheme for balanced energy and transmission in ...
IJECEIAES
 
A Comparative Case Study on Compression Algorithm for Remote Sensing Images
DR.P.S.JAGADEESH KUMAR
 
Ieee projects 2012 2013 - Digital Image Processing
K Sundaresh Ka
 
AN EFFICIENT M-ARY QIM DATA HIDING ALGORITHM FOR THE APPLICATION TO IMAGE ERR...
IJNSA Journal
 
D0325016021
theijes
 

Similar to IRJET- Handwritten Decimal Image Compression using Deep Stacked Autoencoder (20)

PDF
CBIR by deep learning
Vigen Sahakyan
 
PDF
92 97
Editor IJARCET
 
PDF
92 97
Editor IJARCET
 
DOCX
Thesis on Image compression by Manish Myst
Manish Myst
 
PDF
OBDPC 2022
klepsydratechnologie
 
PDF
Quality assessment of deep-learning-based image compression
Marco Cagnazzo
 
PDF
"Machine Learning- based Image Compression: Ready for Prime Time?," a Present...
Edge AI and Vision Alliance
 
PDF
International Journal of Computational Engineering Research(IJCER)
ijceronline
 
PDF
2019-06-14:6 - Reti neurali e compressione immagine
uninfoit
 
PDF
A novel compression methodology for medical images using deep learning for hi...
International Journal of Reconfigurable and Embedded Systems
 
PDF
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
ijceronline
 
PDF
Autoencoders
CloudxLab
 
PDF
Comparison Between Levenberg-Marquardt And Scaled Conjugate Gradient Training...
CSCJournals
 
PDF
A Survey of Deep Learning Algorithms for Malware Detection
IJCSIS Research Publications
 
PDF
A systematic image compression in the combination of linear vector quantisati...
eSAT Publishing House
 
PPTX
UNIT-4.pptx
NiharikaThakur32
 
PDF
Introduction to Autoencoders
Yan Xu
 
PDF
UNIT-4.pdf
NiharikaThakur32
 
PDF
UNIT-4.pdf
NiharikaThakur32
 
PDF
Image Compression based on DCT and BPSO for MRI and Standard Images
IJERA Editor
 
CBIR by deep learning
Vigen Sahakyan
 
Thesis on Image compression by Manish Myst
Manish Myst
 
Quality assessment of deep-learning-based image compression
Marco Cagnazzo
 
"Machine Learning- based Image Compression: Ready for Prime Time?," a Present...
Edge AI and Vision Alliance
 
International Journal of Computational Engineering Research(IJCER)
ijceronline
 
2019-06-14:6 - Reti neurali e compressione immagine
uninfoit
 
A novel compression methodology for medical images using deep learning for hi...
International Journal of Reconfigurable and Embedded Systems
 
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
ijceronline
 
Autoencoders
CloudxLab
 
Comparison Between Levenberg-Marquardt And Scaled Conjugate Gradient Training...
CSCJournals
 
A Survey of Deep Learning Algorithms for Malware Detection
IJCSIS Research Publications
 
A systematic image compression in the combination of linear vector quantisati...
eSAT Publishing House
 
UNIT-4.pptx
NiharikaThakur32
 
Introduction to Autoencoders
Yan Xu
 
UNIT-4.pdf
NiharikaThakur32
 
UNIT-4.pdf
NiharikaThakur32
 
Image Compression based on DCT and BPSO for MRI and Standard Images
IJERA Editor
 
Ad

More from IRJET Journal (20)

PDF
Enhanced heart disease prediction using SKNDGR ensemble Machine Learning Model
IRJET Journal
 
PDF
Utilizing Biomedical Waste for Sustainable Brick Manufacturing: A Novel Appro...
IRJET Journal
 
PDF
Kiona – A Smart Society Automation Project
IRJET Journal
 
PDF
DESIGN AND DEVELOPMENT OF BATTERY THERMAL MANAGEMENT SYSTEM USING PHASE CHANG...
IRJET Journal
 
PDF
Invest in Innovation: Empowering Ideas through Blockchain Based Crowdfunding
IRJET Journal
 
PDF
SPACE WATCH YOUR REAL-TIME SPACE INFORMATION HUB
IRJET Journal
 
PDF
A Review on Influence of Fluid Viscous Damper on The Behaviour of Multi-store...
IRJET Journal
 
PDF
Wireless Arduino Control via Mobile: Eliminating the Need for a Dedicated Wir...
IRJET Journal
 
PDF
Explainable AI(XAI) using LIME and Disease Detection in Mango Leaf by Transfe...
IRJET Journal
 
PDF
BRAIN TUMOUR DETECTION AND CLASSIFICATION
IRJET Journal
 
PDF
The Project Manager as an ambassador of the contract. The case of NEC4 ECC co...
IRJET Journal
 
PDF
"Enhanced Heat Transfer Performance in Shell and Tube Heat Exchangers: A CFD ...
IRJET Journal
 
PDF
Advancements in CFD Analysis of Shell and Tube Heat Exchangers with Nanofluid...
IRJET Journal
 
PDF
Breast Cancer Detection using Computer Vision
IRJET Journal
 
PDF
Auto-Charging E-Vehicle with its battery Management.
IRJET Journal
 
PDF
Analysis of high energy charge particle in the Heliosphere
IRJET Journal
 
PDF
A Novel System for Recommending Agricultural Crops Using Machine Learning App...
IRJET Journal
 
PDF
Auto-Charging E-Vehicle with its battery Management.
IRJET Journal
 
PDF
Analysis of high energy charge particle in the Heliosphere
IRJET Journal
 
PDF
Wireless Arduino Control via Mobile: Eliminating the Need for a Dedicated Wir...
IRJET Journal
 
Enhanced heart disease prediction using SKNDGR ensemble Machine Learning Model
IRJET Journal
 
Utilizing Biomedical Waste for Sustainable Brick Manufacturing: A Novel Appro...
IRJET Journal
 
Kiona – A Smart Society Automation Project
IRJET Journal
 
DESIGN AND DEVELOPMENT OF BATTERY THERMAL MANAGEMENT SYSTEM USING PHASE CHANG...
IRJET Journal
 
Invest in Innovation: Empowering Ideas through Blockchain Based Crowdfunding
IRJET Journal
 
SPACE WATCH YOUR REAL-TIME SPACE INFORMATION HUB
IRJET Journal
 
A Review on Influence of Fluid Viscous Damper on The Behaviour of Multi-store...
IRJET Journal
 
Wireless Arduino Control via Mobile: Eliminating the Need for a Dedicated Wir...
IRJET Journal
 
Explainable AI(XAI) using LIME and Disease Detection in Mango Leaf by Transfe...
IRJET Journal
 
BRAIN TUMOUR DETECTION AND CLASSIFICATION
IRJET Journal
 
The Project Manager as an ambassador of the contract. The case of NEC4 ECC co...
IRJET Journal
 
"Enhanced Heat Transfer Performance in Shell and Tube Heat Exchangers: A CFD ...
IRJET Journal
 
Advancements in CFD Analysis of Shell and Tube Heat Exchangers with Nanofluid...
IRJET Journal
 
Breast Cancer Detection using Computer Vision
IRJET Journal
 
Auto-Charging E-Vehicle with its battery Management.
IRJET Journal
 
Analysis of high energy charge particle in the Heliosphere
IRJET Journal
 
A Novel System for Recommending Agricultural Crops Using Machine Learning App...
IRJET Journal
 
Auto-Charging E-Vehicle with its battery Management.
IRJET Journal
 
Analysis of high energy charge particle in the Heliosphere
IRJET Journal
 
Wireless Arduino Control via Mobile: Eliminating the Need for a Dedicated Wir...
IRJET Journal
 
Ad

Recently uploaded (20)

PDF
輪読会資料_Miipher and Miipher2 .
NABLAS株式会社
 
PDF
FSE-Journal-First-Automated code editing with search-generate-modify.pdf
cl144
 
PDF
CLIP_Internals_and_Architecture.pdf sdvsdv sdv
JoseLuisCahuanaRamos3
 
PPTX
Computer network Computer network Computer network Computer network
Shrikant317689
 
PDF
Designing for Tomorrow – Architecture’s Role in the Sustainability Movement
BIM Services
 
PDF
13th International Conference of Security, Privacy and Trust Management (SPTM...
ijcisjournal
 
PDF
Decision support system in machine learning models for a face recognition-bas...
TELKOMNIKA JOURNAL
 
PDF
Rapid Prototyping for XR: Lecture 2 - Low Fidelity Prototyping.
Mark Billinghurst
 
PPTX
Bitumen Emulsion by Dr Sangita Ex CRRI Delhi
grilcodes
 
PPTX
Stability of IBR Dominated Grids - IEEE PEDG 2025 - short.pptx
ssuser307730
 
PPTX
Bharatiya Antariksh Hackathon 2025 Idea Submission PPT.pptx
AsadShad4
 
PPTX
How to Un-Obsolete Your Legacy Keypad Design
Epec Engineered Technologies
 
PPTX
FSE_LLM4SE1_A Tool for In-depth Analysis of Code Execution Reasoning of Large...
cl144
 
PPTX
CST413 KTU S7 CSE Machine Learning Introduction Parameter Estimation MLE MAP ...
resming1
 
PDF
Rapid Prototyping for XR: Lecture 6 - AI for Prototyping and Research Directi...
Mark Billinghurst
 
PPTX
Introduction to File Transfer Protocol with commands in FTP
BeulahS2
 
PPTX
Work at Height training for workers .pptx
cecos12
 
PPTX
WHO And BIS std- for water quality .pptx
dhanashree78
 
PPTX
LECTURE 7 COMPUTATIONS OF LEVELING DATA APRIL 2025.pptx
rr22001247
 
PDF
Python Mini Project: Command-Line Quiz Game for School/College Students
MPREETHI7
 
輪読会資料_Miipher and Miipher2 .
NABLAS株式会社
 
FSE-Journal-First-Automated code editing with search-generate-modify.pdf
cl144
 
CLIP_Internals_and_Architecture.pdf sdvsdv sdv
JoseLuisCahuanaRamos3
 
Computer network Computer network Computer network Computer network
Shrikant317689
 
Designing for Tomorrow – Architecture’s Role in the Sustainability Movement
BIM Services
 
13th International Conference of Security, Privacy and Trust Management (SPTM...
ijcisjournal
 
Decision support system in machine learning models for a face recognition-bas...
TELKOMNIKA JOURNAL
 
Rapid Prototyping for XR: Lecture 2 - Low Fidelity Prototyping.
Mark Billinghurst
 
Bitumen Emulsion by Dr Sangita Ex CRRI Delhi
grilcodes
 
Stability of IBR Dominated Grids - IEEE PEDG 2025 - short.pptx
ssuser307730
 
Bharatiya Antariksh Hackathon 2025 Idea Submission PPT.pptx
AsadShad4
 
How to Un-Obsolete Your Legacy Keypad Design
Epec Engineered Technologies
 
FSE_LLM4SE1_A Tool for In-depth Analysis of Code Execution Reasoning of Large...
cl144
 
CST413 KTU S7 CSE Machine Learning Introduction Parameter Estimation MLE MAP ...
resming1
 
Rapid Prototyping for XR: Lecture 6 - AI for Prototyping and Research Directi...
Mark Billinghurst
 
Introduction to File Transfer Protocol with commands in FTP
BeulahS2
 
Work at Height training for workers .pptx
cecos12
 
WHO And BIS std- for water quality .pptx
dhanashree78
 
LECTURE 7 COMPUTATIONS OF LEVELING DATA APRIL 2025.pptx
rr22001247
 
Python Mini Project: Command-Line Quiz Game for School/College Students
MPREETHI7
 

IRJET- Handwritten Decimal Image Compression using Deep Stacked Autoencoder

  • 1. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 06 Issue: 08 | Aug 2019 www.irjet.net p-ISSN: 2395-0072 © 2019, IRJET | Impact Factor value: 7.34 | ISO 9001:2008 Certified Journal | Page 356 Handwritten Decimal Image Compression using Deep Stacked Autoencoder Swati Pachare1, Shubhada Thakare2 1P.G. Scholar, Dept. of Electronics Engineering, GCOE, Amravati 2Assistant Professor, Dept. of Electronics Engineering, GCOE, Amravati ----------------------------------------------------------------------***--------------------------------------------------------------------- Abstract - Compression of image is a technique that is used to identify internal dataredundancyandtosubsequentlycome up with a compact representation. Compression of image has been a necessary and effective topic of research in the image processing domain. Earlier used algorithms for image compression, like JPEG and JPEG2000, depend on the encoder/decoder (codec) block diagram. They use the fixed transform matrixes, i.e wavelet transform and DiscreteCosine Transform (DCT), together with quantization and entropy coder to compress the image together with quantization and entropy coder to compress the image. A compression of image is important in the applications image processing like storing of data, classification of image, recognition of image etc. However, the image compression with auto-encoder has been found for a small number of the improvements. Therefore, the proposed work is to study and demonstrate the image compression algorithm using the deep neural network(DNN). Deep learning has much potentialtoenhancetheperformance in various computer vision tasks. Key Words: Image compression, image processing, auto- encoder, deep neural network 1. INTRODUCTION Image compression is the art and science art diminishing the data required in the representation of an image. 
It is most helpful and commercially successful techniques in the area of the digital image processing. Compression enables image transmission at very low bandwidth and decreases space needed for the storage of the data. Image compression has turn out to be necessary owing to increased demand for information transfer and storage. J. Jiang presents a review on image compression with neural network shows image compression existing technologylike, MPEG, H.26X and JPEG standards are being developed with assisting with neural network to provide improvementover traditional algorithms [1]. Chao Dong et.al shows lossy compression methodslikeJPEG introduces ringing effects, blocking artifacts. In wavelet transform thersholding and DCT transform in shape- adaptive are used to remove ringing and blocking artifacts but gives the blurred output. To eliminate the undesired artifacts Artifacts Reduction-Convolutional Neural Network (AR-CNN) is used showsimpressiveresultsitsuppressed the blocking artifacts while retains the edge patterns and sharp details [2]. G. E. Hinton, R. R. Salakhutdinov proposed dimensionality reduction method of data using the Restricted Boltzmann Machine (RBM) which outperforms over the Principal Component Analysis (PCA) method [3]. Adna Sento proposed the method of image compression method usingauto-encoder algorithmusesextendedKalman filter algorithm as learning algorithm which updates the weights in the network [4]. Hongda Shen [5] proposedthelosslesscompressionmethod of curated erythrocyte images using stacked auto-encoder and their variants. The dimensionality reduction of the images preserves all discriminative features of the images. This compression gives good compression performance as compared to JP2K-LM, JPEG-LSandCALIC.J.Almotiriet.al [6] presents the comparison between auto-encoder and Principal Component Analysis (PCA) inthe compressionand classification of the data. The auto-encoder gives 98.1% accuracy while PCA gives 97.2% accuracy. 
Robert Torfason et al. proposed two distinct computer vision tasks from compressed image representations which are image classification andsemanticsegmentationareconsider.When combining classification and image compression training, observe an increase in MS-SSIM and SSIM and at the same time, a better segmentation and classification precision [7]. A.B. Said et al. [8] presents the combined compression and classification of EMG and EEG signals by means of deep learning approach. Deep architecture extracts the features and also reconstructs the data using greedy layer wise training. This experiment is conduct on DEAP dataset. It consists of EEG, EMG and multiple physiological signals recorded from 32 participants. For both EEG and EMG data they have 23,040 samples of dimensionality 896. While comparison with DWT andcompresssensingshowsthat this approach performs better with high compression ratio. J. Papitha, G. Merlin Nancy, D. Nedumaran proposed eight compression algorithms were compareon200andmore MR images for the evaluation of quality and also the performance of it [9]. Gaurav Kumar and Pradeep Kumar Bhatia [10] expound the brief comparison between wavelet, Discrete Cosine Transform and Neural network methods used for image compression. The comparison is based on performance parameters such as PSNR, retained energyand output image size. From this comparison wavelet based image compression shows the optimumresultsascompared with ANN and DCT. W. K. Yeo et al., proposed the method to compress the MRI images using the feed forward neural network and for training back-propagation algorithm is used. The lossless algorithms like JPEG, JPEG 2000
outperform the feed-forward neural network (FNN) in terms of image quality and compression ratio, but the FNN in the field of medical [11].

2. PROPOSED SYSTEM

This section covers the compression and decompression of handwritten decimal numeral images from the dataset. A stacked auto-encoder is used for the compression and reconstruction of the images. The auto-encoder encodes the image into its compressed representation, and the image is reconstructed from this compressed representation.

Fig- 1: Block Diagram

1. Modified National Institute of Standards and Technology (MNIST) is the database used. It is a collection of handwritten digit images consisting of 60,000 training images and 10,000 testing images [12].

2. Image compression with stacked auto-encoder: Auto-encoders [13] can be stacked to form a deep network by feeding the internal representation (output code) of the auto-encoder at the layer below as input to the considered layer. The unsupervised pre-training of the architecture is done one layer at a time: once the first k layers are trained, the (k + 1)th layer can be trained using the internal representation of the kth layer. The advantage of this architecture is that, by using more hidden layers than a single auto-encoder, high-dimensional input data can be reduced to a much smaller code representing the important features.
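The layer-by-layer pre-training described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the implementation used in this work: it uses NumPy, sigmoid units, and plain batch gradient descent in place of the Scaled Conjugate Gradient method, and it trains on synthetic random vectors standing in for flattened 20 × 20 digit images. The helper names (`train_autoencoder`, `train_stack`, `encode`, `decode`), the learning rate and the epoch count are hypothetical choices, and the final softmax layer is omitted.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def train_autoencoder(x, n_hidden, epochs=100, lr=0.1, seed=0):
    """Train one sigmoid auto-encoder on x of shape (n_samples, n_in).

    Assumption: plain batch gradient descent, not SCG.
    Returns (w_enc, b_enc, w_dec, b_dec) and the per-epoch MSE history.
    """
    rng = np.random.default_rng(seed)
    n_samples, n_in = x.shape
    w_enc = rng.normal(0.0, 0.1, (n_in, n_hidden))
    b_enc = np.zeros(n_hidden)
    w_dec = rng.normal(0.0, 0.1, (n_hidden, n_in))
    b_dec = np.zeros(n_in)
    history = []
    for _ in range(epochs):
        h = sigmoid(x @ w_enc + b_enc)   # hidden code
        y = sigmoid(h @ w_dec + b_dec)   # reconstruction
        err = y - x
        history.append(float(np.mean(err ** 2)))
        # Gradient of the squared error summed over pixels and averaged
        # over samples (a rescaled MSE), back-propagated through sigmoids.
        d_y = (2.0 / n_samples) * err * y * (1.0 - y)
        d_h = (d_y @ w_dec.T) * h * (1.0 - h)
        w_dec -= lr * (h.T @ d_y)
        b_dec -= lr * d_y.sum(axis=0)
        w_enc -= lr * (x.T @ d_h)
        b_enc -= lr * d_h.sum(axis=0)
    return (w_enc, b_enc, w_dec, b_dec), history

def train_stack(x, layer_sizes, **kwargs):
    """Greedy pre-training: the code of layer k is the input of layer k + 1."""
    params, histories, layer_input = [], [], x
    for n_hidden in layer_sizes:
        p, hist = train_autoencoder(layer_input, n_hidden, **kwargs)
        params.append(p)
        histories.append(hist)
        layer_input = sigmoid(layer_input @ p[0] + p[1])
    return params, histories

def encode(x, params):
    """Compress: pass the input through every encoder in order."""
    for w_enc, b_enc, _, _ in params:
        x = sigmoid(x @ w_enc + b_enc)
    return x

def decode(code, params):
    """Decompress: pass the code through the decoders in reverse order."""
    for _, _, w_dec, b_dec in reversed(params):
        code = sigmoid(code @ w_dec + b_dec)
    return code

# Synthetic stand-in for 200 flattened 20 x 20 images with values in [0, 1].
rng = np.random.default_rng(42)
images = rng.random((200, 400)) ** 3

params, histories = train_stack(images, [100, 50, 25], epochs=100, lr=0.1)
codes = encode(images, params)
recon = decode(codes, params)
print(codes.shape, recon.shape)
```

Keeping each auto-encoder's decoder allows the stacked network to reconstruct an image by running the decoders in reverse order, so no separate decompression method is needed.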
Fig-2: Stacked Autoencoder

Training of Auto-encoder

Step 1: Start
Step 2: Create and configure the auto-encoder network.
Step 3: Initialize the weights and biases.
Step 4: Train the auto-encoder network with the Scaled Conjugate Gradient (SCG) method.
Step 5: The output of the hidden layer of the 1st auto-encoder network is given as input to the 2nd auto-encoder.
Step 6: Repeat step 4.
Step 7: The output of the hidden layer of the 2nd auto-encoder network is given as input to the 3rd auto-encoder.
Step 8: Repeat step 4.
Step 9: After all three auto-encoders are trained, the output of the hidden layer of the 3rd auto-encoder is given as input to the softmax layer, which applies the softmax activation function.
Step 10: All these auto-encoders are stacked together to form the deep stacked auto-encoder.
Step 11: End

The Scaled Conjugate Gradient (SCG) [14] training algorithm is used. SCG is a supervised learning algorithm. It is fully automated, includes no critical user-dependent parameters, and avoids the time-consuming line search that CGB and BFGS use in each iteration to determine an appropriate step size. SCG belongs to the class of conjugate gradient methods, which show super-linear convergence on most problems. The speed-up depends on the convergence criterion: the greater the demanded reduction in error, the greater the speed-up.

3. EXPERIMENTATION

1. Data collection

The dataset chosen for the analysis of the deep networks is MNIST, which contains a set of handwritten digit samples from 0 to 9. Each digit is a grey-scale image of size 20 × 20 pixels, with values in the range [0, 1].
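As an illustration of the input format described above, the sketch below scales a batch of 20 × 20 grey-scale images to the range [0, 1] and flattens each one into a 400-element vector. It assumes the raw pixels are stored as 8-bit integers (0 to 255), which the paper does not state. The function name `prepare_images` is hypothetical, and the batch is random data rather than MNIST.

```python
import numpy as np

def prepare_images(raw):
    """Scale 8-bit grey-scale images to [0, 1] and flatten each to a row vector.

    Assumption: raw has shape (n_images, height, width) with values 0..255.
    """
    raw = np.asarray(raw)
    scaled = raw.astype(np.float64) / 255.0
    return scaled.reshape(raw.shape[0], -1)

# Random stand-in for five 20 x 20 digit images.
rng = np.random.default_rng(0)
raw_batch = rng.integers(0, 256, size=(5, 20, 20), dtype=np.uint8)

x = prepare_images(raw_batch)
print(x.shape)
```

Each row of `x` then has 400 values, one per pixel of a 20 × 20 image.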
Fig- 3: MNIST Sample Images

2. Training of auto-encoder

The MNIST handwritten digits dataset has been used for this experiment. The architecture consists of three hidden layers, where the first layer has 100 units, the second layer has 50 units and the third layer has 25 units. This structure can be denoted as [400->100->50->25]. The three auto-encoders are trained one by one. After training, all three auto-encoders are stacked together to form a deep architecture.

Fig- 4: Auto-encoder Layer 1
Fig- 5: Auto-encoder Layer 2
Fig- 6: Auto-encoder Layer 3
Fig- 7: Stacked Auto-encoder
Fig- 8: MSE obtained at layer 1
Fig- 9: MSE obtained at layer 2
Fig- 10: MSE obtained at layer 3

3. Testing of Auto-encoder

For testing, 10,000 images from the MNIST dataset are used. All trained neural networks are saved as .mat files, which can be loaded directly when testing is carried out. Images are loaded from the dataset one at a time or as an array, the compression at each layer is shown in the output window, and the decompressed image is obtained after compression.

4. EXPERIMENTAL RESULTS

Fig- 11: Input Image
Fig- 12: Compressed image at hidden layer 1
Fig- 13: Compressed image at hidden layer 2
Fig- 14: Compressed image at hidden layer 3
Fig- 15: Decompressed image output

5. CONCLUSION

In this paper, various image compression techniques are reviewed and compared. The main purpose of image compression is to reduce storage and bandwidth requirements. Judging by the results of the papers mentioned above, deep learning meets this purpose better than machine learning and conventional image compression techniques. The advantage of deep learning is that it learns features and classifies automatically. Its main disadvantage is that training on a large dataset requires more time, but once training is done, testing takes less time. The dimensionality of the input image can be reduced very precisely, even with a very large dataset of images. Compression at each layer gives a more compressed representation, and adding layers yields a still more compressed image. Decompression is also straightforward, because no separate decompression method needs to be applied.
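As a worked illustration of the statement that each additional layer gives a more compressed representation, the ratio of input dimension to code dimension can be computed for the [400->100->50->25] architecture. This assumes that every value in the input and in the code is stored with the same number of bits, which the paper does not report. The symbols CR_k (compression ratio at hidden layer k) and n_k (number of units in hidden layer k) are introduced here only for illustration.

```latex
\mathrm{CR}_k = \frac{n_{\text{input}}}{n_k}, \qquad
\mathrm{CR}_1 = \frac{400}{100} = 4, \quad
\mathrm{CR}_2 = \frac{400}{50} = 8, \quad
\mathrm{CR}_3 = \frac{400}{25} = 16
```

Under this assumption, the code at the third hidden layer holds sixteen times fewer values than the input image.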
REFERENCES

[1] J. Jiang, "Image compression with neural networks - A survey," Signal Processing: Image Communication 14 (1999) 737-760.
[2] C. Dong, Y. Deng, C. C. Loy and X. Tang, "Compression Artifacts Reduction by a Deep Convolutional Network," 2015 IEEE International Conference on Computer Vision (ICCV), Santiago, 2015, pp. 576-584.
[3] G. E. Hinton and R. Salakhutdinov, "Reducing the Dimensionality of Data with Neural Networks," Science, vol. 313, no. 5786, pp. 504-507, 2006.
[4] Adna Sento, "Image Compression with Auto-encoder Algorithm using Deep Neural Network (DNN)," 2016 IEEE Management and Innovation Technology International Conference (MITiCON-2016), pp. 99-103.
[5] Hongda Shen, W. David Pan, Yuhang Dong and Mohammad Alim, "Lossless Compression of Curated Erythrocyte Images Using Deep Autoencoders for Malaria Infection Diagnosis," 2016 IEEE.
[6] Jasem Almotiri, Khaled Elleithy and Abdelrahman Elleithy, "Comparison of Autoencoder and Principal Component Analysis Followed by Neural Network for E-Learning Using Handwritten Recognition," 2017 IEEE.
[7] Robert Torfason, Fabian Mentzer, Eirikur Agustsson, Michael Tschannen, Radu Timofte and Luc Van Gool, "Towards image understanding from deep compression without decoding," published as a conference paper at ICLR 2018.
[8] Ahmed Ben Said, Amr Mohamed, Tarek Elfouly, Khaled Harras and Z. Jane Wang, "Multimodal deep learning approach for joint EEG-EMG data compression and classification," 2017 IEEE.
[9] J. Papitha, G. Merlin Nancy and D. Nedumaran, "Compression Techniques on MR Image - A Comparative Study," 2013 IEEE.
[10] Gaurav Kumar and Pradeep Kumar Bhatia, "Empirical analysis of Image Compression using Wavelets, Discrete Cosine Transform and Neural Network," 2016 International Conference on Computing for Sustainable Global Development (INDIACom), pp. 3862-3866.
[11] W. K. Yeo, David F. W. Yap, T. H. Oh, D. P. Andito, S. L. Kok, Y. H. Ho and M. K. Suaidi, "Grayscale medical image compression using feedforward neural network," 2011 International Conference on Computer Applications and Industrial Electronics (ICCAIE 2011), pp. 633-638.
[12] http://yann.lecun.com/exdb/mnist
[13] Josh Patterson and Adam Gibson, "Deep Learning: A Practitioner's Approach," O'Reilly Media, Inc., 1005 Gravenstein Highway North, Sebastopol, CA 95472.
[14] Martin Fodslette Møller, "A Scaled Conjugate Gradient Algorithm for Fast Supervised Learning," Neural Networks, vol. 6, pp. 525-533, 1993.
[15] Salman Khan, Hossein Rahmani, Syed Afaq Ali Shah and Mohammed Bennamoun, "A Guide to Convolutional Neural Networks for Computer Vision," a publication in the Morgan and Claypool Publishers series.