SlideShare a Scribd company logo
IOSR Journal of Computer Engineering (IOSR-JCE)
e-ISSN: 2278-0661,p-ISSN: 2278-8727, Volume 17, Issue 6, Ver. V (Nov – Dec. 2015), PP 79-85
www.iosrjournals.org
DOI: 10.9790/0661-17657985 www.iosrjournals.org 79 | Page
Data Compression using Multiple Transformation Techniques for
Audio Applications.
Arashpreet Kaur1
, Rajesh Mehra2
1
(M.E Scholar in Electronics and Communication Engineering, National Institute for technical teachers
Training and Research, India)
2
(Associate Professor in Electronics and Communication Engineering, National Institute for Technical Teachers
Training and Research, India)
Abstract: As Multimedia Technology is growing day by day over past decades, therefore demand for digital
information increasing rapidly. This digital information contains multimedia files like image files, audio files
that require a large space so no other option than compression. In Compression high input stream of data is
converted into small size. Data Compression for audio purposes is a field of digital signal processing that
focuses on reducing bit-rate of audio signals to enhance transmission speed and storage capacity of fast
developing multimedia by removing large amount of unnecessary duplicate data. The advantages of the
compression technique are reduction in storage space, bandwidth, transmission power and energy. This paper
is based on transform technology for compression of the audio signal. In this methodology, different transforms
such as Discrete Wavelet Transform (DWT) and Discrete Cosine Transform (DCT) are used. Mean compression
ratio is calculated for DCT & DWT. Performance measures like peak signal-to-noise ratio (PSNR), signal-to-
noise ratio (SNR) & normalized root mean square error (NRMSE) are calculated and then compared.
Keywords: Compression Ratio, DCT, DWT, NRMSE, PSNR, SNR.
I. Introduction
Data compression is a technique in which data content of the input signal to system is compressed so
that original signal is obtained as output and unwanted or undesired signals are removed [1]. Audio is an
electrical representation of sound within the range of human hearing that specifically lies between 20 Hz to 20
kHz range of frequency which is detectable by the human ear. [2]The concept of Audio Compression is to
compress the data in the form of audio so that it occupies less space for storing it. The need for audio
compression is to accommodate more data in the available storage area so that the storage capacity can be
enhanced. Due to less storage space occupancy, large amounts of data can be placed in the available memory.
Therefore less storage room for information inhabitance, large amount of information can be transmitted with
less transmission capacity[4] That implies compressed audio signal can be transmitted over the web with less
transmission bandwidth at higher speeds. As speed is increased, audio files can be transferred and downloaded
over the web faster with higher bit rates. Because of quick downloading and transferring of audio files, time
delay is minimized. [1]
Signals compression is based on removing the redundancy between adjacent samples and/or between
the adjacent cycles. In data compression, it is desired to represent data by as small as possible number of
coefficients within an acceptable loss of visual quality.
Compression techniques has two main categories: lossless and lossy.
Compression methods can be classified into three functional categories:
 Direct Methods: The samples of the signal are directly handled to provide compression.
 Transformation Methods: such as Fourier Transform (FT), Wavelet Transform (WT), and Discrete Cosine
Transform (DCT).
 Parameter Extraction Methods: A preprocessor is employed to extract some features that are later used to
reconstruct the signal.[2],[5]
In this paper audio compression is carried out in two levels. In the first level a transform function
(technique) like Discrete cosine transform, discrete wavelet transform are applied on audio signal which gives a
result with a new set of data with smaller values. By applying transform technique compression ratio for each
transform technique is obtained on different audio samples. Parameters like Signal to noise ratio (SNR), mean
square error (MSE) are measured for the reconstructed audio obtained from DCT, WT these transform
techniques. Second level is encoding. This step will present data in minimal form by using these encoding
techniques. Compression ratios are also calculated.
Data Compression using Multiple Transformation Techniques for Audio Applications.
DOI: 10.9790/0661-17657985 www.iosrjournals.org 80 | Page
II. Transformation Techniques
1. Discrete cosine transform
A discrete cosine transform (DCT) communicates a limited succession of information focuses as far as
cosine functions wavering at distinctive frequencies[1] DCTs are critical to various applications in science and
designing, from lossy compression of sound (e.g. MP3) and pictures (e.g. JPEG) (where little high-recurrence
segments can be discarded), to spectral methods for the numerical arrangement of partial differential equations.
The utilization of cosine instead of sine functions is critical for compression, since it turns out (as portrayed
beneath) that less cosine functions are needed to approximate a typical signal, while for differential equations
the cosines express a particular choice of boundary conditions. The DCT is similar to the discrete Fourier
transform: it changes a sign or picture from the spatial area to the recurrence space.
Discrete cosine transform, linearly transforms information into frequency domain, so that the
information can be represented by a set of coefficients. The advantage of discrete cosine transform is that, the
vitality of the genuine information may be moved in just a couple low recurrence parts of DCT depending on the
correlation present in the data. Equations (1) represents the D.C.T of 1-dimentional arrangement of length N.
𝒚 𝒌 = 𝒘 𝒌 𝒙 𝒏 𝒄𝒐𝒔
𝝅 𝟐𝒏−𝟏 𝒌−𝟏
𝟐𝑵
𝑵
𝒏=𝟏 𝒌 = 𝟏, 𝟐 … … . . 𝑵 (1)
Where
𝒘 𝒌 =
𝟏
𝑵
𝒌 = 𝟏
𝟐
𝑵
𝟐 ≤ 𝒌 ≤ 𝑵
(2)
N is the length of x, and x and y are the same size. If x is a matrix, DCT transforms its columns. The
series is indexed from n = 1 and k = 1 instead of the usual n = 0 and k = 0 because MATLAB vectors run from 1
to N instead of from 0 to N- 1. It often reconstructs a sequence very accurately from only a few DCT
coefficients, a useful property for applications requiring data reduction.[12]
Wavelets: Wavelet is a part of waveform, whose average energy value approximately tends to zero. Generally,
wavelet is a varying window length considered as a part of waveform. Wavelet transform are based on small
wavelets with limited duration.
Equation (3) and (4) shows mathematical representation of wavelet [6]. Ѱ(t) is a function called mother
wavelet.
Ѱ(𝐭) 𝒏∞
−∞
𝒅𝒕 < ∞ (3)
Ѱ(𝐭)
∞
−∞
𝒅𝒕 < ∞ (4)
2. Discrete Wavelet Transform
Jean Morlet introduced the idea of wavelet transform in 1982 and provided mathematical tool for
seismic wave analysis [1],[4].
A discrete wavelet transform is characterized as a "small wave" that has its energy concentrated in time
to give an apparatus to the examination of transient, non-stationary, or time-varying phenomena.[8] It has the
oscillating wave like properties additionally can permit concurrent time and recurrence investigation. Wavelet
Transform has risen as an effective mathematical tool in numerous regions of science and engineering, although
in the field of sound and data compression. [1]
Wavelet transform decomposes a signal into a set of basic functions [12]. These basis functions are
called “wavelets”.
Ѱ 𝒂,𝒃 𝒕 =
𝟏
𝒂
Ѱ
𝐭−𝐛
𝐚
(5)
Where „a‟ is the scaling parameter and „b‟ is the shifting parameter. And parameter „b‟ is translation
parameter and it determines the time location of the wavelet. DWT decompose signal in to several n levels in
different frequency bands. At each step there are two outputs, scaling and wavelet coefficients. The equations
(6) and (7) represent the scaling and wavelet coefficients respectively.
Data Compression using Multiple Transformation Techniques for Audio Applications.
DOI: 10.9790/0661-17657985 www.iosrjournals.org 81 | Page
𝒙𝒋+𝟏
𝒏 = 𝒉 𝟐𝒏 − 𝒊 𝒙𝒋
(𝒏)𝟐𝒏
𝒊=𝟏 (6)
𝒚𝒋+𝟏
𝒏 = 𝒈 𝟐𝒏 − 𝒊 𝒙𝒋
(𝒏)𝟐𝒏
𝒊=𝟏 (7)
Wavelet transforms convert a signal into series of wavelets and they give an approach to breaking
down waveforms in both frequency and time duration. Wavelet transform are a numerical intends to perform
signal investigation when signal frequency varies with time [14], [15]. The Wavelet transform gives the time-
frequency representation. That is, wavelet transforms give time and frequency information in the meantime,
thus giving a time frequency representation of the signal [13]. The wavelet transform is processed independently
for distinctive sections of the time-domain signal at different frequencies. Multi-resolution analysis: investigates
the signal at different frequencies giving different resolutions. . MRA is intended to give good time resolution &
poor frequency resolution at high frequencies and good frequency resolution & poor time resolution at low
frequencies. Wavelet transforms are good for signal having high frequency components for short durations and
low frequency components for long duration .e.g. images, video frames and speech signal.
III. Implementation Using Matlab
MATLAB could be a great tool that is employed to investigate audio signals which are read in wav
format. Following commands are used for analyzing the audio signal
• wavread: it reads audio signal
• windowsize: defines window operate of transformation
• wavplay: it produces speech signal once transformation
• length: defines length of speech to be processed by
• reworking principle dct: performs distinct trigonometric function rework
• idct: performs inverse distinct trigonometric function rework[5],[7]
This is shown in figure (3) , that is design flow of DCT coder. After getting the reconstructed signal , the audio
is further processed for calculating the peak signal-to-noise ratio (PSNR), signal-to-noise ratio (SNR) and
normalised root mean square error (NRMSE). These values comparison is shown in table 1.
Here figure (1) is the original audio signal which is to be transformed. Figure (2) is the design flow for the DWT
.Fig 1: audio signal
Data Compression using Multiple Transformation Techniques for Audio Applications.
DOI: 10.9790/0661-17657985 www.iosrjournals.org 82 | Page
Fig 2 : Design flow of wavelet based audio coder
Fig 3: Design flow of DCT audio coder
Decoder
Wavelet
decomposition of
frames
Computation of
variables
Decompose into N
equal frames
Wavelet
Compression
scheme
Performance
calculations
Play sound
Results
Expander
Compression
Choose
wavelet function.
Psychoacoustic
model
Quantization
Read the wave file and convert it to a vector
Choose the block size
Define the compression percentages
Reshape into matrix form
Plot the expanded view, spectrogram of audio signals
DCT compression
Reshape the matrix and play the reconstructed wave file Performance calculations
Results
Display
Data Compression using Multiple Transformation Techniques for Audio Applications.
DOI: 10.9790/0661-17657985 www.iosrjournals.org 83 | Page
IV. Output In Matlab
The output for the different waveforms is shown below. Figure (4) shows the audio signals in different
compression ratios that is audio with different amplitudes. Figure(5) shows the portion of audio signal to be
processed. Figure(6) represents the audio spectrograms while in Figure(7) compressed audio output using DWT
is shown.
Fig 4: Audio signals with different amplitudes
Fig 5: Portion of speech signal to be processed
Data Compression using Multiple Transformation Techniques for Audio Applications.
DOI: 10.9790/0661-17657985 www.iosrjournals.org 84 | Page
Fig 6: Audio Spectograms
Fig 7: Compression by DWT
V. Performance Measures
For this audio compression method, based on transform techniques, the performance is measured in
terms of Compression ratio, SNR, MSE.
1.Compression ratio: The definition for Compression ratio (C.R) is defined as ratio between Length of
original signal and Length of compressed signal
C. R =
original audio length
compressed audio length
2.Signal to noise ratio: It is a measure that compare the level of desired signal to the level of background
noise.
Data Compression using Multiple Transformation Techniques for Audio Applications.
DOI: 10.9790/0661-17657985 www.iosrjournals.org 85 | Page
𝐒𝐍𝐑 = 𝟏𝟎 𝐥𝐨𝐠 𝟏𝟎 𝛔 𝐬
𝟐
𝛔 𝐞
𝟐 𝟐
(8)
𝜎𝑠
2
is the mean square of audio signal and 𝜎𝑒
2
is mean square difference between original and reconstructed
audio signal.
3.Normalized Root Mean Square Error (NRMSE):
𝑵𝑹𝑴𝑺𝑬 =
(𝒙 𝒏 −𝒙′(𝒏)) 𝟐
𝒏
(𝒙 𝒏 −𝒖 𝒙(𝒏)) 𝟐
𝒏
(9)
Here, X(n) is the speech signal, x‟(n) is reconstructed speech signal and μ x(n) is the mean of speech signal.
4.Peak Signal to Noise Ratio (PSNR):
𝑷𝑺𝑵𝑹 = 𝟏𝟎 𝐥𝐨𝐠 𝟏𝟎
𝑵𝑿 𝟐
𝒙−𝒙′ 𝟐 (10)
Where N is the length of reconstructed signal, X is the maximum absolute square value of signal x and ||x-x`||2
is the energy of the difference between the original and reconstructed signal.
Signal CR MSE SNR(db) PSNR(db)
funky 0.2639 0.02990 31.83 45.21
Table 1: Results of DCT based technique in terms of CF, SNR, PSNR & MSE
Signal CR MSE SNR(db) PSNR(db)
funky 0.0587 0.08 21.02 36.24
Table 2: Results of DWT based technology in terms of CF,SNR,PSNR & MSE
VI. Conclusion
In this paper a simple DWT & DCT based audio compression schemes are presented. These data
compression is done by using MATLAB CODING. From the results shown above it is clear that DWT gives
less compression ratio in comparison to DCT, while MSE for DCT is less. SNR and PSNR for DWT is less in
comparison to DCT.DWT is better than DCT for audio compression.
Here audio is compressed in different factors in case of DCT by 2 , 4 and 8.
Acknowledgements
I am very thankful to my college and guide Dr. Rajesh Mehra for providing time to time help in
studying the topic and providing me a background to understand it in deeper details.
References:
[1] M. Arora, N. Maurya, P. Pathak, and V. Singh, “SPEECH COMPRESSION ANALYSIS USING MATLAB,” pp. 538–542, 2014.
[2] M. V Patil, A. Gupta, A. Varma, and S. Salil, “AUDIO AND SPEECH COMPRESSION USING DCT AND DWT
TECHNIQUES,” vol. 2, no. 5, pp. 1712–1719, 2013
[3] J. S. Jacaba, “AUDIO COMPRESSION USING MODIFIED DISCRETE COSINE TRANSFORM : THE MP3,” no. October, 2001.
[4] H. S. Malvar, “LOSSLESS AND NEAR-LOSSLESS AUDIO COMPRESSION USING INTEGER- REVERSIBLE
MODULATED LAPPED TRANSFORMS.”
[5] H. kaur and R. kaur ,D. Cosine, “SPEECH COMPRESSION AND DECOMPRESSION USING DWT AND DCT,” vol. 3, no.
August, pp. 1501–1503, 2012
[6] M. Arora and N. Maurya, “AUDIO COMPRESSION IN MPEG TECHNOLOGY PEG IS AN ACRONYM FOR MOVING
PICTURE EXERTS,” vol. 3, no. 12, pp. 1–4, 2013.
[7] M.Siffuzzaman, M.R.Islam, and M.Z. Ali, “APPLICATIONS OF WAVELET TRANSFORM AND ITS ADVANTAGES
COMPARED TO FOURIER TRANSFORM, by Journal of Physical Sciences, Vol. 13, pp:121-134, October-2009.
[8] Z. T. Drweesh and L. E. George, “AUDIO COMPRESSION BASED ON DISCRETE COSINE TRANSFORM , RUN LENGTH
AND HIGH ORDER SHIFT ENCODING,” vol. 4, no. 1, pp. 45–51, 2014.
[9] S. M. Ali, “ INTERNATIONAL JOURNAL OF ADVANCED RESEARCH IN SPEEDING UP AUDIO FRACTAL
COMPRESSION ,” vol. 3, no. 6, pp. 86–92, 2013
[10] K.P. Soman, K.I. Ramachandran, “INSIGHT TO WAVELETS” second edition 2005, by Prentice Hall of India. ISBN- 81-203-
2902-3
[11] Raghuveer M. Rao, Ajit S. Bopadikar, “WAVELET TRANSFORMS – INTRODUCTION TO THEORY AND APPLICATIONS”.
Pearson Education Asia. 1998 Pearson education, Inc. ISBN:81-7808-251-9.
[12] J. James and V. J. Thomas, “AUDIO COMPRESSION USING DCT AND DWT TECHNIQUES,” vol. 4, no. 4, pp. 119–125, 2014.

More Related Content

What's hot (20)

PDF
Deep Learning Based Voice Activity Detection and Speech Enhancement
NAVER Engineering
 
PDF
Cb34474478
IJERA Editor
 
PDF
Dc3210881096
IJMER
 
PDF
145 153
Editor IJARCET
 
PDF
Digital signal processing techniques for lti fiber
eSAT Publishing House
 
PDF
Digital signal processing techniques for lti fiber impairment compensation
eSAT Journals
 
DOCX
speech enhancement
senthilrajvlsi
 
PPT
Lecture: Digital Signal Processing Batch 2009
ubaidis
 
PPTX
Voice Activity Detection using Single Frequency Filtering
Tejus Adiga M
 
PPTX
Introduction to wavelet transform
Raj Endiran
 
PDF
International Journal of Engineering Research and Development (IJERD)
IJERD Editor
 
PDF
A METHOD FOR ENCRYPTING AND DECRYPTINGWAVE FILES
IJNSA Journal
 
PDF
Discrete wavelet transform using matlab
IAEME Publication
 
PDF
캡슐 네트워크를 이용한 엔드투엔드 음성 단어 인식, 배재성(KAIST 석사과정)
NAVER Engineering
 
PDF
K31074076
IJERA Editor
 
PDF
A novel speech enhancement technique
eSAT Publishing House
 
PDF
Analysis of Microstrip Finger on Bandwidth of Interdigital Band Pass Filter u...
IJREST
 
PDF
Iaetsd wavelet transform based latency optimized image compression for
Iaetsd Iaetsd
 
PPTX
Subband Coding
Mihika Shah
 
Deep Learning Based Voice Activity Detection and Speech Enhancement
NAVER Engineering
 
Cb34474478
IJERA Editor
 
Dc3210881096
IJMER
 
Digital signal processing techniques for lti fiber
eSAT Publishing House
 
Digital signal processing techniques for lti fiber impairment compensation
eSAT Journals
 
speech enhancement
senthilrajvlsi
 
Lecture: Digital Signal Processing Batch 2009
ubaidis
 
Voice Activity Detection using Single Frequency Filtering
Tejus Adiga M
 
Introduction to wavelet transform
Raj Endiran
 
International Journal of Engineering Research and Development (IJERD)
IJERD Editor
 
A METHOD FOR ENCRYPTING AND DECRYPTINGWAVE FILES
IJNSA Journal
 
Discrete wavelet transform using matlab
IAEME Publication
 
캡슐 네트워크를 이용한 엔드투엔드 음성 단어 인식, 배재성(KAIST 석사과정)
NAVER Engineering
 
K31074076
IJERA Editor
 
A novel speech enhancement technique
eSAT Publishing House
 
Analysis of Microstrip Finger on Bandwidth of Interdigital Band Pass Filter u...
IJREST
 
Iaetsd wavelet transform based latency optimized image compression for
Iaetsd Iaetsd
 
Subband Coding
Mihika Shah
 

Viewers also liked (20)

PDF
Basics of Image Compression
Punnam Chandar
 
PPTX
Audio encoding principles
Phillip Doyle
 
PPTX
6. Data Compression
Napier University
 
PDF
Compression basics
Joel Nally
 
PPT
Image video processing and canvas element by Abhay Rao
Mrinmay Kulkarni
 
PPT
Speech Compression
anithabalaprabhu
 
PPT
Basics of audio coding
sakshij91
 
PPTX
Atm Networks
Rishabh RJ
 
PPTX
3 multimedia elements - audio
Kelly Bauer
 
PPTX
Audio compression
Miled Othmen
 
PPT
Lecture 8 audio compression
Mr SMAK
 
PPT
Basic ISDN
Chris McAndrew
 
PPTX
Audio compression
Sahil Garg
 
PPT
Audio and video compression
neeraj9217
 
PPTX
Broadband isdn
Er Varun Kumar
 
PPTX
Audio compression
Madhawa Gunasekara
 
PPT
Image compression
Ale Johnsan
 
PDF
ISDN Integrated Services Digital Network
Madhumita Tamhane
 
PPT
Fax Machine
muhammad shoaib
 
Basics of Image Compression
Punnam Chandar
 
Audio encoding principles
Phillip Doyle
 
6. Data Compression
Napier University
 
Compression basics
Joel Nally
 
Image video processing and canvas element by Abhay Rao
Mrinmay Kulkarni
 
Speech Compression
anithabalaprabhu
 
Basics of audio coding
sakshij91
 
Atm Networks
Rishabh RJ
 
3 multimedia elements - audio
Kelly Bauer
 
Audio compression
Miled Othmen
 
Lecture 8 audio compression
Mr SMAK
 
Basic ISDN
Chris McAndrew
 
Audio compression
Sahil Garg
 
Audio and video compression
neeraj9217
 
Broadband isdn
Er Varun Kumar
 
Audio compression
Madhawa Gunasekara
 
Image compression
Ale Johnsan
 
ISDN Integrated Services Digital Network
Madhumita Tamhane
 
Fax Machine
muhammad shoaib
 
Ad

Similar to Data Compression using Multiple Transformation Techniques for Audio Applications. (20)

PDF
Compression Using Wavelet Transform
CSCJournals
 
PDF
Dk33669673
IJERA Editor
 
PDF
Dk33669673
IJERA Editor
 
PDF
Mk3422222228
IJERA Editor
 
PDF
H010234144
IOSR Journals
 
PDF
Bg044357364
IJERA Editor
 
PPTX
slideshare.pptx
RUDRAGANITHANMAIRUDR
 
PPTX
slideshare.pptx
RUDRAGANITHANMAIRUDR
 
PDF
www.ijerd.com
IJERD Editor
 
PDF
Analysis of PEAQ Model using Wavelet Decomposition Techniques
idescitation
 
PPTX
Image transforms
11mr11mahesh
 
PDF
H0144952
Shetty Brothers
 
PDF
Ecg signal compression for diverse transforms
Alexander Decker
 
PDF
A Review on Image Compression using DCT and DWT
IJSRD
 
PPTX
discrete wavelet transform
piyush_11
 
PDF
A COMPARATIVE STUDY OF IMAGE COMPRESSION ALGORITHMS
Kate Campbell
 
PPTX
Introduction to Wavelet Transform with Applications to DSP
Hicham Berkouk
 
PDF
G0352039045
inventionjournals
 
PPTX
Image TransformsXSsSsSsSsSsSsSsSccd.pptx
rajukolluri
 
PDF
Presentation
Tabish Fawad
 
Compression Using Wavelet Transform
CSCJournals
 
Dk33669673
IJERA Editor
 
Dk33669673
IJERA Editor
 
Mk3422222228
IJERA Editor
 
H010234144
IOSR Journals
 
Bg044357364
IJERA Editor
 
slideshare.pptx
RUDRAGANITHANMAIRUDR
 
slideshare.pptx
RUDRAGANITHANMAIRUDR
 
www.ijerd.com
IJERD Editor
 
Analysis of PEAQ Model using Wavelet Decomposition Techniques
idescitation
 
Image transforms
11mr11mahesh
 
H0144952
Shetty Brothers
 
Ecg signal compression for diverse transforms
Alexander Decker
 
A Review on Image Compression using DCT and DWT
IJSRD
 
discrete wavelet transform
piyush_11
 
A COMPARATIVE STUDY OF IMAGE COMPRESSION ALGORITHMS
Kate Campbell
 
Introduction to Wavelet Transform with Applications to DSP
Hicham Berkouk
 
G0352039045
inventionjournals
 
Image TransformsXSsSsSsSsSsSsSsSccd.pptx
rajukolluri
 
Presentation
Tabish Fawad
 
Ad

More from iosrjce (20)

PDF
An Examination of Effectuation Dimension as Financing Practice of Small and M...
iosrjce
 
PDF
Does Goods and Services Tax (GST) Leads to Indian Economic Development?
iosrjce
 
PDF
Childhood Factors that influence success in later life
iosrjce
 
PDF
Emotional Intelligence and Work Performance Relationship: A Study on Sales Pe...
iosrjce
 
PDF
Customer’s Acceptance of Internet Banking in Dubai
iosrjce
 
PDF
A Study of Employee Satisfaction relating to Job Security & Working Hours amo...
iosrjce
 
PDF
Consumer Perspectives on Brand Preference: A Choice Based Model Approach
iosrjce
 
PDF
Student`S Approach towards Social Network Sites
iosrjce
 
PDF
Broadcast Management in Nigeria: The systems approach as an imperative
iosrjce
 
PDF
A Study on Retailer’s Perception on Soya Products with Special Reference to T...
iosrjce
 
PDF
A Study Factors Influence on Organisation Citizenship Behaviour in Corporate ...
iosrjce
 
PDF
Consumers’ Behaviour on Sony Xperia: A Case Study on Bangladesh
iosrjce
 
PDF
Design of a Balanced Scorecard on Nonprofit Organizations (Study on Yayasan P...
iosrjce
 
PDF
Public Sector Reforms and Outsourcing Services in Nigeria: An Empirical Evalu...
iosrjce
 
PDF
Media Innovations and its Impact on Brand awareness & Consideration
iosrjce
 
PDF
Customer experience in supermarkets and hypermarkets – A comparative study
iosrjce
 
PDF
Social Media and Small Businesses: A Combinational Strategic Approach under t...
iosrjce
 
PDF
Secretarial Performance and the Gender Question (A Study of Selected Tertiary...
iosrjce
 
PDF
Implementation of Quality Management principles at Zimbabwe Open University (...
iosrjce
 
PDF
Organizational Conflicts Management In Selected Organizaions In Lagos State, ...
iosrjce
 
An Examination of Effectuation Dimension as Financing Practice of Small and M...
iosrjce
 
Does Goods and Services Tax (GST) Leads to Indian Economic Development?
iosrjce
 
Childhood Factors that influence success in later life
iosrjce
 
Emotional Intelligence and Work Performance Relationship: A Study on Sales Pe...
iosrjce
 
Customer’s Acceptance of Internet Banking in Dubai
iosrjce
 
A Study of Employee Satisfaction relating to Job Security & Working Hours amo...
iosrjce
 
Consumer Perspectives on Brand Preference: A Choice Based Model Approach
iosrjce
 
Student`S Approach towards Social Network Sites
iosrjce
 
Broadcast Management in Nigeria: The systems approach as an imperative
iosrjce
 
A Study on Retailer’s Perception on Soya Products with Special Reference to T...
iosrjce
 
A Study Factors Influence on Organisation Citizenship Behaviour in Corporate ...
iosrjce
 
Consumers’ Behaviour on Sony Xperia: A Case Study on Bangladesh
iosrjce
 
Design of a Balanced Scorecard on Nonprofit Organizations (Study on Yayasan P...
iosrjce
 
Public Sector Reforms and Outsourcing Services in Nigeria: An Empirical Evalu...
iosrjce
 
Media Innovations and its Impact on Brand awareness & Consideration
iosrjce
 
Customer experience in supermarkets and hypermarkets – A comparative study
iosrjce
 
Social Media and Small Businesses: A Combinational Strategic Approach under t...
iosrjce
 
Secretarial Performance and the Gender Question (A Study of Selected Tertiary...
iosrjce
 
Implementation of Quality Management principles at Zimbabwe Open University (...
iosrjce
 
Organizational Conflicts Management In Selected Organizaions In Lagos State, ...
iosrjce
 

Recently uploaded (20)

PDF
CLIP_Internals_and_Architecture.pdf sdvsdv sdv
JoseLuisCahuanaRamos3
 
PPTX
darshai cross section and river section analysis
muk7971
 
PDF
Module - 4 Machine Learning -22ISE62.pdf
Dr. Shivashankar
 
PPT
FINAL plumbing code for board exam passer
MattKristopherDiaz
 
PPTX
template.pptxr4t5y67yrttttttttttttttttttttttttttttttttttt
SithamparanaathanPir
 
PDF
MODULE-5 notes [BCG402-CG&V] PART-B.pdf
Alvas Institute of Engineering and technology, Moodabidri
 
PPTX
Engineering Quiz ShowEngineering Quiz Show
CalvinLabial
 
PDF
lesson4-occupationalsafetyandhealthohsstandards-240812020130-1a7246d0.pdf
arvingallosa3
 
PPTX
Engineering Quiz ShowEngineering Quiz Show
CalvinLabial
 
PPTX
Precooling and Refrigerated storage.pptx
ThongamSunita
 
PDF
13th International Conference on Artificial Intelligence, Soft Computing (AIS...
ijait
 
PPTX
FSE_LLM4SE1_A Tool for In-depth Analysis of Code Execution Reasoning of Large...
cl144
 
PDF
A Brief Introduction About Robert Paul Hardee
Robert Paul Hardee
 
PPTX
UNIT 1 - INTRODUCTION TO AI and AI tools and basic concept
gokuld13012005
 
PDF
Artificial Neural Network-Types,Perceptron,Problems
Sharmila Chidaravalli
 
PDF
Module - 5 Machine Learning-22ISE62.pdf
Dr. Shivashankar
 
PPTX
Introduction to File Transfer Protocol with commands in FTP
BeulahS2
 
PDF
William Stallings - Foundations of Modern Networking_ SDN, NFV, QoE, IoT, and...
lavanya896395
 
PPTX
Artificial Intelligence jejeiejj3iriejrjifirirjdjeie
VikingsGaming2
 
PDF
FSE-Journal-First-Automated code editing with search-generate-modify.pdf
cl144
 
CLIP_Internals_and_Architecture.pdf sdvsdv sdv
JoseLuisCahuanaRamos3
 
darshai cross section and river section analysis
muk7971
 
Module - 4 Machine Learning -22ISE62.pdf
Dr. Shivashankar
 
FINAL plumbing code for board exam passer
MattKristopherDiaz
 
template.pptxr4t5y67yrttttttttttttttttttttttttttttttttttt
SithamparanaathanPir
 
MODULE-5 notes [BCG402-CG&V] PART-B.pdf
Alvas Institute of Engineering and technology, Moodabidri
 
Engineering Quiz ShowEngineering Quiz Show
CalvinLabial
 
lesson4-occupationalsafetyandhealthohsstandards-240812020130-1a7246d0.pdf
arvingallosa3
 
Engineering Quiz ShowEngineering Quiz Show
CalvinLabial
 
Precooling and Refrigerated storage.pptx
ThongamSunita
 
13th International Conference on Artificial Intelligence, Soft Computing (AIS...
ijait
 
FSE_LLM4SE1_A Tool for In-depth Analysis of Code Execution Reasoning of Large...
cl144
 
A Brief Introduction About Robert Paul Hardee
Robert Paul Hardee
 
UNIT 1 - INTRODUCTION TO AI and AI tools and basic concept
gokuld13012005
 
Artificial Neural Network-Types,Perceptron,Problems
Sharmila Chidaravalli
 
Module - 5 Machine Learning-22ISE62.pdf
Dr. Shivashankar
 
Introduction to File Transfer Protocol with commands in FTP
BeulahS2
 
William Stallings - Foundations of Modern Networking_ SDN, NFV, QoE, IoT, and...
lavanya896395
 
Artificial Intelligence jejeiejj3iriejrjifirirjdjeie
VikingsGaming2
 
FSE-Journal-First-Automated code editing with search-generate-modify.pdf
cl144
 

Data Compression using Multiple Transformation Techniques for Audio Applications.

  • 1. IOSR Journal of Computer Engineering (IOSR-JCE) e-ISSN: 2278-0661,p-ISSN: 2278-8727, Volume 17, Issue 6, Ver. V (Nov – Dec. 2015), PP 79-85 www.iosrjournals.org DOI: 10.9790/0661-17657985 www.iosrjournals.org 79 | Page Data Compression using Multiple Transformation Techniques for Audio Applications. Arashpreet Kaur1 , Rajesh Mehra2 1 (M.E Scholar in Electronics and Communication Engineering, National Institute for technical teachers Training and Research, India) 2 (Associate Professor in Electronics and Communication Engineering, National Institute for Technical Teachers Training and Research, India) Abstract: As Multimedia Technology is growing day by day over past decades, therefore demand for digital information increasing rapidly. This digital information contains multimedia files like image files, audio files that require a large space so no other option than compression. In Compression high input stream of data is converted into small size. Data Compression for audio purposes is a field of digital signal processing that focuses on reducing bit-rate of audio signals to enhance transmission speed and storage capacity of fast developing multimedia by removing large amount of unnecessary duplicate data. The advantages of the compression technique are reduction in storage space, bandwidth, transmission power and energy. This paper is based on transform technology for compression of the audio signal. In this methodology, different transforms such as Discrete Wavelet Transform (DWT) and Discrete Cosine Transform (DCT) are used. Mean compression ratio is calculated for DCT & DWT. Performance measures like peak signal-to-noise ratio (PSNR), signal-to- noise ratio (SNR) & normalized root mean square error (NRMSE) are calculated and then compared. Keywords: Compression Ratio, DCT, DWT, NRMSE, PSNR, SNR. I. Introduction Data compression is a technique in which data content of the input signal to system is compressed so that original signal is obtained as output and unwanted or undesired signals are removed [1]. Audio is an electrical representation of sound within the range of human hearing that specifically lies between 20 Hz to 20 kHz range of frequency which is detectable by the human ear. [2]The concept of Audio Compression is to compress the data in the form of audio so that it occupies less space for storing it. The need for audio compression is to accommodate more data in the available storage area so that the storage capacity can be enhanced. Due to less storage space occupancy, large amounts of data can be placed in the available memory. Therefore less storage room for information inhabitance, large amount of information can be transmitted with less transmission capacity[4] That implies compressed audio signal can be transmitted over the web with less transmission bandwidth at higher speeds. As speed is increased, audio files can be transferred and downloaded over the web faster with higher bit rates. Because of quick downloading and transferring of audio files, time delay is minimized. [1] Signals compression is based on removing the redundancy between adjacent samples and/or between the adjacent cycles. In data compression, it is desired to represent data by as small as possible number of coefficients within an acceptable loss of visual quality. Compression techniques has two main categories: lossless and lossy. Compression methods can be classified into three functional categories:  Direct Methods: The samples of the signal are directly handled to provide compression.  Transformation Methods: such as Fourier Transform (FT), Wavelet Transform (WT), and Discrete Cosine Transform (DCT).  Parameter Extraction Methods: A preprocessor is employed to extract some features that are later used to reconstruct the signal.[2],[5] In this paper audio compression is carried out in two levels. In the first level a transform function (technique) like Discrete cosine transform, discrete wavelet transform are applied on audio signal which gives a result with a new set of data with smaller values. By applying transform technique compression ratio for each transform technique is obtained on different audio samples. Parameters like Signal to noise ratio (SNR), mean square error (MSE) are measured for the reconstructed audio obtained from DCT, WT these transform techniques. Second level is encoding. This step will present data in minimal form by using these encoding techniques. Compression ratios are also calculated.
  • 2. Data Compression using Multiple Transformation Techniques for Audio Applications. DOI: 10.9790/0661-17657985 www.iosrjournals.org 80 | Page II. Transformation Techniques 1. Discrete cosine transform A discrete cosine transform (DCT) communicates a limited succession of information focuses as far as cosine functions wavering at distinctive frequencies[1] DCTs are critical to various applications in science and designing, from lossy compression of sound (e.g. MP3) and pictures (e.g. JPEG) (where little high-recurrence segments can be discarded), to spectral methods for the numerical arrangement of partial differential equations. The utilization of cosine instead of sine functions is critical for compression, since it turns out (as portrayed beneath) that less cosine functions are needed to approximate a typical signal, while for differential equations the cosines express a particular choice of boundary conditions. The DCT is similar to the discrete Fourier transform: it changes a sign or picture from the spatial area to the recurrence space. Discrete cosine transform, linearly transforms information into frequency domain, so that the information can be represented by a set of coefficients. The advantage of discrete cosine transform is that, the vitality of the genuine information may be moved in just a couple low recurrence parts of DCT depending on the correlation present in the data. Equations (1) represents the D.C.T of 1-dimentional arrangement of length N. 𝒚 𝒌 = 𝒘 𝒌 𝒙 𝒏 𝒄𝒐𝒔 𝝅 𝟐𝒏−𝟏 𝒌−𝟏 𝟐𝑵 𝑵 𝒏=𝟏 𝒌 = 𝟏, 𝟐 … … . . 𝑵 (1) Where 𝒘 𝒌 = 𝟏 𝑵 𝒌 = 𝟏 𝟐 𝑵 𝟐 ≤ 𝒌 ≤ 𝑵 (2) N is the length of x, and x and y are the same size. If x is a matrix, DCT transforms its columns. The series is indexed from n = 1 and k = 1 instead of the usual n = 0 and k = 0 because MATLAB vectors run from 1 to N instead of from 0 to N- 1. It often reconstructs a sequence very accurately from only a few DCT coefficients, a useful property for applications requiring data reduction.[12] Wavelets: Wavelet is a part of waveform, whose average energy value approximately tends to zero. Generally, wavelet is a varying window length considered as a part of waveform. Wavelet transform are based on small wavelets with limited duration. Equation (3) and (4) shows mathematical representation of wavelet [6]. Ѱ(t) is a function called mother wavelet. Ѱ(𝐭) 𝒏∞ −∞ 𝒅𝒕 < ∞ (3) Ѱ(𝐭) ∞ −∞ 𝒅𝒕 < ∞ (4) 2. Discrete Wavelet Transform Jean Morlet introduced the idea of wavelet transform in 1982 and provided mathematical tool for seismic wave analysis [1],[4]. A discrete wavelet transform is characterized as a "small wave" that has its energy concentrated in time to give an apparatus to the examination of transient, non-stationary, or time-varying phenomena.[8] It has the oscillating wave like properties additionally can permit concurrent time and recurrence investigation. Wavelet Transform has risen as an effective mathematical tool in numerous regions of science and engineering, although in the field of sound and data compression. [1] Wavelet transform decomposes a signal into a set of basic functions [12]. These basis functions are called “wavelets”. Ѱ 𝒂,𝒃 𝒕 = 𝟏 𝒂 Ѱ 𝐭−𝐛 𝐚 (5) Where „a‟ is the scaling parameter and „b‟ is the shifting parameter. And parameter „b‟ is translation parameter and it determines the time location of the wavelet. DWT decompose signal in to several n levels in different frequency bands. At each step there are two outputs, scaling and wavelet coefficients. The equations (6) and (7) represent the scaling and wavelet coefficients respectively.
  • 3. Data Compression using Multiple Transformation Techniques for Audio Applications. DOI: 10.9790/0661-17657985 www.iosrjournals.org 81 | Page 𝒙𝒋+𝟏 𝒏 = 𝒉 𝟐𝒏 − 𝒊 𝒙𝒋 (𝒏)𝟐𝒏 𝒊=𝟏 (6) 𝒚𝒋+𝟏 𝒏 = 𝒈 𝟐𝒏 − 𝒊 𝒙𝒋 (𝒏)𝟐𝒏 𝒊=𝟏 (7) Wavelet transforms convert a signal into series of wavelets and they give an approach to breaking down waveforms in both frequency and time duration. Wavelet transform are a numerical intends to perform signal investigation when signal frequency varies with time [14], [15]. The Wavelet transform gives the time- frequency representation. That is, wavelet transforms give time and frequency information in the meantime, thus giving a time frequency representation of the signal [13]. The wavelet transform is processed independently for distinctive sections of the time-domain signal at different frequencies. Multi-resolution analysis: investigates the signal at different frequencies giving different resolutions. . MRA is intended to give good time resolution & poor frequency resolution at high frequencies and good frequency resolution & poor time resolution at low frequencies. Wavelet transforms are good for signal having high frequency components for short durations and low frequency components for long duration .e.g. images, video frames and speech signal. III. Implementation Using Matlab MATLAB could be a great tool that is employed to investigate audio signals which are read in wav format. Following commands are used for analyzing the audio signal • wavread: it reads audio signal • windowsize: defines window operate of transformation • wavplay: it produces speech signal once transformation • length: defines length of speech to be processed by • reworking principle dct: performs distinct trigonometric function rework • idct: performs inverse distinct trigonometric function rework[5],[7] This is shown in figure (3) , that is design flow of DCT coder. After getting the reconstructed signal , the audio is further processed for calculating the peak signal-to-noise ratio (PSNR), signal-to-noise ratio (SNR) and normalised root mean square error (NRMSE). These values comparison is shown in table 1. Here figure (1) is the original audio signal which is to be transformed. Figure (2) is the design flow for the DWT .Fig 1: audio signal
  • 4. Data Compression using Multiple Transformation Techniques for Audio Applications. DOI: 10.9790/0661-17657985 www.iosrjournals.org 82 | Page Fig 2 : Design flow of wavelet based audio coder Fig 3: Design flow of DCT audio coder Decoder Wavelet decomposition of frames Computation of variables Decompose into N equal frames Wavelet Compression scheme Performance calculations Play sound Results Expander Compression Choose wavelet function. Psychoacoustic model Quantization Read the wave file and convert it to a vector Choose the block size Define the compression percentages Reshape into matrix form Plot the expanded view, spectrogram of audio signals DCT compression Reshape the matrix and play the reconstructed wave file Performance calculations Results Display
  • 5. Data Compression using Multiple Transformation Techniques for Audio Applications. DOI: 10.9790/0661-17657985 www.iosrjournals.org 83 | Page IV. Output In Matlab The output for the different waveforms is shown below. Figure (4) shows the audio signals in different compression ratios that is audio with different amplitudes. Figure(5) shows the portion of audio signal to be processed. Figure(6) represents the audio spectrograms while in Figure(7) compressed audio output using DWT is shown. Fig 4: Audio signals with different amplitudes Fig 5: Portion of speech signal to be processed
  • 6. Data Compression using Multiple Transformation Techniques for Audio Applications. DOI: 10.9790/0661-17657985 www.iosrjournals.org 84 | Page Fig 6: Audio Spectograms Fig 7: Compression by DWT V. Performance Measures For this audio compression method, based on transform techniques, the performance is measured in terms of Compression ratio, SNR, MSE. 1.Compression ratio: The definition for Compression ratio (C.R) is defined as ratio between Length of original signal and Length of compressed signal C. R = original audio length compressed audio length 2.Signal to noise ratio: It is a measure that compare the level of desired signal to the level of background noise.
  • 7. Data Compression using Multiple Transformation Techniques for Audio Applications. DOI: 10.9790/0661-17657985 www.iosrjournals.org 85 | Page 𝐒𝐍𝐑 = 𝟏𝟎 𝐥𝐨𝐠 𝟏𝟎 𝛔 𝐬 𝟐 𝛔 𝐞 𝟐 𝟐 (8) 𝜎𝑠 2 is the mean square of audio signal and 𝜎𝑒 2 is mean square difference between original and reconstructed audio signal. 3.Normalized Root Mean Square Error (NRMSE): 𝑵𝑹𝑴𝑺𝑬 = (𝒙 𝒏 −𝒙′(𝒏)) 𝟐 𝒏 (𝒙 𝒏 −𝒖 𝒙(𝒏)) 𝟐 𝒏 (9) Here, X(n) is the speech signal, x‟(n) is reconstructed speech signal and μ x(n) is the mean of speech signal. 4.Peak Signal to Noise Ratio (PSNR): 𝑷𝑺𝑵𝑹 = 𝟏𝟎 𝐥𝐨𝐠 𝟏𝟎 𝑵𝑿 𝟐 𝒙−𝒙′ 𝟐 (10) Where N is the length of reconstructed signal, X is the maximum absolute square value of signal x and ||x-x`||2 is the energy of the difference between the original and reconstructed signal. Signal CR MSE SNR(db) PSNR(db) funky 0.2639 0.02990 31.83 45.21 Table 1: Results of DCT based technique in terms of CF, SNR, PSNR & MSE Signal CR MSE SNR(db) PSNR(db) funky 0.0587 0.08 21.02 36.24 Table 2: Results of DWT based technology in terms of CF,SNR,PSNR & MSE VI. Conclusion In this paper a simple DWT & DCT based audio compression schemes are presented. These data compression is done by using MATLAB CODING. From the results shown above it is clear that DWT gives less compression ratio in comparison to DCT, while MSE for DCT is less. SNR and PSNR for DWT is less in comparison to DCT.DWT is better than DCT for audio compression. Here audio is compressed in different factors in case of DCT by 2 , 4 and 8. Acknowledgements I am very thankful to my college and guide Dr. Rajesh Mehra for providing time to time help in studying the topic and providing me a background to understand it in deeper details. References: [1] M. Arora, N. Maurya, P. Pathak, and V. Singh, “SPEECH COMPRESSION ANALYSIS USING MATLAB,” pp. 538–542, 2014. [2] M. V Patil, A. Gupta, A. Varma, and S. Salil, “AUDIO AND SPEECH COMPRESSION USING DCT AND DWT TECHNIQUES,” vol. 2, no. 5, pp. 1712–1719, 2013 [3] J. S. Jacaba, “AUDIO COMPRESSION USING MODIFIED DISCRETE COSINE TRANSFORM : THE MP3,” no. October, 2001. [4] H. S. Malvar, “LOSSLESS AND NEAR-LOSSLESS AUDIO COMPRESSION USING INTEGER- REVERSIBLE MODULATED LAPPED TRANSFORMS.” [5] H. kaur and R. kaur ,D. Cosine, “SPEECH COMPRESSION AND DECOMPRESSION USING DWT AND DCT,” vol. 3, no. August, pp. 1501–1503, 2012 [6] M. Arora and N. Maurya, “AUDIO COMPRESSION IN MPEG TECHNOLOGY PEG IS AN ACRONYM FOR MOVING PICTURE EXERTS,” vol. 3, no. 12, pp. 1–4, 2013. [7] M.Siffuzzaman, M.R.Islam, and M.Z. Ali, “APPLICATIONS OF WAVELET TRANSFORM AND ITS ADVANTAGES COMPARED TO FOURIER TRANSFORM, by Journal of Physical Sciences, Vol. 13, pp:121-134, October-2009. [8] Z. T. Drweesh and L. E. George, “AUDIO COMPRESSION BASED ON DISCRETE COSINE TRANSFORM , RUN LENGTH AND HIGH ORDER SHIFT ENCODING,” vol. 4, no. 1, pp. 45–51, 2014. [9] S. M. Ali, “ INTERNATIONAL JOURNAL OF ADVANCED RESEARCH IN SPEEDING UP AUDIO FRACTAL COMPRESSION ,” vol. 3, no. 6, pp. 86–92, 2013 [10] K.P. Soman, K.I. Ramachandran, “INSIGHT TO WAVELETS” second edition 2005, by Prentice Hall of India. ISBN- 81-203- 2902-3 [11] Raghuveer M. Rao, Ajit S. Bopadikar, “WAVELET TRANSFORMS – INTRODUCTION TO THEORY AND APPLICATIONS”. Pearson Education Asia. 1998 Pearson education, Inc. ISBN:81-7808-251-9. [12] J. James and V. J. Thomas, “AUDIO COMPRESSION USING DCT AND DWT TECHNIQUES,” vol. 4, no. 4, pp. 119–125, 2014.